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Statistical Theory of Multiple Meson Production* 


RICHARD H. MILBURN 


Harvard University, Cambrid; ge, Massachusetts 


* The Fermi statistical theory of multiple meson production is examined, and its application to medium- 

energy pion-nucleon and nucleon-nucleon collisions is discussed. Various approximate calculational pro- 

«cedures are described and their results compared critically. It is found, by the use of a precise integration 

z procedure, that a rigorous relativistic inclusion of the meson rest mass is necessary if one is to draw accurate 

ezm <. conclusions from the postulates of the theory. The consequences of isotopic spin conservation are described 

* and tables are given for the expected distribution of charged products from both pion-nucleon and nucleon- 

nucleon collisions. The theory is applied to the case of pion-nucleon collisions at 1.4 Bev. and is found to 

|@edict only poorly the tentative results of recent experiments. The failure of the theory to predict rucleon- 
niicleon collisions is also described, and possible modifications and improvements are discussed. 
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o I. PREFACE 


INCE the original introduction by Fermi of a sta- 
tistical theory of multiple meson production,! nu- 
merous workers have sought to apply such a theory to 
various sorts of high-energy interactions. Recent experi- 
ments with the Cosmotron at the Brookhaven National 
i «sKaboratory, find impending experiments at the Beva- 
kó tron in Berkelgy have stimulated interest in the applica- 
tion of the theory to nucleon-nucleon and meson- 
nwcleon events occurring at energies of the order of 
one to en billion electron volts. Except for two basic 
papers by Fermi,'? and a paper by Lepore and Stuart,’ 
‘much of this work has appeared only in the form of 
internal or project reports of limited circulation.?-5 
It seems desirable, therefore, to present a summary 

~ describing the present state of the Fermi theory for the 
>m, piunarily, of experimental workers. One must 
- observe in addition, as we shall illustrate, that a strict 
application of Fermi’s simple basic postulates requires 
meticulous attention to detail. Most existing calcula- 
tions have, -of practical necessity, been based upon 


* Supported by thé Qffice-of Naval Research. 
1 E. Fermi, Progr. Theoret. Phys. (Japan) 5, 570 (1950). 
~ 4E. Fermi, Phys. Rev. 92, 452 (1953); Phys. Rev. 93, 1434 
(554). 2 
‘ 3J. V. Lepore and R. N. Start, UCRL-2386, Nov. 17, 1953; 
Phys. Rev. 94, 1724 (1954). Note: In Eq. (33) of UCRL-2386 
© and Eq. (20) of the Phys. Rev. article, the factor (4n—3)! skould 
read (4n—4)! í 
4J. V. Lepere, UCRL-2396, UCRL-2398. ie 
5C. Yang înd ÉK. Christien, Brookhaven Cosmotron Internal 
Report. 7 
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approximations of a somewhat unsatisfactory sort, and 
in general have been directed toward the prediction of 
events occurring at one or another specific primary 
energy. To facilitate and stimulate more exact calcula- 
tions, and applications to a wider variety of problems, 
there is included in the following a discussion of the 
various methods of computation which may be utilized, 
and a comparison of the several sorts of approximation 
which may be made. We shall not discuss, however, the. 
thermodynamic approximation, which applies to events 
of extremely high energy, and which involves theoretical 
methods of a different nature from those useful at the 
lower energies of interest here. This theory has been 
described in detail elsewhere.!.*-$ 


I. INTRODUCTION TO THE THEORY 


We wish to predict the outcome of-nucleon-nucleon 
and meson-nucleon collisions occurring at energies suffi- 
ciently high for meson production Tand other possible 
inelastic processes to become significant. We assume, = 
with Fermi,'® that the two incident particles.ip each = f 
of such collisions coalesce ari that the energy brought=~— ~ 
in by them is released within a common region of inter- z ; 
action. After a time this region of,interaction is"then 
considered to disintegrate intoeone or another of the 


et, aad 
SE. Fermi, Phys. Rev. 81, 683 (1951). ° ə a E 
7R. Marshak, Meson Physics (McGraw-Hill Book Confpan;, 4 


Inc., New York, 1952), B 290 f. a es 4 ae 
3 W. Kraushaar and L. Magks, Phys. Rev. 93, 326 (1954). a77 
° E. Fermi, Elementary Partigles (Yale University Press, New _ 
Haven, 1951), p. 84 ff. zt : é i E 
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possible final states of the system. We further assume 
> the interactions involved are sufficiently strong, 
and that the interaction region survives sufficiently long, 
“fer all of the possible final statés to have become equally 
excited. By thus assuming the attainment of a statistical 
equilibrium among all possible final states we may 
prorate the probability for a given outcome according 
to the number of states embraced by that outcome. 
The ensemble of possible final states is to be selected on 
the basis of those conservation laws which are expected 
to apply during the interaction. Besides the usual laws 
of eliergy, momentum, and charge conservation, one 
may also require vhat the total isotopic spin of the 
system, the number of nucleonic particles, and similar 
variables be subject to appropriate constraints. 

The foregoing assumptions should be phrased in 
formal terms before proceeding with calculations based 
upon the Fermi statistical theory. The transition rate, 
w, from the initial state, |z), of a quantum mechanical 
system to a final state, | /), is given by the customary 
formula 


27 
w=— lll), (2.1) 


where (/]5C|7) is the matrix element of the Hamiltonian 
connecting the two states and where the factor p(W) 
represents the density of final states of the system 
evaluated for a total energy W. In general we shall 
specify the final states in terms of the numbers and 
charges of the nucleons, mesons, and perhaps other 
particles which emerge from the interaction region. 
The matrix element (f|5C|7) vanishes, of course, for 
interactions which do not connect states satisfying the 
laws of energy, momentum, angular momentum, and 
charge conservation. It may also be assumed to vanish 
for other interactions, such as those which, a priori, do 
not conserve isotopic spin. To apply the Fermi assump- 
tion that equilibrium is attained, we consider that the 
interaction 5¢ projects the initial state vector |i) uni- 
formly over the space of all state vectors which are 
compatible with the conservation laws and which repre- 
sent the existence of virtual particles of all possible 
momenta confined to the interaction volume Q. The 
final state vector | f) represents a state containing free 
particles, and will consist of the direct product of inde- 
pendent free-parti ie wave functions. Let us assume 
“ that these individual wave functions are normalized 
“with rrspect to a large spatial volume V. For a total 
of Ņ particles in the fina state the function | f) will 
thus contain the normalizing factor (V)-“/2", On the 
other hand; each of the N-particle virtual states con- 
tained initially within the volume 9, will have a normal- 
= izing factor of the order of (Q)-“*, Upon computing 
ene verlep integrals for each of the N particles the 


Bre 


matrix element becomes 
ba $ P 2 
ce Ys] )=cortQ/7) ey, 


x . A 
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(2.2) 
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i 

where the constant, by the assumption of statistical =o» 
equilibrium, is independent of the final state. Put"more 3 
physically, the transition rate w to a given final state " 
is proportional to (Q/V)*, which is the probability 
that N particles confined to a box of volume V will ý 
simultaneously be observed within a region.c4 volume 
Q. The constant appearing above is usefub only in de- 
termining the over-all transition rate for all types of 


i 


final states. Since the purpose here is to establish_the, y: A 
relative abundance of different final states for a given ae 
sort of interaction, this constant will be neglected in e 


calculations. The only remaining parameter in the 
theory is the interaction volume Q. Fermi assumes that 


Q will be of nucleonic dimensions, say Q= 4/3)rr,3, k 
f 

h: 

t 


where ro is the approximate radius of the meson cloud 
surrounding a nucleon. Also 2 may be made energy q 
dependent, if desired. However, for the moment we 
shall consider Q to be a free parameter of the system. 


F 


Refinements may be made in the above estimate of i 
the matrix element. If 7 of the total of V particles are 
considered to be physically indistinguishable? then only y 


pees 


those wave functions for the final state, | f) arad a 

missible which are completely symmetric (or, appro- days 
priately, antisymmetric) in the coordinates of the, ý : 
identical particles. The degeneracy so introduced is 
reflected by the replacement of |f) with (n) =] f), 


ar 

wherever the former function occurs. Hence the matrix ; 
element becomes, in this case, ts 

ONE Sl 

(seli) =const( ) . : (2.3) 

V (m!)1/2 
Generalizations of this procedure are evident. It should a4 
be remarked that the concept of indistinguishability & 


must be carefully applied. Consider the Case of plonsa-» 
Were one to treat positive pions as being distinguishable 

from negative pions, that is, as being essentially differ- 

ent particles, then one would allow for possible de- ° » 
generacies through dividing the matrix element by 


Cnt)! 


where nt and n~ represent the number of positive and i 
negative pions present in the final state. In this scheme, 
say, a state of one positive pion and one negative pion ¢ 
would be considered to have twice the weight—froga> 
the (2!)— of a state containing two positive pions. If, - 
on the other hand, we wish to impose the requirement 
that the total isotopic spin of the system is conserved 
during the interaction, we.must then censider that 
positive, neutral, and negative pions are three states of 
a single particle. The matrix element for the process 
must then be divided by (x!)!2, where n represents the 
total number of pions present. In this case the weights 
for the various combinations of charges must he estab- 
lished by a more complicated process which will be © 
described in Sec. IV of this paper. 

One may, of course, modify the ma‘rix element in 
still other manners in order to refine the thecry further. 
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“A few such refinements will be discussed at the end of 

J. Sec. IIT. 
othe density of final states, p(W), depends both upon 
the phase space available to the particles in the final 
state of energy IV, and upon the number of possible 
charge arsd.angular momentum states consistent with 
the constrairtts on the system. Henceforth we shall 
consider only situations in which pions and nucleons are 
produced. We shall specify the final state according to 
the number and charge of these particles, irrespective 


on 


p 
. » “of their individual energies, momenta, and angular 
momenta. Since one state for each particle produced 
d corresponds to a volume (27/)* in the phase space of 
hi that particle, V independent particles, of total energy 
ies . . . s 
W, will possess the density of states per unit energy : 
$. c 
Mie > N 
Ni (JaW) nh) | II d’'xid’pi. (2.4) 
zi y vip iml 
Í ý 
Li The spatial, integral for each particle is taken over the 
w large. normalization volume V, while each momentum 
Th 


> 


a 


p > iftegra iarequired to be consistent with a total energy 
Teke W: The relative probability that pions and s nucleons 
wt emerge feom a high-energy interaction will then be 
given by z 


i S'(n,s) = (Q/V)* X (d/dW) (2nh)-2% 


<S 


with N= (n-+s), where it has been assumed for the 
> moment that the pions are all distinguishable. The inte- 

grations abovg must be restricted to conform to the 
’varfous constraints upon the system. The conservation 

of linear momentum will, for example, reduce the num- 

ber of dynamically independent particles from V to 
> (\®=1). Not only will the integrations above be taken 
| over thé phase space of only (N—1) particles, but also 
the matrix element factor (Q/V)%? will become 
* +. (Q/V)- 29 for the same reason. The relative prob- 
abilities then become 


5 FS Stns) =(/V)¥E (d/aW) (eh) 


N 
I] @°x.d°p;, (2.5) 


i=l 


K 


d N-1 
cae x f [ JI @*xid%p:. (2.6) 
y Yw i=l 


A summation symbol has been included in the above 
equations to indicate that a suitable sum must be taken 
over the various possible charge and angular momentum 
States. 3 
' From this point on, the determination of the relative 
probabilities, S(v,s), is largely one of mathematical 
“calculation. We may simplify the problem considerably 
by neglecting entirely the conservation of angular 
momentum. Thf effects, of such a neglect have been 
‘estimated ky Fermi.! The spatial icer on in (2.6) 


| 
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¥ e 
becomes (¥)*~', cancelling a similar factor in the 


matrix element. Thus we find . a 
° N ait ge~ 
S(n,s)=9(2rh) d X (d/dW) | TI d'p: 
wil 
= (2mh) N- I (d/dWyOw (ns). (2.7) 


Here Uw is the volume of the 3¢V—1) dimensional 
momentum space corresponding to the (V—1) inde- 
pendent particles, calculated to be*consistent with the 
law of momentum conservation, and such that the total 
energy of all V particles is less than Ôr equal to WW. 

In Sec. HI we shall discyss various procedures and 
results for the calculation of (d/dW)Uy. In Sec. IV, 
on the other hand, we shall consider the calculation of 
the sum over charge states which is required if we are 
to assume that the total isotopic spin is conserved 
during the interaction. We shall then illustrate the 
application of the theory to experimental situations 
and shall also describe certain modifications which have 
been proposed. 


II. THE PHASE SPACE CALCULATIONS 
e 


The problem is to calculate the quantity Uw as seen 
in Eq. (2.7), and from it dUw/dW, where Uw is the total 
volume of momentum space available to all the inde- 
pendent particles of the system, subject to the law of 
conservation of momentum, and to the requirement 
that the total energy of all particles, independent or not, 
is less than or equal to W. The calculation of this 
quantity becomes very difficult if one tries to treat all 
particles concerned in a proper relativistic manner. To 
date, numerical results have been yielded only by 
treatments in which one or another drastic approxima- 
tion hastbeen made. In the following sections we shall 
summarize and illustrate some of these results. 


A. The General Formula of Lepore and Stuart 


First we shall describe a general formule developed 
by Lepore and Stuart.’ This formula, although resistant 
to numerical calculation in the general case, still repre- 
sents a powerful approach to the problem. The quantity 
of interest in Eq. (2.7) may be written, 


dUw dfn N Pa 
-fi fehe ps) 4 
dW dWlet 1 + 


e 


N > e 4 
KUWE (p2+-M 27], BIA E 
Hi : 


where the integrals are taken over the momentum space 

of all .V particles, with masses M: The units are such 
that c=1. The delta function in the momentum insures 
that momentum will be conserved (effectively xed cing. 
the number of independent int€grations By Se i 
U(x) is a step function equalling unity for 
values of its argument ands o for othet 
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momentum integrals may now bë taken to plus-or-minus 
“infinity. Tke discontinuous functions have the Fourier 
representations ; 


w= ff d3), exp (7a-x), (3.2) 


wie 


and ' 


Us) = (2r) f (e>0). (3.3) 


—»v— ie 


da-a™! Te 


Thus Eq. (3.1) breaks down into a product of integrals 
of similar form, 


dU 1 pora 3 N 
a ERE da- eiw f dL I, (3.4) 
aw Ons... mete 


where 


i Í e a AMA. 85) 


The angular integration may be conducted by taking 2 
to be the polar axis. If in addition one substitutes 
p=M sinhé, Mćı= MA, and a1= Ma, one finds that 


Ir a 


== dep expli 
1 —0 


-exp[i(Ai sinh@—a; cosh6) J. (3.6) 


We may write 
Ai sinh9—ay coshO= — (a;?—),?)!? cosh (0— y), (3.7) 


with cosh y=ay(a@;*—),?)—/. Considering à to be fixed 
and positive, we observe (a’—),")! to be an analytic 
function of the complex variable au, having branch 
points at +À; on the real axis. The function is specified 
completely by requiring that (as°—),2)!? be positive 
for large positive values of a1. In particular (a2—),2)! 
is analytic in a neighborhood of the line Ima= —«, along 
which a will later be integrated. Letting z= (a2—\2)!, 
and requiring that e be infinitesimal, we see that the 
phase of z will be 0, for ai>\1>0; (—ir/2), for 
Ai>ai>—Ai; andie(—ir), for aı<—Àı. We may then 


- write, putting (@—¢)=0', 


“rM: d a. ee 
` f do” -coshé’ 
Àl dÀi (av—A2 12 — 


-expl —i(ar—Ay?)!? cosh] 


»-expl_—iz cosh’ ] 
am r f 
ə 


£ 


l. (3.8) 


w 


MILLBURN 


When the phase of z is either 0 or —ir we may writes 


the integral in terms of Hankel functions!’ 


f d0’-cosh6’ -exp(—iz cosh’) = —rH:® (2); (3.9) 
where the relation 
MO (Ze) 3H, (2), GID 


has been used." Hence, utilizing the recursion formula,” 5 


1 d /H,(Z) HL (2) 
( ) ; (3.11) 
Z dZ Z Li 
we may write 
HM ;(a?—)?)"!2] 
i= 2r°M fa— = (3.12) 


CAE) 


This has been derived only for a lying on those parts of 


the real axis such that |a| >A. However, both £. (3.5)-- 


and Eq. (3.12) represent analytic functions of a. By an 
analytic continuation argument Eq. ($12) can tz 
stated to hold for all a’s lying on Ima=—‘.. Indeed, the 
equation holds for the entire plane provided only that 
one does not cross the branch line between +A. Finally 
we may write that í 


dUw 2 


aw (2r) 


dw 


N w—ie 
-(27?)% TI mef daa -etW 
i=1 


—w—ie 


an NNN 5 
xh = HTM; (3), 
ggr D FOME 


Sai) 


(3:13) 
which is the formula derived by Lepore and Stuart* 


B. Calculation for Zero-Mass Particles 


The general formula just derived is such that the 
remaining integrations are difficult to perform, a prob- 
lem arising from the fact that the points +À are essen- 
tial singularities with respect to the a plane. Results 
may be obtained, however, for the case of fero-mas> 
particles, that is, for the case in which all particles are 
treated as being extreme-relativistic. Following the pro- 
cedure of Lepore and Stuart,’ we expand the Hankel 
function about the origin and keep only the first 
term. Thus, : ‘ 


4i 


HOM: (e-na 
= MEX) 


1 G. Watson, Bessel Functions (Cambridge University Press, 
0. 


Cambridge, England, 1952), p. 18 
See reference 10, p. 75. 

12 See reference 10, p. 74. e 

13 See reference 10, p. 84. 
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Substituting this in Eq. (3.13) we have 


dUwER 23N-3jN poi ” x 
= = : f da-:aN.giaW f eae 
dw RA ee : o (a? —)2)28 
Groans? (4N—4) \((2N— 1) 
eee [(2V—-1)!7 
: i . wie pial 
è a da TOLI) 
C nie QNEN 
or finally, 
dy? m (4N—4)!(2N— 1) 
8 E z —W2N-4, (3.16) 


dW 2% (2N) PGN 4)! 


where both the à and the « integrals are evaluate? by 
the method of residues upon closing the contours above 
the real axis. It must be remembered that a has an 
infinitesimal negative imagifary part. 

It is well to compare this result with that obtained 
originally by Fermi for the same quantity! Fermi 


eS. OT . 
initially weglected momentum conservation and calcu- 


lated the phase space available to N extreme-relativistic 
particles. Tl#is was subject only to the condition that 
the upper bound of the total energy be fixed. Therefore, 


N 
D DiS W. 
i=l 


o 


The integrals may be calculated directly, with the result 


w W—pı 
(Um ” P) Femi = (4a) f pép | pr'dpo:-- 
0 0 
mo e w- E p; 23N 
1 7 
x il PEO LE, (3.17) 
@ ° (3N) 


Fermi {then assumed that momentum conservation 
could be taken into account simply by reducing the 
.numbet of independent particles by unity, that is, by 
letting N—(V—1). Doing this and taking the energy 
derivative one obtains 


a N—-123(N-1) © 


= (25) z Wor, 
F dw Fermi (E= 4) ! 


This has the same energy dependence as the exact value 
in Eq. (3.16), but quite a different coefficient. Table I 
shows thé numerical results for small values of V, and 
arbitrary W. It isevident from this table that the strict 
imposition of momentum conservation has a marked 
effect upon the relative statistical weights for different 


(3.18) 


a 


$ 


Taste I. A comparisoñ ofe statistical weights for massless 
particles. The numbers in brackets show the weights relative to 
the case N=2. 


Number of particles mo 
Method N =2 N=3 N =4 
Fermi 12.6W? 5.26Ws 0.394Ws 0.01009" 
(1) (0.42W3) (0.031W%)* (0.000791) 
Exact 1.57W? 0.237 Ws O.0127W4 0.000218" 
(1) 


(0.18) (0.0081W6) (0.000140) 


(3.5) may be written as ‘ 


Ivn=— (2n/d)(d/dr) f dp 


XexplilAp—alM-+p/2M))} 


= — r? e (2M Ja) e iMi UNa (3.19) 
where the calculation is carried out by completing the 
square in the exponent. The phase space calculation 
then is reduced to the determination of two elementary 
integrals, and yields 

a—ie 


dO wrk 
dW 


= (27) 4 dae**™ . tr f dÀ- MU» r) s 
9 


-nie 


236 8-1) [2233-1 /2 3.4/2 (W— sM) (38/2—5/2) 


(sas 


(3.20) 


TE3(s—1)/2] 


When classical particles of different masses are involved 
the same formula holds, provided one makes the 
replacement 


sM—> M, 


i=l 


(3.21)" 


Me] (M3), 


i=l 
° 
e 


D. Approximate Calculations for 
Nucleon-Nucleon Collisions 


It is of considerable practical interest to examine the 
case in which s nucleons and » pions are emitted. 
Lepore and Stuart? studied this case under the assump- 
tion that the nucleons are classical particles of mass M, 


and the pions are extreme-relativistic particles. Account „ 


was taken of the pion mass „by assuming that tle pion 


total energy has the expression (p+). This sefves tõ 


limit pion production to within energetically feasible 
limits, but is a somewhat dubioys, if necessary, approxi- 
mation. The calculation proceeds as befare, yielding, in 


; multiplicities. process, the integral ey : 
A C. Classical Particles P : <= < 
An exactfcalgzlation may also be made for the cdse 4x f N Oe exp{i°sM/2a},. (3.22) 
of s classical Pa of mass M. The integral in Eq. 0 2 oe : 4 ; 
CC-O. In Public Domain. Gurukul Kangri Collečtionş Haridwar : + = ‘ ` r 
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P (w) 


Fic. 1. Momentum configuration for two particles. 


Although perfectly convergent, because of the in- 
finitəsimal negative imaginary part of œ, this integral 
leads on computation to Fresnel integrals which are 
resistant to further analysis. Lepore and Stuart make 
the approximation that the denominator may be re- 
placed by at”. Thus it follows that 


dU Mel? (Qa 3(s—1)/2 
( ”) = [ar] n [237 
dW 7 approx. (sM)3?2 


[W—sM — np ]ss2t3n-5/2 
T[3(s—1)/2+3n] 


It is seen that this result is essentially the product of 
Eqs. (3.20) and (3.17), where energy conservation 
among the increased number of particles has been taken 
into account through the adoption of a common energy 
factor, and through a suitable change in the argument 
of the gamma-function. Essentially, this approximation 
assumes that momentum conservation holds only for 
the s nucleons, while the » pions are allowed to take on 
any energies consistent with over-all energy conserva- 
tion. If there are a variety of classical particles they may 
again be accounted for by the substitution in Eq. (3.21). 

From the above formula Lepore and Stuart have 
calculated the relative probabilities for various pion 


(3.23) 


‘multiplicities, as obtained by substituting Eq. (3.23) in 


Eq. (2.7) for S(n,s). Their results for several different 
total energies in the center-of-mass system are given in 
Table IT,? where s=2 for all cases. In these calculations 
the interaction volume Q was taken to be, following 
Fermi,! 
Q= (2M/W)(4/3)rR?, R=h/uc=14X 10-8 cm. (3.24) 
This is simply the volume of the nucleonic pion cloud 
TABLE II. The approximate weights for pion multiplicities in 
nucleon-nucleon collisions (reference 3) where s=2=number of 


nucleons emitted, »=number of pions produced, and W=total 
center.of-mass energy. No account is taken of charge states or of 


, particle indistinguishability. 


Å 


n- W= 2.5 Bey 3.9 Bev 4.71 Bev 
a a 
0 0.5846 0.0074 0.0008 

=] 0.4127 0.2302 0.0647 

Pheer S 0.0027 0.3679 0.2957 
EE 0.0000 0.0919 0.2547 
St a £ 0.0050 0.0601 
5 0.0001 0.0046 


SS —— 
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contracted by the appropriate Lorentz factor. The 
variation of multiplicity with energy is quite apparent’ 
in Table II. An accurate allowance for pion momentum 
conservation would probably somewhat reduce the 
above multiplicites. Calculations for the production of 
classical heavy mesons and of nucleon pairs are also to 
be found in the report of Lepore and Stuart. 


E. Step-by-Step Calculations 


However satisfactory or unsatisfactory it may be in 
nucleon-nucleon collisions to require momentum con- 
servation of the heavy particles alone, such an approxi- 
mation clearly will not suffice for meson-nucleon 
collisions. This is particularly true for energies at which 
only one nucleon is expected to be present. Rather than 
attgmpting to solve the problem in general terms by an 
attack on the formula in Eq. (3.13), we may proceed 
in the manner now to be described. 

Consider two particles, having total energies w, w’ 
and momenta p and p’, respectively. Suppose that these 
two particles are combined into a system having a fixed 


Fic. 2. Region of momentum integration. 


na 


total momentum P=p-+-p’. We then asx, what is the 
total volume of momentum space available to one of 
these particles, such that the total energy of the pair, 
w-++w’, is less than or equal to some fixed energy W. The 
momentum configuration may be drawn as shown in 
Fig. 1. Let the vector p be taken to represent the inde- 
pendent particle of the system. Then the momentum 
volume available to this particle will be given by 


p(W) 
Dy, p= On | ap.» f 


0 ww IW 


a oo 


d(cos#). . (3.25) 


Here the upper limit to the integration has been 
indicated as (W), or the momentum that the one 
particle would have were it given tke entire energy of 
the system. In general, momentum conservation forces 
the upper limit to be somewhat less than this, and the 
angular integral will vanist except for certain ranges 
of p. By definition, the integral in Eg. (3.25) is taken 
over values of p and @ such that w! < (W— w), or such 
tkat, equivalently, ~'<[some known function of 
(W—w)]. Now w is a known function èf p, and p?=~" 


Cs 
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ee 


} 
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+ +l’—2pP cosd. Hence, for fixed W and P, the bound- 


ary of the region of integration will be specified by a 
definite equation relating p and x=cos6, from which we 
may calculate the functions p(x) or (p). The region of 
integration is shown in Fig. 2. In practice, calculations 
are easier if one inverts the order of integration, to 
obtain 


PEA 5 1 p(z) 
; Vw.r=2r | de f dp: p. 
oti ʻi 0 


To add a third particle to the system one repeats the 
above process by taking the dependent variables p’ and 
w’ to represent the energy and momentum of the com- 
bined system formed by the first two particles. These 
two variables may be calculated in terms of p and w 
which are now the variables of the independent third 
particle. The total momentum space available to the 
two independent particles of this system is then given by 

e 


(3.26) 


1 p(z) 
Up, p=2n f de f dp: p-U'wrp, 2), p'(p z) (3.27) 
0 


—1 


wher@ Uk is the momentum-space volume previously 
calculated for the first two particles. Here again p(x) 
efiforces th conservation of energy and momentum 
upon the system. In general, p(x) may be calculated by 
taking the first root of the integrand U'w., p. Then U’ 
will vanish when 9’ is too large in relation to w. 
Understoed, of course, in these formulas are W and IP, 
the total energy and net momentum of the entire sys- 
tem. The dynamic complexities of the particles involved 
are entirely contained in the computation of the limit 
p(x). Aside from calculational difficulties, one could 
iterate the above process to include in the system as 
many particles as desired. To deal with a center-of-mass 
system one takes P=0 in the final result, and to estab- 
lish the density of, states at a given energy W one 
differentiates Uw,o with respect to W. 


F. Spécial Cases—Extreme Relativistic Particles 


We shall pow calculate a number of special cases, in 
order of increasing complexity. First, let us reconsider 
the case of extreme relativistic particles in Sec (III.B). 
One may denote the total phase-space volume for 7 such 
narticle>by U(m). Then it would clearly follow that 


; v(1)=0, (3.28) 


since only one particle is involved. To calculate (2) we 
observe,that.the integration limits are determined by 


p+ P2—2pPx=(p')?= (w) = (Wp), 
or ; 
y:— P? 
POs i (3.29) 
of —Px] 


+ which yields 


v= f AOT OW- 3.30) 
ST 
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The first root of this is at P=W. Hence, to find (3), 
we may use the same limit p(x) as found in Eq. (3.29) 
for U(2). Thus, > è ue 


T? F. ‘ 
U(3)=—_fW?2— Pe PLT W?— P]. 
2-6! 


Similarly, 
$ 
3 


3T’ 
U(4) ae ae PVLW?—3 P27), (3.32) 


The corresponding ™momentum-space weight factors are 
then found by taking P=0 and differentiating ‘with 
respect to W. One obtains 


dU (1) 
awe 
d0(2) 
= (4/2), 
dw ) 
(3.33 
dO(3) Tr? 
=. 1, 
dW 2-5! 


and 
d0(4) 3-119? ° 
Ws., 


dW 2-8! 


These results are the same as those given by the general 
formula in Eq. (3.16). 


G. One Mass-N Particle with 
n Extreme-Relativistic Particles 


With but little extra difficulty it is possible to include 
one mass-.V particle in a system with » massless par- 
ticles, a calculation of particular relevance to pion- 
nucleon events. One calculates U () as for the n massless 
particles, and then adds the mass-NV particle and the 
simultaneous requirement that the total momentum 
P=0. In all cases the momentum limit is determined 
from the equation is 


P= (p')?= (w= (Wu) =[W— (NL PPR, (3.34) 


which yields 
p= (W?2-N4)/2W, , (3.35) 


a quantity independent of the variable x. Let the mo- 
mentum volume be denoted by U(1N,n0). Then for 
n=1 we have $ 


‘@ 
` 


1 plz) , 
U(LN 10)=2e | ax f Pap 
-l N 


3 ~ xf W2—N272 
= [— |. (336) 


Sin 


Other results follow similarly. Since the-formiulas ase 


complicated, however, we shall quote only the energy” 
derivatives, which ar> of ‘paramount inéerest. These 
> X mw ~ 


> > 
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include i 
du(1N,10) 7 
—. =V*{1—1" ][1—v*], (3.37) 
dw 2 
dO(1N,20) ald 
——=—— (1— r) (7-43 — 23 v4 
dW 2-5! 1 
1 
—3v+2y8)-4+120r4In-}, (3.38) 
p 
and, 
aO(1N,30) 3x11" 4697 3430 
—— = | 11—196°— yi yê 
aw 2-8! 3 3 
1855 28 7 2 
aL ys——_pl0__p24 p14 
3 3 3 3 


1 
+2804[7-+102+>*]In-}. (3.39) 
v 


Here v=N/W represents the relative effect of the 
mass-V particle over what it would have as a mass-zero 
particle. It is to be noted that the above weights tend to 
their extreme-relativistic values as V—0, and to zero 
as NI’. The complexity of these formulas, and par- 
ticularly the presence of the logarithm, presumably 
reflect the presence of the branch point in the Hankel 
functions in Eq. (3.13). In addition one should observe 
that the particle of mass N has been treated with all 
due relativistic rigor. Numerical calculations with the 
above formulas must be made with considerable care 
since many of the large individual terms cancel one 
another, leaving but a small result. 
These formulas permit one to calculate the relative 
‘probabilities for the production of various numbers of 
mesons in pion-nucleon collisions. We have carried out 
such a calculation for a center-of-mass energy of 1.92 
Bev (corresponding to 1.37 Bev pions in the laboratory 
system) and for a nucleon mass of 0.938 Bev. In this 
case v=0.487. Neglecting the effects of factors arising 
from the consideration of charge states and the identity 
of particles, the relative statistical weights may be 
calculated from-Eq. (2.7). Denoting the relative prob- 
ability for the emission of pions and one nucleon by 
S(n,1), one finds that 


` 


na Q rW? 
p sav=-(— .) —- 07, 
8r*heF 2 


Q NeW 
se,= (=) -—— (1.547), 


c O XexeneJ 2-51 (3.40) 
and . 
— 8G ) ( > ) sr" (0.485) 
i = S(3,1)= “—— (0.485). 
ages Smh’? 2-8! 
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The total energy appearing in Eq. (3.40) is actually 


expressed in momentum units, and the factor-in W 
becomes 
W W Ne 1 


=—. = 3 (3.41) 
2r Nc 2xh vrNn 


Here Ay= (2rh/Nc)=1.210-% cm is approximately 
the extent of the meson cloud around a nucleon. Defin- 
ing further the nucleon volume, 2o= (47/3)Ax?, one’ may 


express the results in terms of the ratio of the interaction _ 


volume 2 to the nucleon volume Qo. Thus 
S(2,1) Q S(3,1) Q 
2.01( ) =0.318 ), 
S(1,1) Qo 


S(2,1) Q 
SBS) 
BEP 


It is quite clear that the interaction volume plays a 
predominant role in the determination of the Tiieson 
multiplicity. 


and 


(3.42) 


er 


l> 
H. Several Particles of Finite Mass 


To consider correctly the case of several particles of 
nonvanishing mass involves one in calculations which, 
while elementary from the mathematical standpoint, 
still offer great practical difficulties. Initially let us 
discuss the case of two particles having masses M and N. 
For zero total momentum the limit function p(x) is 
independent of v and is given by the equation 


P= p?=w?—N?= (W—w)?— N? és 

=(W— (+ ppm, 

or i 
p=1/2W| (W2— M2— N?)?— 4222, (3.43) 

From this we find that . 


a] (W2— M2— N) —4MN 8 
| | . (3.44) 


0(1M,1N)=— 
6 Ww? 


It is to be noticed that this reduces to the corr€ct form; 
Kgs. (3.36) and (3.33), as M and WV tend to zero. Also, 
U=0 when W=M-LN, as is to be expected from energy 
conservation. 

Since we would like to apply our results to the 
multiple production of pions, let us next consider the 
case where two particles (the`pions) have a mass M, 
and one particle (the nucleon) has a mass N. We first 
must derive a formula analogous to Eq. (3.30) for the 


two particles of mass M. ‘The function p(x) is deter- 


mined from 


P+ P?—2P p= p= w'2— M? ant 
=[W— (MWp) pM. (3.45) 


a 


5 
4 


Pe OE Oe 


(2M, 1N) = 


a > 4u? 
Pe 
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If we define the dimensionless quantities 


p z Ie M N 
SS, HEDT moa By FI (: Es 6 
o WwW W Š W : ay) 
we find that - 
p(x) z j 
— [= y(x) e 
Wo 
Š ' r 1—Bx? e 4y? 1/2 0-8] 
=ø (1- = ) | . (3.47) 
r 1—68 1—8 2[1—62x?] 


From this one may calculate, by elementary methods, 
that 


rW? 4u? \ 3/2 
0(2M)=——(1 #( = ) . (3.48) 
6. 1—6? A 


When P=0 this volume vanishes for M= (1/2)W, as 
is to be expected. To include the third particle, of mass 
N, the new limit function is determined from the first 
root, of U(2M) above. Expressed in terms of the vari- 
ables.zu* and p’, this integrand is 


. mw)’ /24 6 4M2 3/2 
ViGM)=—s |: | 1- = ] . (3.49) 


w2]; SA o 
Eoo 


Hence tlfè condition for p(x) is, with (P=0), 
4M?=w"— p= [W— (N+ py? P— pt, 


j+y* 2 1/2 
pE) 
2 


é 


(3.50) 
giving 


(3.51) 


= 
From this we may write the desired three-particle 
momentum volumé for zero net momentum, and total 
energy W. It is 


27W 


ylz) 
| dy- y [1 (+y) "] 


dp) ChE 


AER 
3/2 

=e a (3.52) 
EGEA = 
This quantity resists ready calculation in closed form. 
Although. one is tempted to assume that in practical 
cases the relative pion mass, g, will be small with respect 
to unity, and to expand’the integral in Eq. (3.52) to the 
first order in „°, such a procedure turns out to be quite 
inaccurate. Indeed, this approximation will overesti- 
mate the effect of the pion mass by a factor of nearly 
two. To see the actual effect of the pion mass, the above 
quantity, or rather its more interesting energy deriya- 
tive, may bÈ caftulated numerically from the equivalent 


s > 
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d0(2M,1N) (19) 7/42 
Sa A (4/3) [ du: (u— 1)" u 
dW ; ” 


[0+ 4) — 4718 
{(1- + — 2v- y) 
~4y°u (1+ 2y?— 2v4-+-4 8) 
+16u4?(1-+2v?-+6r') 
—64p'(2+4y)4+256u%u'}. (3.53) 


The calculations were performed for a total energy of 
1.920 Bev (laboratory piongof 1.37 Bev), with a nucleon 
mass of 0.938 Bev (v=0.487), and a pion mass of 
0.136 Bev (u=0.071). Using Simpson’s method, except 
at the upper limit in Eq. (3.53) where the irrational 
singularity is calculated in closed form, we find the 


result to be 

dU(2M,1N) mW’ 
——=—— (1.162). 
dw 2-5! 


(3.54) 


The exact one-pion, one-nucleon weight may also be 
readily found from Eq. (3.44) and is® 
dU(iM,1N) rW? 


dW 2 


(0.714). (3.55) 


The inclusion of the pion mass has thus reduced the 
one-pion weight in Eq. (3.37) by a factor of 0.99, and 
the two-pion weight in Eq. (3.38) by a factor of 0.75. 
Hence the ratio of the relative production probabilities 


becomes 
S(2,1)| 9 
—- =1.54 (-) s 
S(1,1)| Exact Qo 


The exact calculation of the relative probability for the 
production of three or more pions would be very com- 
plex. It is of interest, therefore, to compare the results 
of the exact calculations above with those for zero pion 
mass as seen in Eqs. (3.37), (3.38), and (3.39), under 
the simple assumption that the total energy available 
is merely reduced by the total mass of the produced 
pions. Thus, the total energy of 1.920 Bev is reduced 
to 1.784 Bev for one pion, to 1.648 Bev for two pions, 
and to 1.512 Bev for three pions. These give y=0.525 
for one pion, v=0.569 for two pions, and »=0.619 for 


(3.56) 


three pions. Upon direct calculation from Eqs: 43.37), : 
(3.38), and (3.39) the relative probabilities become ~ 


S(2,1) Q ` 
oe =0550(—), 
S(1,1) Approx. Qo a 
and " We 
S(3,1) 
S(2,1) Avorn, a a 3 
> > S 3 = 
kai x msi Bs en S 
» 
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Tt is evident from the disparity between Eq. (3.56) and 
Eq. (3.57) that the simpler approximations do not 
suffice to give results which do justice to the theory. 
Stace just such an approximation was made in deriving 

= Eq. (3.23). for the nucleon-nucleon case, one may per- 
haps infer that more exact calculations are also neces- 
sary here. It has been reported that Yang and Christian 
at Brookhaven have made such calculations by a 
numerical integration of Eq. (3.1). 


I. Elaborations of the Fermi Theory 


Clearly, the preceding calculations do not represent 
the ultimate in deductions from the Fermi theory. First 
of all, angular momentum conservation was grossly 
neglected in deriving Eq. (2.7) from Eq. (2.6). It would 
evidently be desirable to have this procedure checked 
by a rigorous calculation, based, perhaps, upon the 
insertion of a suitable additional delta function in 
Eq. (3.1). An attempt of the author’s in this direction 
has met with little success in the face of mathematical 
complications. 

Another sort of refinement has been suggested by 
Lepore, Stuart, and Neuman,‘ and is based upon the 
requirement of the Lorentz covariance for the inter- 
acting system. Each particle of the system may be 
described by a space-time four-vector 


i= (ix 1%2,1%3,cl), 
and by an energy-momentum four-vector 
U= (ipnipz ips W) G 


From these one may form a skew-symmetric tensor of 
rank two 


Lij viuj sai = — Lji, (3.58) 


which is the general angular momentum tensor of the 
„particle. In the absence of an external interaction the 
sum of these tensors taken for all the particles of the 
system will be conserved. Therefore, 


a Db. Ly = const, (3.59) 


where the index s represents the particle taken in the 
sum. This equation, for i and j=1, 2, 3, represents the 
law of conservation of angular momentum as discussed 
above. The (74) component written in vector form is 


< Z. (KOW — ps) = const. (3.60) 


Using the fact that in a particular coordinate system 
` the tjme factor for all particles will be the same, and 


_ ~ =that the total momentum “will be conserved, one deduces 


yr? 


a law for the conservation of the center of energy 


(x)= EOW N/E. W). (3.61) 


oS THis reduces to the usual center-of-mass conservation 
-lawin the nonrelativistic limit. 
~- Jn addition, Lepore, Stuart, and Neuman modify the 
C "Lepore, Neuman, and Stuart, Phys. Rev. 94, 788 (1954). 
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matrix element in Eq. (2.3) to allow for the fact that 


the higher the energy of a virtual particle createq- 


momentarily within the interaction volume, the more 
closely confined that particle will be to the point of 
origin. This property is included numerically in the form 


of a Gaussian weighting factor 
expl —W 2x2/h?c?]. (3.62) 


The following modified formula is then used ‘for the 
statistical weight: 


S(N)= (22h) 30—-) 


N 
xII | faxi f p: expl— Wese/iee | 


i=l 


XS (Zp) (W-Z W) xW :/W). 


Upon calculation of the Gaussian spatial integral one 


derives 
2N—3r3(N—1)/2 W 3 yN d*p;h* 
Oe 
(2rh) N-DN32 LA J i [p2+M 2B? i 


Xê (p) ( WEW). (3.64) 


(3.63) 


S(N) 


The most striking effect of these new requirements is 
the factor 


Wop?2+M2)3", (3.65) 


included in the momentum integral for each particle. 
This represents a reduction in the phase space available 
to high-momentum particles, and corresponds to the 
reduced overlap between the higher energy bound vir- 
tual particles and the corresponding free particles. One 
would expect these factors taken together to favor the 
production of more low-energy particles and fewer higt- 
energy particles, that is, to increase the multiplicity. 
The originators of this modification to the Fermi theory 
have not yet published any numerical results.“ Since 
their approach would seem to yield an increase in the 
expected multiplicities, while previous refinements of 
the momentum-space integrations have tended to de- 
crease these multiplicities, their results will be of con- 
siderable interest. We shall further discuss these various 
calculations in Sec. V. 


oz = 


IV. CONSERVATION OF CHARGE AND ISOTOPIC SPIN 


It has been suggested by Fermi? that, besides charge 
conservation, one might also impose isotopic spin con- 
servation upon the interacting pion-nucleon system. 
The background relating to this restriction has been 
thoroughly discussed in the literature We shall here 
describe only its application to multiple meson produc- 
tion. For simplicity we shall describe the theory as 
applied to pion-nucleon collisions. Following this we 

16 Henley, Ruderman, and Steinberger, “Reactions oi pi-mesons 


with nucleons,” in Annual Revi i nford, 
California, 1953), Vol. 3, pp. 138 arene Sane 


oot 
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e 
shall append a tabulation of Fermi’s earlier results for 
nucleon-nucleon collisions. 

The behavior of the isotopic spin of a system of 
particles is conveniently described in the language of a 
fully equivalent system of angular momenta. Thus a 
nucleon, with respect to its charge state, is considered 
to behave like a particle of total “charge” angular 
momentum, or isospin, one-half. The two possible pro- 
jections of ‘this charge vector along a given axis in 
charge space represent the two possible charge states 
of a nucleon, that is, a proton or a neutron. Similarly 
a pion, which can have positive, zero, or negative 
charge states, is represented by a total isospin of unity. 
One may summarize this correspondence as 


Nucleons: Isospin T¥=1/2; 
State of T..=4+1/2 isa proton ($) 
TY =—1/2 is a neutron (n) 


Pions: Isospin 77=1; State of T, "=+ 1 is a xt 
9 T7= Oisa 7 
T7=—1 isar. 


e 
if a number of pions and nucleons are combined to form 
a joint system, the total isospin vector is formed in the 
same way that one forms the total angular momentum 
of a numbé& of particles. The resultant isospin is thus 
the vector sum of the isospins of the component par- 
ticles, and may be written 


T=; T®. 


The total charge of the composite system is related to 
the z component of T by 


Q=+e[T:+(1/2)s], 


where s is the number of nucleons present. Charge 
<conservatior#is equivalent to the requirement that T: 
be a good quantum number during the interaction. 
Isotopic spin conservation similarly is equivalent to the 
requirement that the total isospin T be a good quantum 
number. . 

We shall illustrate this with the case of a system 


. formed from a negative pion and either a neutron or a 


proton. Only two resultant total isospins are possible 
in this case, T=1/2 and T=3/2. Because of charge 
conservation we are only interested in states having 

z= -3/2, for interactions with neutrons, and tes 
1/2, for interactions with protons. Henceforth K 
shall adopt the notation (T,M) for states having ge 
isospin T and charge component T,= M; and also, for 
example, the notation (p++ — 0) for a state contain 
one proton, two positive, one neutral, and one rn iv 
pion. The orderirg of the pions 1s of no significance here. 
In order to relate the description in terms of s 
with that in terms of the charges of the two particles 


ipti i ic spin calculations for 
16 A thorough description of the one E 


nucleon-nucleon collisions is containe E or Benoem ri 

i . Yodh from a course in ^ Y 
“avian ce BeOS ES per at the University of Chisago 
during the ffl off 953. 
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Taste IIL. Clebsch-Gordon coefficients for a Pion-nucleon 
system (al*other entries are zero). 


(Mi, Ala) (TM): (3/2, —3/2) (3/2, ~1/2) (1/2, =1/2) 
(p+) 1/2 1 : =. 
(p0) 1/2 0 : 
(p—) 1/2 -1 v (1/3)  — v (2/3) 
asy —1/2 1 
(nO) —1/2 0 + / (2/3) v (1/3) 
(w—) —1/2 1 


one must know the transformation coefficients connect- 
ing the two representations. These coefficients are just 
the Clebsch-Gordon, or vector-addition, coefficients, 
and may be computed from formulas arising in 
the theory of atomic spectra.!7 Using the notation 
(T1,T2,M1,M2|T1,T2,7,M), the coefficients of interest 
in the present case are those with T;=1/2 and T,=1. 
Here M, represents the charge component of the nu- 
cleon, and M» that of the pion. These coefficients are 
given in Table III. From this table one may see immedi- 
ately that a system composed of a negative pion and 
a neutron will surely be in a state T=3/2, M= —3/2, 
or (3/2, —3/2), for short. A negative pion and a proton, 
on the other hand, will, with a probability of 1/3, be 
found in a combined state (3/2, =1/2) and with a 
probability of 2/3 in (1/2, — 1/2). Assuming charge and 
isospin conservation, only these three states will be 
needed for the following computations. Conversely, 
one may also deduce from this table that a system 
composed of one nucleon and one meson, known to be 
in a state (3/2, — 1/2), will, with a probability of 1/3 
be observed as a (p—), and with a probability of 2/3 as 
a (n0). If one assumes that the (x--p) interaction cross 
section is the same for both possible states of isospin, 
then final states of T=3/2 and T=1/2 will be formed 
with probabilities 1/3 and 2/3, respectively. The prob- 
ability of observing (p—) will then be: (1/3-1/3) 
+ (2/3-2/3)=5/9, and of observing (z0): (1/3-2/3) 
+ (2/3-1/3)=4/9. The two possible one-pion final 
states thus will be observed in the ratio (p—): (n0) 
=5:4. Actually the assumption of equal cress sections 
is a rather dubious one in the light of evidence pertain- 
ing to medium energy pion scattering, which suggests 
the existence of a resonance or near-resonance in the 
T=3/2 state.'*'8 For negative pions colliding on neu- 
trons no charge exchange scattering is possible, and the 
(n—) final state will be produced with unit probability, 
assuming no other pions are produced. 


When one admits the possibility of producing more 


than one pion, the situation becomes a litth more 
complicated. With two pions and one nucléon, for 
example, the state (3/2, —3/2) can be formed in one 
way when the total isospin of the two pions is equal 
to two (Y2,=2), and another when 7'3,=1. Following 


11 E. U. Condon and G. H. Shortley, Theory of Atomic Spectra 
(Cambridge University Press, Cambridge, England, IIN Pale 
g y3. Rev. 1, 
(1953). ee es 


18 Anderson, Fermi, Martjn, and Nagle, 
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TABLE IV. The statistical weights for the products of (#~-nucleon) collisions. For (x*-nucleon) collisions exchange (+) for (—) and 
(p) for (n) everywhere, The numbe‘s in brackets represent the total statistical weight of the state under which they are entered, 


New. Pions Final Initial: (3/2, —3/2) (3/2, 1/2) (1/2, —1/2) (a-n) (x--p) 
ae) 1/3 78 l "i 
(n—) 1 
n0) 2/3 1/3 ef 
xa (1) (1) (1) (1) (1), 
2 eae 4/5 4/5 
s, oap : 14/15 2/3 34/45 
(10) 6/5 6/5 
(n00) 4/15 1/3 14/45 
(n+—) 12/15 3/3 42/45 
Total (2) (2) (2) (2) (2) 
j= 8/5 8/5 
3 aie? 4 6/5 6/5 18/15 
(p00—) 5/5 4/5 13/15 
(n+——) ° 10/5 10/5 
n00—) 7/5 7/5 M 
n+0—) 12/5 9/5 30/15 
n000) 2/5 1/5 f 4/15 
Total =- (5) (5) (4) (5) (13/3) 
4 +-->) 80/35 _ 80/35 
Ge =) 88/35 88/35 
(p+0——) 152/35 192/60 752/210 
(p000—) 44/35 48/60 200/210 
(n+0——) 200/35 200/35 "AR 
(n000—) 52/35 52/35 uae’: 
(n++——) 87/35 123/60 461/210 
(n--00—) 129/35 165/60 _ 643/210 
(nG000) 8/35 12/60 “44/210 
Total (12) (12 (9) (12) (10) 


the spirit of Fermi’s statistical theory we assume that 
these two possibilities are equally excited, and that this 
degeneracy provides the state (3/2, —3/2) with the 
over-all weight 2. To proceed with the calculations one 
must first relate the various two-pion states according 
to their total isospin T2,, and total charge component 
M2,. This relation is provided by the set of coefficients, 
(1, 1; Mı, M2|1,1; Tor, Moz), evaluated for Lin =P, íl 
0. Only the intensities are of interest. They are, writing 
(T2,,Me;), given compactly by 


(2,2)= (++) (2,1) = (+0) 
(2,0) =1/3(+—)+-2/3 (00) (2, —1)=(0-) 
(2; —2)=(=—-) (1,1) = (+0) 
(1,0)=(+—) (1, —1)= (0-) 


(0,0)=2/3(+—)+1/3 (00). 


Once in possession of these two-pion states one may 
add the nucleon as before by utilizing the coefficients 
(Tom,1/2; Mor, My) T2:,1/2; T,M). Thus one finds di- 
rectly, for example, that a system in the state 
* (3/2, =3/2) will form two-pion, one-nucleon states 
with he relative weights for (6-—): 4/5 and for 


. (n0—): 6/5. The sum of these two weights is just 2, 
œ representing the over-all degeneracy of the (3/2, —3/2) 


state. 

One may proceed in ‘this way to build up the relative 
weights for as many pions and nucleons as desired. 
Results are given in Table IV for (7—-nucleon) collisions 
which producé one nucleon and up to four pions. The 
(x--p) events „are calculated, assuming 


weights for 


{ae a6 c 


e 
= A 
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equal cross sections for the two possible charge states. 
Weights for one charge state or the other may be found 
directly from the appropriate column. One should also 
note that as a result of charge symmetry the table will 
also give the weights for (x*-nucleon) collisions. One 
need only exchange everywhere in Table IV + for —, 
and # for n. 

These weights include the over-all contribution result- 
ing from the degeneracy of the charge states. The total 
charge space weights are written in brackets. In practice 
these factors will be nullified by the reduction in the 
over-all weight arising from the indistinguishakility of 
the pions. 

Fermi’ has made similar calculations for the products 
of nucleon-nucleon collisions. These are given in Table 
V. Two protons are necessarily in the isospin state 1, 1, 


two neutrons in the state 1, —1, and a neutron and a . 


proton in either 1, 0 or 0, 0. In the last case, provided 
the interaction cross sections are equal for T=1 and 
T=0, one may assume that the interacting n-p systera 
is equally divided between the 1, 0 and 0, 0 states. The 
results in the table are for n-p and p-p collisions. Again, 
one may obtain 7-n results from the p-p results by 
everywhere exchanging + for =- and’ p for n. 


V. DISCUSSION OF RESULTS 
A. Application of the Fermi Theory to x -p 
Events at 1.37 Bev 
We shall illustrate, finally, the application of the 
Fermi theory to a particular case which, corresponds to 


va 


cen ag? 
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the conditions of some recent experiments at the 
Brookhaven Cosmotron. Negative pions are incident on 
protons with a laboratory energy of 1.37 Bev, corre- 
sponding to a center-of-mass energy of 1.92 Bev. We 
have already calculated the phase-space contributions 
to the statistical weights at this energy. For zero-mass 
pions we found, in Eq. (3.42), that 
` S(2,1)/S(1,1) = 2.04 (2/2), 

and, . 
S(3,1)/S(2,1)=0.32(9/2). (3.42) 
An exac calculation taking into account the pion mass 
gave, however, 


In the absence of similar calculations for the three 
meson case we may estimate approximately that a 

CS (3,1)/S (2,1) Jexat= 0.2 (2/2). (5.1) 
These weights must be adjusted for the degeneracy of 
states in charge space (Table IV), and for the fact that 
the pions are identical particles as seen in Eq. (2.3). For 
gne ‘pion produced this factor is just 1/1!=1. For two 
pions it is 2/2!=1, and for three pions (13/3)/3!=0.72. 
We finally obtain the corrected weights 


#+5'¢(2,1)/Se(1,1) = 1.54 (0/0), 

and (5.2) 
Se(3,1)/S-(2,1) =0.14(2/2). 

TABLE ¥. Statistical weights for the products of (nucleon- 


nucleon) collisions (after Fermi). For m-n collisions use p-p 
results exchanging + for — and $ for n everywhere. 


Plows Final Initial: (1,1) (1,0) (0,0) (p-b) (n-p) 
IEn 
n 
~m Total * (1) O O @ (1) 
1 0) ° 1/2 1/2 
es 3/2 3/2 
a (pp—) 1/2 1/3 5/12 
(py0) 2/2 1/3 8/12 
(nn-+) 1/23 5/12 
Total (2) (2) 0 (2) G/2) 
2 ney. 6/5 6/5 
(3500) 2/5 2/5 
n+0 9/5 9/5 
(pn+0) Ae Ap 
ena) 4/5 1/3 17/30 
=. (pn00) 3/5 1⁄3 14/30 
(pn+—) 9/5 3/3 42/30 
(nn-+0) w Go OS a r 
Total 4 ) 
= 154/60 154/60 
Sain 78/60 18/60 
(pn++—) a` 175/60 esi 
(pn-+00) a 127/60 121/60 
(nn++0) 72/60 n 22/ Bath 
(pp00—) 9/10 2/5 B 
(pp-+——J > 12/10 3/5 A 
(n000) 6/10 1/5 8/10 
(pn+0—) 42/10 9/5 D 
Stir) ie Ae 18/10 
ere 0 6 @ ©) GA 
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Tape VI. Relative production probabilities for 1, 2, and 3 
v 


pions in x~-p collisions at 1.37 Bev. 


Case A (2 =M) 


Case B (Q =0.782s) 


~ §(1,1)=0.36 
5(2,1)=0.36 
53,1) =0.08 


S(3,1) 30.07 


“Case C (20.500) 
S(1,1I)=0.55 a 
S(2,1)=0.42 
S5(3,1)=0.03 


no 


Taste VII. Distribution of secondary products in 
x -p collisions at 1.37 Bev. 


Secondary Case A Case B Case C 
products (2 =N) (Q =0.782) (Q =0.5%5) 
(p—) 0.20 024 0.31 
(n0) 0.16 0.18 0.24 
(p0O—) 0.21 + 0.19 0.16 
(n00) 0.09 0.08 0.07 
(n+—) 0.26 0.24 0.19 
(p+——) 0.02 0.02 0.01 
(p00—) 0.02 » 0.02 0.01 
(n+0—) 0.04 0.03 0.01 
(n000) 0.00 0.00 0.00 


Taste VIII. Distribution of charged secondary products * 
in x~-p collisions at 1.37 Bev. 


Charged Case A Case B e, Cas C Experi- 

secondaries (R=R) (2=0.78Ns) NR =0.52) ment™ 
(m4 p)elas 0.27 0.32 0.45 0.11 
(mH P)inel 0.30 0.29 0.25 0.35 
(rtr) 0.40 0.36 0.29 0.50 
(rt+2r+ p) 0.03 0.03 0.01 0.04 


Assuming that no more than three pions are produced 
in significant quantities, and making three possible 
choices for the interaction volume 9, we find the nor- 
malized relative probabilities listed in Table VI. Case B, 
Q=0.780%, assumes the interaction volume is that of the 
proton, Lorentz-contracted to allow for its motion in 
the center-of-mass system. 

In terms of the above, and the charge distribution 
given in Table IV, we may predict the over-all distribu- 
tion of secondary particles shown in Table VJI. 

Of somewhat greater experimental interest is a table 
in which only the observable charged secondaries are 
entered. Elastic scatterings are frequently distinguish- 
able experimentally and hence are separated in Table 
VIII. Included are some tentative experimental re- 
sults from photographs of 147 events in a diffusion 
cloud chamber filled with hydrogen at high pressure.” 

The fit of theory to experiment is not very good for 
the theory predicts manya more incoherent "elastic 


1 Eisberg, Fowler, Lea, Shepard, Shutt, Thorndike and Whitte- 
more, Phys. Rev. in press. Of the observed events 31 weve iden- 
tifiable only as inelastic collisions with two emerging charged 
prongs. The assignment of these events to the categories in Table 
VIII according to the theoretical weights quoted there for Case B 
was suggested by J. C. Street. The separation of the 147 eyents 
into 95 inelastic and 52 elastic cases, and the division of the elastic 


events into 40 coherent (diffraction) and 12 incoiferent scatterings ef 
e 


are as quoted by Eisberg ef a?. The Fermi theory does not inclu 
diffraction scattering. a 2 s 
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(x--p) events than appear tc occur. The experimental 
uncertainties in the data are considerable, however, and 
the‘fractioning off of 80 percent of the elastic scatterings 
as coherent may turn out to have been too arbitrary. 
Of course the theory was also oversimplified by neglect- 
ing angular momentum and other refinements in the 
calculation of the. phase space factor. A somewhat 
different fit can be obtained if only (3/2, —1/2) states 
- are permitted to occur. 


B. Nucleon-Nucleon Collisions 


Unfortunately the fit of theory with experiment ob- 
tained in the last’ section deteriorates further when 
nucleon-nucleon collisions are considered. In a recent 
experiment to determine the meson production in (n-p) 
and (p-p) collisions at Cosmotron energies (1.0-2.2 
Bey), it was found that double-pion production oc- 
curred much more frequently than could be accounted 
for by a simple statistical theory. For example, neutrons 
of a median energy 1.7 Bev (total center-of-mass energy 
2.6 Bev) produced two pions about 2.2 times as often 
as one pion. A Fermi theory calculation by Yang and 
Christian predicted this ratio to be about 1/11 (cf. 
Table II). Even admitting many refinements it would 
seem to be difficult to reconcile these two figures. To ex- 
plain this sort of result Peaslee has advanced a theory in 
which the two colliding nucleons become excited, 
separate, and finally, when some distance apart, decay 
to produce one, and perhaps more pions.” The inter- 
mediate, excited-nucleon states were considered to be 


2 Fowler, Shutt, Thorndike, and Whittemore, preprint, Brook- 
haven National Laboratory, Upton, Long Island; Phys. Rev. 
95, 1026 (1954). 

2 D, Peaslee, Phys. Rev. 94, 1085 (1954). 
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states of angular momentum and isospin each equal 
to 3/2, and excitation energy about 160 Mev, to corre- 
spond to the apparent m+-p resonance peak observed 
in scattering experiments. Peaslee found that such 4 
hypothesis would suffice to give the anomalously large 
two-pion production observed experimentally. His 
theory also suggests that when the produced particles 
are (pn-+—) there should be a correlation between the 
decay planes of the m~ and the n, arising from the ex- 
cited neutron of charge state (3/2, — 3/2), and a similar 
correlation for the m+ and the p, of charge’ state 
(3/2, 3/2). Such a correlation was indeed indicated, if 
not assured, by the experimental data. A preliminary 
analysis of p-p collisions led the same authors?° to 
conclude that the Fermi theory is again inadequate to 
predict the observed interactions, while the Peaslee 
model offers a much closer fit. 


VI. CONCLUSION 


Pion-nucleon and nucleon-nucleon experiments con- 
ducted in the energy range of 5-10 Bev would shed 
great light upon the problem of multiple meson produc- 
tion. For the pion-nucleon case at such energies the 
Fermi theory predicts multiplicities of sufficient mag- 
nitude to give the one parameter fit, inconciusive at 1.4 
Bev, a severe test, and may enable one to determine 
whether any particular charge states are formed prefer- 
entially. For the nucleon-nucleon case one would per- 
haps expect, following Peaslee’s approach, that excited 
nucleons would be created which are capable of decaying 
into two or more pions. If so, then one would hope to 
be able to compare these excited nucleons with similar 
excited states formed in pion-nucleon collisions. Con- 
clusions must await the acquisition of experimental 
data of suitable quality and quantity. > 
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- Absorption and Velocity of Sound in Vapors 


> E. G. RICHARDSON 


King’s College, Newcastle upon Tyne, England 


> I. INTRODUCTION 


HE admirable review by Markham, Beyer, and 
5 Lindsay,!* entitled Absorption of Sounds in 
Fluids} gives experimental results mainly on liquids. 


” Since dispersion is scarcely established in liquids, these 


authors perhaps rightly surmised that there was not 
much to be gained by including velocity measurements 
in their survey. 

It is the object of the present author to supplement 
the first review by describing techniques used in sttdy- 
ing the propagation of ultrasonics (mainly) and sound 
in gases and vapors, the results attained and to say 
something of the light which they throw on other 
processes and applications in which vapors may be in- 
volved. ‘Since the prewar data are fairly completely 
documented elsewhere this survey will only cover the 
past fifteen years. Moreover, as the theory of relaxation 
is well covefd in the earlier Review, it is hoped that it 
will be unnecessary to reproduce it here in full. 


Analysis of Results 


® 

Apart from intrinsic interest in the results, or for the 
purposes outlined in the final section of this report, 
workers in this subject have been interested in the 
equation of state of a vapor, to determine virial coeffi- 
cients, for instance, or thermodynamical relationships 
connected therewith, particularly at low temperatures 
or near the critical state; or else they have been looking 
for single or multiple relaxations of the molecule, con- 
nected with a lag in the transfer of translation into other 
forms of energy, vibrational or rotational. Fortunately, 
in many vapors, it is the former type which occurs and 
‘that monotonically, so that the theory proposed by 
Henry,? Kneser,? and others can be applied. 

Elaborate and alternative forms of the single relaxa- 
tion hypothesis are to be found in the paper’ already 
Cited. It will be sufficient for our purposes to state the 


` results- to be expected from this hypothesis; thus: 


If p is the pressure on the vapor, p the density, w the 
pulsatance (w=2nf) and r the relaxation time, Cp and 
C, the ustal specific heats, Cy’ the complex specific heat 
appropriate to this case is defined by 


~ Cr =C: $C. (1+ ter), (1) 


wherein C; is the part associated with translation, Cr, 
h vibrations (or rotations). The 


* References w. 


il] be found in Sec. XI, Bibliography, at the end 
of the paper. } 
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velocity of sound is given by 


$ CoC ww? 7? 
PRC C ; 
z O E 


(2) 


and the ratio of specific heats peculiar to sound propa- 
gation 


(3) 


When Vo is the low-frequency velocity, Va the high- 
frequency velocity and the mid-frequency value {Vo 
+4(V.2—Vo?)} occurs at a frequency fm, the relation 
between V and f is 


VEVE Eh fm)? 
ee , (4) 
Vie—VE 14+(f/ fn}? 
also 
He AG 
1 core ae (5) 
Qqrr Cy 


If » the (amplitude) absorption per wavelength is 
plotted against log/, a symmetrical curve is obtained, 
of which the maximum gm is given approximately by 


a R(C,—C) Ve 
AO ean 
2 Ci(R+C)) Vo 


If a/f*,a being the absorption per unit length, is 
plotted against logf, the curve has the shape of the 
velocity dispersion reversed reaching the classical value 
(by interpolation of the viscosity coefficient) at high 
frequencies and lying 24m/Vofm higher at zero frequency. 

As Herzfeld‘ has shown this quantity can also be 
expressed as 


(6) 


(7) 


If Vo is not determined, it may be deduced from an 
assumed equation of state. 


I. INTERFEROMETERS À 2 
Theory 


The standard apparatus for yltrasonic work in gases 
remains the Pierce*® variable path interferometer, in 
which the load on a quartz or other transducer due to 
a gas column of varied length is made to influence the 


anode current in the electronic tube ‘coupled to these” 


transducer. Writing the particle velocity in the (plane) 
% ae > > 
3 4 E NA 
a» 
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progressive and retrogressive waves in the column 
(extending along x from 0 to J) as 
f= Be azeite pa, 
kanad . 
and 
a= Cerzei(ott ss) 


\B=2n/d), (8) 


with $o as that at the quartz, we may deduce the pres- 
sure amplitude by multiplying by the characteristic 
impedance (pV) of the gas. Following Hubbard® we can 
divide it into two parts, one dependent on and one 
independent of frequency, thus, 


po= pV (EP+wiQ). (9) 
The factors are given by conditions at the source face i.e. 


sinh2al 


== (10) 
cosh2al— cos28l 
sin26l 
Q = —. (10b) 
cosh2al— cos268l 


5 
At a resonance, where 67 is zero we find 


sinh2al 


Pmax=pV bo (11a) 


cosh2al—1_ 


The minimum value occurs for a further increase of 
length of \/4, to 7’, and is 


sinh2al’ 
e (11b) 
cosh2al’—1 


The difference of these values may be read off from the 
driving circuit to give us the value of a. 

Hardy’ has pointed out that if the reflector is used 
in a region so far from the source that à/4 is a small 
fraction of / the difference between the two values of 
pressure amplitude given in Eq. (11) allows simpli- 
cation to 


2pVEo 
Ap= A 
sinh2al 


(12) 


- (A discarded term involves a)? as coefficient. The latter 
quantity is negligible in pr:ctical ultrasonics.) 

The reaction on the driving circuit causes a propor- 
tionate change in grid voltage as the path is lengthened. 
This again results in a. proportionate change in mean 
anode current, J, the quantity that is measured. When 
l is large, we can, in fact, consider the resistance of the 
source'to-be constant. Thus if AJ, AZ, are the differ- 


“ences between the respective values of these quantities 


at resonance and antiresonance, respectively, for two 
; P ag o 
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distances /, and J,, 

Apa sinh2al, 
Ape 

Al 

log. —= 2a (1,—1,). 

Al, 


3 
sinh2al, 
or 


(13) 


This suggests a simple way of deriving a from a series 
of resonant and antiresonant peaks in the anode cur- 
rent as the reflector is moved. Correction may of course 
still be needed for diffraction effects. j 

Following this idea, many investigators have sought 
to determine a from a straight line plot of AZ against 
but the method is not to be adopted without considera- 
tion of the conditions under which the operation of the 
interferometer satisfies the underlying assumptions of 
these equations (vide infra). 

Before enunciating these, let us apply Eq. (8) toa 
fixed column interferometer of resonant length in which 
the standing wave ratio pmax/Pmin is determined by a 
suitable detector, in place of observations oy the 
coupling circuit as / is changed. f 

Adding the particle velocities given by Eq. (8) with 
the condition that at the reflector x=], £=0, we have 


(14) 


Omitting the time factor and writing A= Ce?! we ob- 
tain, for the velocity amplitude at the location t, 


(2) v= 2A?[cosh{2a(x—1)} —cos{26(a—J)}]. (15) 


In a vapor a is usually small compared to £ so that the 
maxima and minima in Eq. (15), as the location % of 
the detector is varied, are 24 cosh{a(«—J)} and 
2A sinh{a(«—/)} ; whence « and £ can be determined 


E=CeletiBI[ elatiB (aD — ¢~(ot iB)(2—D pint, 


Conditions under which « can be determined 
from the Locus of Peaks ‘ 


Hardy’ himself has stated the conditions under which 
Eq. (13) is valid to be (a) 2un> 2.5, where p is the ab- 
sorption per wavelength and v the number of wave- 
lengths elapsed between the source and the commence- 
ment of measurement as] increases, and (b) AZ//I<0.03. 
These conditions have been verified for a capillary tube 
interferometer by Lawley.!2 When either of these condi- 
tions is unfulfilled the graph of logAZ against J is not a 
straight line. i 

Another condition, difficult to fulfil at low frequencies, 
is that the sound waves in the tube shall be plane. If 
the tube is wide enough, relative to the source diameter, 
for a diffraction pattern to be set ‘up in front of the 
latter, one must consider how the wave form may 
thereby be changed. Near the source, considered as a 
disk of radius a, we have the F resnel region of optics, in 
which the intensity fluctuates cyclically along the axis; 
further out at a distance greater than about 2a/A, the 
Fraunhofer region is found in which thd central beam 
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is fringed by “wings,” the first minimum lying on a 


cone of solid angle (0.61)?n)2/q2, 

It is evident that such diffraction, since it is 
more or less present, will spoil the effects of molecular 
absorption such as may be taking place in the mec 
In a truly plane progressive wave such as occurs in a 
pipe for-wavelengths large compared to the diameter 
the amplitude will be given by &e-«*, but with diffrac- 
tion, We’ must make the propagation constant complex 
Born® has attempted to introduce such a d Z 
factor,mto the equations of diffraction. 

He shows that it is still possible to derive æ from two 
axial measurements of pressure amplitude pa and p, 
provided the stations are outside the Fresnel regime. 
For example, at x and x+6x the ratio = 


always 


amping 


B 
sin~{4/ (a?+-a*) —a} 3 
pa 2 
ak ) 16) 
Po f i (16) 
sin-(V/ (a+ Getd2)))—(e-+8x))) 
whence: 
phat pa sin g/(x+ôx) 
ax log—- (17) 
e pe — sing/x 


if x is large compared to a and q is written for ma?/2). 

As far as the author knows, however, this formula 
has not been applied by any experimenter. When dif- 
fraction Would take part of the central orb outside the 
reflector, a loss correction for this cause based on simple 
geometry, and under the assumption that all the energy 
lies within the angle previously cited, is usually applied. 

Even when diffraction loss is negligible another diffi- 
culty may be experienced in the form of satellite peaks 
alongside themain ones as the column length is varied. 
‘These may persist even after adjustment to parallelism 
of source and reflector. The origin of these was first 
suggested by Jatkar,® a further theory was given by 
Krasnooshkin,!” and an experimental confirmation by 
Bell." The latter has shown that radial vibrations may 


- be set up in the interferometer tube, especially if the 


supposed piston-like motion of the source should fail. 
A characteristic property of these waves is the dis- 
persion of their velocity propagation. Thus 


LOA =N /{1—N/N°) 


Here X is the wavelength for plane waves and ào is a 
function of the tube diameter d, and the mode: 


see s o= Trd/buN- (19) 


Here buy is the Meh root of the first derivative of the 
Bessel function of the first kind of order M. For complex 
modes dary is large and Xp is small. When =o, A, is 
infinite (at this wavelength the phase velocity is infinite 
and energy can no longer be propagated in the mode). 


Xo is known as the cut-off wavelength for the particular 
a 


mode. : į 


(18) 
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In most experimental cases investigated \<<Xo. Then 
Eq. (18) approximates to 


Nae 


: ; (20) 

AN 2NF 
If satellites are radial mode resonances their wave- 
lengths must follow this law. ; 

By finding the positions of satellites in a hydrogen 
filled interferometer (its tube 1-in. in diameter and a 
source of 250 kc/s X-cut quartz crystal), Bell" was able 
to measure the satellite wavelengths and identify them 
as cross modes in these formulas. í 

When these cross modes are present the observed 
absorption will be in excess of the true absorption of the 
gas by an amount due to the interference between 
different modes. The latter will be especially important 
when the former is small, as in monatomic gases. 


Pierce Types 


These follow broadly the design of their originator. 
The main requirements are the provision of a geod 
screw, means of adjusting the source and reflector 
parallel, a container for the gas capable of withstanding 
the pressure to be applied, and of being maintained at 
the requisite temperature. 


Double Crystal Types 


Two crystals which are exactly matched as to fre- 
quency face each other across an intervening column of 
gas. The length / is varied by moving one of the crystals. 
A voltage of fixed amplitude is applied to one. Then the 
voltage developed at the other is measured and is found 
to pass through maxima and minima, as the separation 
changes, proportional to sinhal and coshal, whence 
again a and £. 


Capillary Tube Types 


These have been used (particularly by Lawley") 
partly in an attempt to overcome some of the difficulties 
mentioned above as to deviation from plane waves and 
partly to test the validity of the formula of Kirchhoff”? 
which applies in this case. The velocity of propagation 
is (slightly) reduced below the free-space value, V: 


TE 


The absorption coefficient in the tube of radius r is 
a 


a 
1 fav} 
(8 am =) -. 
r¥N\2 
<. 


On the other hand, in unrestricted space for plane waves 
> a ` 


(21) 


_ (22) 
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100 
fh Mc/s atmos 


Fic. 1. Square of sound velocity (continuous line) and absorp- 
tion per wavelength (broken line) plotted against parameter fre- 
quency/pressure in oxygen (Thaler). 


where v is the kinematic viscosity of the fluid, and 7’ is 
the thermal diffusivity. 

Lawley,” Weston and Campbell“ have verified the 
validity of this formula, using gases of known viscosity 
in tubes down to 1 mm diameter. The sources were 
nickel needles maintained in magnetostriction oscilla- 
tion and the reflectors brass pistons moved along 
the tube. 

With such frequencies and slightly wider tubes there 
should be two sources of attenuation, given by Eqs. (22) 
and (23). Lawley has shown that the two effects are 
additive. 


Hot-Wire Types 


The hot-wire interferometer as usually practiced is 
an example of a fixed-path instrument to which Eq. 
(15) applies. It has been used by the author and a 
colleague on vapors.!®> Where a nickel or platinum wire 

_ of about 0.001 in. in diameter is set parallel to the wave 
front it may have its change of resistance directly 
proportioned to the velocity amplitude in a sound wave. 
From a plot of maximal and minimal readings in the 
vapor as the hot wire is moved between fixed reflector 
and source, set at a separation to produce standing 
Waves, a and 8 may be obtained. 


il. OTHER METHODS 


It is possible, in theory, to make absorption measure- 
ments in progressive (either spherical or plane) waves 
if reflection from the walls and the end of the container 

-is prévented by layers of. absorbent material. At one 


- time the radiation pressure on a vane, set at various 


distarces from the source, was much used for this 
purpose. Now it is felt that its presence is liable to 
refarn energy to the source and/or disturb the wave 
front. Thus it'is less used than formerly. 

‘The @ of resonators filled with the gas have been 
measured mostly at sonic fequencies (by Tucker,!6 
r 2 


o £ 2G © 


f . 
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Oberst,” and others) with a view to deriving the ve. 
locity of sound and absorption in the gas. 

Reverberation times in chambers filled with vapor 
have also been determined at sonic frequencies (Knud- 
sen!) to get the absorption coefficient in the gas, after 
allowing for that due to the walls. 

Pulse methods are hardly used in vapors except at 
low pressures because the usually small impedance of 
the gas and long ringing time of the quartz requires 
very long columns and short-excitation times. 

The method of passing a beam of light athwart a 
glass-sided column containing the fluid conveying plane 
progressive waves, was frequently used at one time in 
liquids especially for velocity. But this method had 
received little attention by those interested in gases, 
presumably because it requires high intensity at the 
source to obtain a sufficient change of density in the 
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a V(n/s) 


400 


Yo Mc/s atmos 


Fic. 2. Sound velocity and absorption per wavelength plotted 
against frequency/pressure in hydrogen (Stewart). 


sound fronts to diffract the light. Bommel" howeVér 
was able to apply the method. mee 
IV. RESULTS AT VARYING PRESSURE ; 

When the rise in velocity and peak in absorption 
per wavelength, characteristic of a fluid having a single 
relaxation and occurring at the relaxation pulsatance, 
were first observed in carbon dioxide, it became ap- 
parent from the work of Richards and Reid,” and 
Railston and Richardson” at varying pressure in this 
gas that it was possible to bring all results ont6 a singie 
curve, if they were plotted against the parameter fre- 
quency/pressure. a 

It has indeed become common practice to assume 
that this parameter will be relevant to the results in 
any vapor and accordingly to make a series of measure- 
ments at one or two frequencies over a range of pressure 
going down (usually) below atmospheric. Examples of 
the use of this procedure will be found in Figs. 1-4. 


Air 
From the work of Schmidtmiiller, gases of the atmos- 
phere show no dispersion at audio-freeuercies or low 
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e 
ultrasonic frequencies. However, recent results of 
Thaler” show dry, CO.-free oxygen to havea relaxation 
at f/p=60 Mc/s atmos (Fig. 1). In nitrogen (Zmuda?*) 
this effect occurs at 80 Mc/s atmos. Hydrogen and deu- 
terium (Stewart and Stewart’ and Rhodes’) exhibit a 
similar effect at about 15 Mc/s atmos (Fig. 2). 

Theseaye ascribed to a relaxation in the passage of 
translationa} into rotational energy. The specific heat 
of vibrations in these gases would come into the picture 
only at low sonic—almosy infrasonic —frequencies. 

ih 
Rare Gases 

These show no dispersion which can be ascribed to 
relaxation and their absorption is classical, i.e., that 
given by Eq. (23). 

When the pressure on a gas is greatly reduced, say to 
a few cm Hg the attenuation of a sound wave becomes 
very high. This in turn makes measurements of velocity 
in an interferometer very difficult and rather uncertain, 


0-001 “00d 035 OF ve 3 7 7) G 
A Mc/s atmos 


Fic. 3. Abs@rption per wavelength plotted against frequency/ 
{Bressure in carbon dioxide, carbon disulfide, and ethylene oxide 
(Angona). A 


fer reaction on the source fails after a few peaks and 
one is wily-nily working very close to the source. Abbey 
and Barlow,” using a loud-speaker fed from a 1000 cps 


* tone and asmicrophone in a tube, measured velocity to 


a few mm Hg in the gases of air, carbon dioxide, and 
methane. A tube correction based on Eq. (21) must be 
applied and, unfortunately, has the greatest effect at 
iow préssures. Without it, there was no significant drift 
of V with f, but after correction V* increased in all 
tases as the pressure was reduced below 10 cm Hg. 
(The term V* indicates velocity reduced to 0 C where 
Charles’ law is assumed.) Similar experiments have been 
made by Maulard” and Caro and Martin. Though the 
former supported Abbéy and Barlow, Caro and Martin 
with a somewhat more refined apparatus were unable to 
confirm this résult (in air). They applied a tube correc- 
tion 10 percent in excess of the Kirchhoff value (on the 
basis of the estimates of previous workers). They found 
the corrected velocity to agree with the accepted value 


Z a elocity and 
for normai pressure. Zartman recorded v y 
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absorption in air, nitrogen, hydrogen, “and carbon 
dioxide to 4.5 mm Hg and frequencies } to 2 Mc/sec. 

For a sufficiently high value of the product of sound 
frequency and kinematic viscosity v, dispersion- is 
possible even in a monatomic gas, where relaxations are 
not observed. The measured velocity is found to rise 
with a “Reynolds number of vibrations,” a nondimen- 
sional parameter defined as Vi?/wx Results m helium 
have been obtained by Greenspan” at 1 Mc/s and 
pressures to 0.1 mm Hg (to vary v) when the sound 
frequency is only about twice the molecular collision 
rate. Boyer” reported increases in velocity in argon, 
nitrogen, oxygen, and air at very low pressures where 
the wavelength approached the medn free path. 

Parker, Adams, and Stayeseth* measured absorption 
in argon, nitrogen, and oxygen from 1 to 10 mm Hg 
pressure. They used pulses from sources of about 60 
kc/s to give a range of f/p of 5 to 50 Mc/s atmos. In all 
three «/ f? was proportional to f/p but the factor was 
greater than the classical in the air gases. 


Triatomic Molecules 


The notable dispersion of velocity in carbon dioxide 
at 20 kc/s at atmospheric pressure was already noted 
by Pierce.® Since then it has been confirmed as a func- 
tion of f/p by many workers. This is also true of 
nitrous oxide. In carbon disulfide vapor the change in 
velocity is not so marked. In both this and in ethylene 
oxide,** plots of u against log f/p give bell-shaped curves 
corresponding to single relaxation frequencies (for vi- 
bration) of 465 ke/s. (Fig. 3). COS has a relaxation at 
300 ke/s; SO, at 1 Mc/s (Fricke®® and Petralia’). 


Polyatomic Molecules (Organic Vapors) 


Ammonia 


Steil’ found a small dispersion of velocity in this gas 
from 429 to 431 m/s. This has been confirmed by 
Ozdogan“? in the region log f/p 2 to 3.8. 


al 


> 


| 
Zo 
S 
wen 
G | 
i? 
ž v8 re 
Rog ffp McA atmos | 33° eee 
> = 
Fic. 4. Specific heats of vibration plotted against frequency/ 
pressure in carbon tetrachloride and chloroform (Fagg, Han 
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Ethyl Ether 


Slight dispersion (V*, 317 to 325 m/s) is evident, 
a/ f? is about 20 times classical, and the maximum ab- 
sorption is at f/p=1. (Matta and Richardson,” 
Nomoto,“ and McConvrey et al). 

Propylene 


Telfair“ found no dispersion in this vapor at 12 Mc/s. 


Ethylene Oxide 


Angona’s® u values are shown in Fig. 3. 


Dichlorethylene 


This occurs in two forms, i.e., cis and trans. In both V 
and a have been measured by Sette, Busala, and 
Hubbard.“ ‘There is dispersion at {/p=2.27 and 10.5 
Mc/atmos. In both C,/R falls from 7.5 at 1 Mc/s to 
3.5 at 100 Mc/s. 


Ethylene Chloride and Ethylidene Chloride 


Rao‘ has found that these vapors have slight dis- 
persion at f/p 100 and 60, respectively. 


Ethyl Alcohol 
There is no dispersion to 1.8 Mc/s (V*=262 m/s) 
(Railston‘*), 
Ethyl Todide 


Here a slight dispersion (V* 151 to 154 m/s) is 
evident (Railston‘’), 


Ethyl Formate X 


Pusat‘ has measured velocities in this vapor and 
reports a relaxation of 210-8 sec. -o 


Methyl Alcohol 


No dispersion is evident. (Railston‘*), 


Acetaldehyde `* 


A number of workers, including Alexander and 
Lambert’ and Matta and Richardson,!* have reported 
a marked dispersion in this vapor (V* 243 to 252 m/s). 
Furthermore the niaximum y at f/p=0.3 Mc/s atmos. 


Benzene 
- 


€ 
Recently this vapor has been thoroughly tested by 
+ Lambert and Rowlinson and Cheng. Its velocity, 
reduced to 0°C, rises from 181.0 to 197.0 m/s with a 
maximum of absorption, u=3, at f/p=9 Mc/s. Cheng 
determined a by peak width at 30°C. He tried two re. 
laxatigqns at 4.4 and at 13 Mc/s atmos, but the fit of the 
~~..experiments® tò- a single one at 9 Mc/s atmos looks 
equally good. The other measurements were near 100°C. 
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In benzene a is 600 times classical at 100 kc/s and 
23 times at 700 kc/s (Railston‘®). 


Methane 


This vapor was shown by Quigley"! to have a velocity 
decreasing linearly with temperature below. 0°C, at 
atmospheric pressure. Eucken and Aybar,* and also 
Telfair and Pielemeier® found, at temperatures near 
100°C, a dispersion region about 300 kc/s corresponding 
to a relaxation of about 810-7 sec. 


Pentane and Iso-Pentane 


Here there is a gradual rise in V as the pressure is 
increased to 100 kg/cm? (Kling, Nicolini, and Tinsot®), 


c Carbon Tetrachloride and Chloroform 


In this instance CCl, at 80°C showed <1 m/s and 
CHCl;<2 m/s dispersion at atmospheric pressure, ac- 
cording to Railston® (to 700 kc/s). Jatkar®® found 
similar results in CCl, at 97°C and CHCl; at 134°C. 
Fogg, Hanks, and Lambert** found, however, at 160°C 
(varying the pressure), a change in velocity in CCl, in 
the neighborhood of 2 Mc/s atmos. Hence they calculate 
the specific heats of vibration and compare’ these with 
values derived from spectroscopic data (Fig. 4). At 
30°C Hubbard, Busala, and Sette5” find these relaxa- 
tions at 23.3 (CCl,) and 27.3 (CHCl). 


Halogen-M ethane Derivatives 


The velocities in a number of less common halogen- 
substituted methanes have been examined by the last 
two triumvirates. In CH;Cl a rise, near 1 Mc/ S, occurs 
at 100°C and at 30°C. In methylene chloride (CHCl) 
the Oxford group find a single step at 2.5 Mc/s atmes 
but the Washington group claim a second rise at 105 
Mc/s atmos. Methyl chloride, bromide, and iodide have 
single relaxations from 0.2 Mc/s atmos upwards; like- 
wise, methyl fluoride CH3F at 0.05, fluoroform CHF; at 
0.30, methylene fluoride CHF, at 2, carbon letrafluoride 
CF, at 0.3 Mc/s, all at or near 100°C. Byers*’ has values 
of u in CF; at 22°C which agree with his velocity results 
to give a maximum at 0.48 Mc/s atmos. Sulfur hexa- 
fluoride, according to O’Connor,®® has a relaxation 
which, at a temperature of 36°C, is located at 5.89 10-7 
sec as determined from velocity measurements at pres- 
sures from atmospheric downwards. When 0.066 mole 
fraction of helium was added this time was reduced 
to 5.46 10-7 sec. PE 

The freons can be regarded as substituted methanes, 
containing both chlorine and fluorine, which have low 
velocities combined with high absorptions and so have 
some technical importance (vide infra). Miyahara 
and the author are experimenting on these. Freon-12 
(CClF2) has dispersion in the neighborhood of 1 
Mg/s atmos where its absorption coefficient is as high 
as 0.04 (u=0.2). EAG 
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ThExelocities in certain orga A ) i 

s Xelocities ganie vgpors will be found 
listed, un? the papers of Jatkar.***These have not been 
include h RpPhecquase “thé range of frequency was in- 
sufficient fo “determine whether dispersion existed and 
the pressure remained atmospheric. 


V. RESULTS AT VARIOUS TEMPERATURES 


When temperature is varied, even on one of the more 
perfect gases, it is difficult to correlate the results with 
any chosen parameter. Mostly the attempt has been 
made with carbon dioxide. The only results which 
cover a sufficient range of frequency are those of 
Penman® and Warner. However, these are in dis- 
agreement, as to the range of V:T, with the later ones 
of Overbeck and Kendall” who suspect that the earlier 
samples of gas were not free from impurity. At about 
100 ke/s, the dispersive region, the velocity curve passes 
through the same drop with rising temperature as with 
rising pressure. 

It is natural to attempt to plot against the parameter 
//T, T being the absolute temperature, but the result 
4s tot very happy. Using Penman’s and Warner’s re- 
sults, onè can only say that the frequency corresponding 
to the middle of the downcast increases with the tem- 
perature ir®linear fashion. 

Little work has been done at high temperatures in 
the neighborhood of 1000°C. Only that of Sherratt and 
Griffiths® on carbon monoxide can be cited and they 
used onl¥ two frequencies in the kilocycle gamut. The 
major difficulty is that the source, magnetostrictive in 
their case, must be kept cool so that it is almost im- 
possible to avoid temperature gradients in the working 
section; further attempts should be made. 

Most of the work at very low temperatures has 
keen done lw van Itterbeek and his colleagues. With 
Mariens* he made measurements in carbon dioxide, 
with van Paémal® in oxygen and deuterium, with van 
Doninck® in mixtures of hydrogen, nitrogen, oxygen, 
carbon, monoxide, and helium, in carbon monoxide and 
deuterium,®’ and with Lauwers® in ammonia at various 
. pressures. Most of these results are on velocity and they 
have been used to estimate the second virial coefficients 
in the equation of state at low frequency. (See Sec. 
VIII). A linear relationship between V and p is also 
found by Schneider and Thiessen® in helium. 


° VI. MIXTURES OF VAPORS 


Although it was known in a qualitative fashion that 
the humidity of the atmosphere affected the attenua- 
tion of*Sound therein, it was Knudsen who precisely 
investigated the phenomenon at sonic frequencies in his 
reverberation chamber. Pielemeier” and Mokhtar and 
Richardson” »pursued the effect to ultrasonic fre- 
quencies. Other adulterdnts, e.g., organic vane a 
oxygen) increase the attenuation even when i 4 
concentrations and usually, if the main gas Don 
maximum» of absorption, this is pushed to higher tr 


quencies by fie addition. 
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Possibly because thére is no fundamental explanation 
of the effect, mixtures have been studied less frequently 
of late, although van, Itterbeek and van Doninck’? have 
studied mixtures of diatomic gases at low temperatures. 
Angona® worked with mixtures of the threé vapors 
which he had studied separately. Sette and Hubbard,” 
studying the effect of impurities on the-dispersion curve 
of carbon dioxide, found differences between light and 
heavy water vapor. (See also Pielemeier and Byers,” 
Knétzel,“ and Petralia™’). 

It should be noted that in none of these cases is a 
chemical reaction concerned. However, Richards and 
Reid” have studied the dissociation of nitrogen tetrox- 
ide in the equilibrium N.,O,=2NO, by ultrasonic 
methods. This was done in the hope of being able to 
sort out the relaxing effect we have been discussing from 
a purely chemical one. Both these lead to the same 
ultrasonic effects, viz., increase of velocity and absorp- 
tion. These writers, contrary to Teeter,” think that it 
should be possible, at least in gases for which the heat 
of dissociation is large, to disentangle the two effects. 
This would be particularly true if readings in the 
dispersion range were made at various pressures and 
temperature. 


` 


VII. RESULTS NEAR THE CRITICAL STATE 


After the construction and testing of the pioneer high- 
pressure interferometer of Hodge and Hubbard,’ 
Herget,” and Noury® measured carbon dioxide and 
ethylene velocities at one frequency (266 kc/sec) to the 
critical pressure (76 and 52 atmos, respectively) and 
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Fic. 5. Sound velocity plotřed against pressure for various tei- 
peratures in carborsdioxide (Parbrook and Richardson). 
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Fic. 6. Absorption divided by square of frequency against 
Pressure at critical temperature in carbon dioxide (Parbrook and 
Richardson). 


found the sharp minimum characteristic of this point. 
In the last few years both V and a have been determined 
by Schneider,®! Anderson and Delsasso,® Parbrook and 
Richardson,® and Teilsch and Tanseberger.* In these 
gases the last authors used three frequencies and ex- 
tended the range of pressure into the liquid phase. At 
pressures above that corresponding to f/ p=0.1 there 
is no dispersion of the velocity, but the minimum just 
referred to is displaced with temperature (Fig. 5). 

By selecting published values of the product pv (v 
being the specific volume) which represents the square 
of the isothermal velocity and dividing into the square 
of the measured (adiabatic) velocity, it is possible to 
exhibit deviations from the perfect gas law. At low 
pressures this quotient is the ratio of specific heats for 
these gases. (See Table I). 

Schneider first noted a very high value of the ab- 
sorption at and near the critical point, which he ascribed 
to a relaxation. However, if one tries parameters of aw 
and f, it proves to be a/ f? which is relevant to the vapor 
phase, as usual, whereas a/ f brings the author’s results 
into coincidence on the liquid side of transition (Fig. 6). 
This strongly suggests, by its form, a scattering mecha- 
nism sich as one observes in the passage of light through 


TABLE I. Values of ratio of specific heats. 


pid Pressure (atmos.) 

30 40 50 60 70 80 
1.30 1.32 133 1.35 145 1.66 
131 1.32 135 140 168 278 
1.32 134 1.37 1.50 2.68 5.45 
1.30 131° 1.35 1.43 2.15 3.30 
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a suspension of globular entities of size comparable to 
the wavelength of the radiation. 

Schneider has studied other two-phase systems, 
Sulfur hexafluoride (/-=45.6°C, p-=36 atmos) at 600 
kc/s shows very well the sharp inflections—in opposite 
senses—as the temperature rises along an igohar (Fig. 
7). With Chynoweth® he measured both V and «æ 
in zenon. eck: 

It may be noted that other critical transitions are 
found in helium at temperatures of a few degrees abso- 
lute. For example, when the gas turns to liquid Hel and 
again in the transition Hel to Hell (liquid), Atkins and 
Chase,®” van Itterbeek,’§ Lacam,® and Noury” find a 
continuous rise of velocity in argon (at 950 kc/s) and 
nitrogen along isothermals. In methane, however, 
Lacam” notes a dispersion at high pressures—he goes to 
900‘atmos—which is the opposite to that at low pres- 
sures; the velocities at 3 and 5 Mc/s lie below those for 
900 ke/s both at 25°C and 50°C. These French investi- 
gators are working well above the critical point, so that 
the inflections noted above are not to be anticipated. 


VII. DERIVATION OF THERMODYNAMIC RELATIONS 


Now that quite detailed and precise measurements of 
the velocity of sound in vapors are available the ques- 
tion arises whether they can be used to derive some of 


Temperature 


Fic. 7. Sound velocity and absorption coefficient at critical 
pressure plotted against temperature in sulfur «hexafluoride 
(Schneider). 


the thermodynamic relationships which are now re- 
quired by physical chemists and chemical engineers. For 
these it is necessary to have values over a detailed field 
of pressure and temperature, such as those of Fig. 5, at 
frequencies where dispersion is not evidenced. In part, 
this detail must be precise enough to enable one to 
measure gradients of the curves with, accuracy. 

To give examples, the velocity „oí sound may be 
expressed, when v is the specific volume, as 


ð a Op\? T 

vi=—a(=) +e(— = (24) 
dv/ r OMY C- 3 

(Hardy, Telfair, and Pielemeier™), When: the slope 


along an isothermal goes to zero, as at the critical point, 


m 


9 
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we are left with the second term, F i 
: . From this T s í ise ' 
can bẹ calculated, given C, from other e LCE 075 Ramen Jae sie E 
p= . = D. 
The velocity of sound may also be expressed in terms 
of entropy S, as V?= —72(dp/ay avian ae S x10” 
ae U dp/ 02) s, which has the ad- | 
vantage of having entropy as independent variable. Or | ' 
in terms of. pressure and temperature, Ri biel Bx 108 
7 | 
pa (=) (2 | 
Vx QP =) oit 
a a (25) | 
IG oG ( 8G \2 
Op’ OT? sen 
Sea eames te Tae 4 
Tx 198 


The independent variables herein are pressure and 
temperature and this is convenient for deriving tables 
of Gibbs’ function G. 

rly 2 eying este 

hese examples are given to show the possibilities of 
applying acoustic measurements to the compiling of 
thermodyndmic tables. 

The velocity of sound can also be expressed by using 
the molecular weight Mn thus 


a 0} 1% 
v= -(2) (26) 


e ðv T My 


ad ° 


To determine the second virial coefficient B of a gas near 
its condensation point from plots of velocity as a func- 
tion of pressure at various temperatures, van Itterbeek 
and Keegom® used the equation of state of Kamerlingh 
Onnes, viz, py=RT(1+B/2). 

Equation (26) then becomes 


Sp 
y= v( 1+2), (27) 
RT 


C 
T dB T? @B 


ee . with h=C,/R. 
h dT 2h(h+1) dT? 


X=B+ 


Plots of V against p are usually straight lines and, for 
different temperatures, these can be extrapolated to 


‘zero pressure to give V*. This can be checked against 


its value based on the ideal y (for p=0). The slopes of 
these lines enable X to be determined for different 
temperatures, against which it is plotted. Van Itterbeek 
and Paémel,®” for their measurements 1n oxygen at low 
temperatures, tried an expansion for B in the form 


B=otbT-* tel *+dT +e *. (28) 
-e, several test values of B 


To find-the coefficients @-- ; 
can be computed-from Berthelot’s equation 


Te 6T 2 
(a) (29) 


(the suffixes refer to the critical point) and combined 
with values of X from the experiments. Figure $ be ae 
the results for pxygen; X and B are plotted against T. 


Fic. 8. Virial coefficients in oxygen: B and X versus reciprocal 
of absolute temperature (van Itterbeek et al,). S on block stands 
for X. 


A number of the workers already quoted, e.g., van 
Itterbeek ef al.4~* and Lambert et al.*°* used this 
method of computing. 


IX. SHOCK WAVES IN VAPORS 


The shock waves initiated when a gas impinges on an 
obstacle at a speed U near or above the velocity of 
sound are intense sound waves. It is no longer possible 
to consider the density. as constant, therefore, Ber- 
noulli’s equation must be modified accordingly and the 
ratio of the specific heats, y, enters into the equations 
connecting pressure, temperature, and density as well 
as the Mach number, M=U/V. 

At low velocities, forces on a body of chord ¢ are a 
function uniquely determined by the Reynolds number 
Re=Ucp/n, n being the viscosity and p the density of 
the ambient fluid. To suit experimental convenience we 
can alter the values of velocity, etc. provided we keep 
the Reynolds number the same. There is, however, no 
point in changing from one gas to another without 
changing the pressure since all gases have about the 
same value of n. At speeds where the compressibility 
of the air has to be taken into account the Mach number 
of the flow enters as a second similarity parameter. It 
is obvious here that the substitution of air by a vapor 
having a low value of V will result in the same Mach 
number for lower power. The power expended in a 
wind tunnel of cross section A can be written kpA U’, 
where k is the power factor of the tunnel and depends on 
the tunnel design. To keep the tunnel area/model size 
the same, 4/c? must be constant. It can easily be shown 
that if this is done, for constant Re and M, the*power 
varies as V*/yp. 

The suggestion has been made by Smelt” and others 
that one of the chloro-fluorine derivatives of methane 
(known in the refrigerating industry as freons, or 
arctons) or one of the hexafluorides should form the 
substitute vapor. In them the velocities are-less than 


half that in air and they gre inert and nontoxic withal- 
In fact, high-speed tynnels of this type using freon-12 


bd ’ 
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(CCl:F») are in operation in the United States and in 
England. Certain questions arise, however, as to the 
exact correspondence of fields of flow even when the 
Reynolds number and Mach, number are correct. The 
principal questions are: 


(1) Does the change in y significantly alter the flow 
field and, if it does, can a correction be made for this? 

(2) Is the equation of state of the same form? 

(3) Will the vapor condense due to the lowering of 
temperature in an expansion. 


With regard to the variation of flow field with y, 
von Doenhoff, Braslow, and Schwartzberg™ have shown 
that, at a particular speed, variation of y can be com- 
pensated by a change in the relative thickness of the 
model. This is true even though a change in y from 1.4 
in air to 1.13 in freon, involves an almost negligibly 
small change in the thickness of the chord ratio. The 
speed range over which freon is an adequate substitute 
for air is, however, limited and is, in fact, from low 
speeds to about 1.2 times the speed of sound. 

If the relaxation time differs from that of air the 
whole flow pattern may be altered. The work of 
Miyahara and the author referred to in Sec. VI is 
pertinent. If the relaxation time in freon 12, 1.5 10-7 
sec, is confirmed it means that a shock taking place at 
a local M=2 (U=300 m/sec) at a distance of 1/20 mm 
in front of the nose would be completely relaxed in this 
distance, whereas the shock front in air would be 
scarcely affected in the corresponding passage. One 
would think this must affect the local pressure on 
the model. 

In this connection a method other than acoustical 
for measuring relaxation times was suggested by Kan- 
trowitz.®*: It consists in measuring with a Pitot tube 
the total pressure head in a jet of gas debouching from 
a reservoir. One would expect this to equal the pressure 
inside the reservoir, provided there are no losses in 
turbulence, etc. But, if the gas is stopped more suddenly 
at the nose of the Pitot tube than the time required for 
the energy adjustments we have been discussing, the 
pressure registered by the Pitot manometer will be 
different for the compression will take place as though 
vibrational. 

Proximity of conditions in the working fluid to the 
ctitical state affect both questions (2) and (3). Assum- 
ing, say, van der Waals’ equation, it would be possible 
to make some modifications of the thermodynamic 
expressions for the pressures and temperatures in terms 
of the constants a and b of that equation, but a general 
formula of correction would be difficult to obtain. It has 
been estimated that the temperature at the surface of 
a model, where the velocity is highest, is y~! times the 
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works near the critical state. In such an instance y 
and are rapidly changing. hea 

This eliminates carbon tetrafluoride (one which has 
been suggested), but we are making sound measure- 
ments on the hexafluorides of sulfur, selenium, and 
tellurium which look promising. ria 

It is difficult to say precisely what the high absorption 
coefficient in these vapors (as compared to air), signifies 
in this application. It must, one would think, have an 
effect on the devolution of the shock wave front. 

It should be possible to measure changes in wave 
front thickness due to such relaxations by a schlieren 
method. This has been attempted by Hubbard, Kan- 
kovsky, and Thaler” with intense ultrasonic waves in 
carbon dioxide, while Hornig ef al." have been 
taking shadow photographs of the density in pulsed 
shock fronts progressing along a tube filled with nitrogen 
or oxygen in an attempt to correlate these densities 
with some assumed relaxations in heat capacity. Present 
day optical techniques do not seem sufficiently de- 
veloped to base precise conclusions on the photographs. 
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In default of the theorem of “detailed balance”: P;;=P;;, with 
regard to elementary transition probabilities, several “balance” 
theorems are introduced and proved on the basis of symmetry of 
physical laws in space-time. (1) First theorem of “averaged 
balance” (Sec. 5): We can establish P;;=P;; by averaging over 
quantities of “minus class.” Table V (Sec. 3) gives a list of 
“minus” quantities. (2) The so-called “detailed balance of col- 
lisions” in classical physics is a special case of Theorem I. (3) 
Heitler-Coester’s theorem of “semidetailed balance” is also a 
special case of Theorem I. (4) Second theorem of “averaged 
balance” (Sec. 5): We can establish P;;=P;; by averaging over 
quantities with pe=—1. The quantities with prp=—1 are listed 
in Table II (Sec. 2). (5) Theorem of “cyclic balance” (Sec. 7): 


In classical physics, a chain of transitions ij—k—- - -~i repeats 
itself cyclically. (6) Theorem of “long-range balance” (Sec. 7); 
The time average of transition probability from 7 to j is equal to 
the time average of transition probability from 7 to i. Theorems 
I, II, and III, are direct consequences of inversibility (covariance 
for space-and-time inversion). Theorem IV is a consequence of re- 
verstvility (covariance for time reversal). Theorems V and VI are 
connected with ergodicity of Markofl’s chains. This ergodicity is 
proved by the condition of bilateral normalization of transition 
probabilities: 2; P;;=1, 2; Pij=1. This bilateral ‘normalization 
in turn can be derived from either reversibility or inversibility. 
The limits of validity of all these balance theorems in actual 
applications are carefully examined in the text. 


4. INTRODUCTION 


T has previously come to general attention that the 
principle of detailed balance by no means represents 
a universal rule in quantum physics. Thus, thanks to 
Pauli’s timely remark, the demonstration of the 
H-theorem which does not utilize the assumption of 
detailed balance has acquired a new importance? 
The transition probability P;; from state S; to state 
S; has to obey, because of the very nature of prob- 
ability, 


D P= ih P;;>0. (1.1) 
s 
However, the inverse normalization, 
XY, P;=1, (1.2) 


is not self-evident. 

Husimi*® and Stiickelberg? demonstrated that condi- 
tions (1.1) and (1.2) are sufficient for derivation of the 
H-theorem.* A simplified version of this proof was given 
by Pauli? 

Obviously, the detailed balance condition, 
dps | Py= Pz, (1.3) 
17. Hamilton and H. W. Peng, Proc. Roy. Irish Acad. A49, 197 
44); W. Heitler, Quantum Tory of Radiation (Oxford Univer- 
sity Press, London, 1944), second edition, p. 252. 

2 E. C. G. Stückelberg, Hely. Phys. Acta, 25, 577 (1952). 
*Koci Husimi, Theory of Probability and_Stalistics (Kawade 
Shobo, Tokyo, 1942, in Japanese), p. 277. 

* Note added in proof.—See also K. Yosida, Proc. Acad. (Tokyo) 
(1940). Dr. Brockway McMillan was kind enough to point 
at ‘she mathematical theorem underlying Husimi-Stiickel- 
‘oof, has been known to mathematicians for a long time 
at von Neamann also used the same theorem in his book: 
von Neumann, Mathematische Grundlagen der Quanten- 

( ea Julius Springet, Berlin, Germany, 1932), in 
f a 
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allows one to deduce Eq. (1.2) from Eq. (1:1), but this 
is too restrictive a condition. 

It is known that the bilateral normalization, Eqs. 
(1.1) and (1.2), can be derived from the unitarity of 
transition matrix (S-matrix) in quantum physics. But, 
in this paper, the bilateral normalization is considered 
in connection with the symmetry of physical laws in 
space-time. It will be pointed out that either reversi- 
bility (covariance for time-reversal) or inversibility 
(covariance for space-and-time inversion) is sufficient 
to deduce the bilateral normalization, without making 
use of the unitarity of transition matrices or of detailéd 
balance. Indeed, reversibility or inversibility has a very 
clear physical meaning and may be considered as a 
more basic physical principle than the unitarity of 
transition matrices, which is specifically a quantum- 
mechanical situation. In the quantum theory of ele- 
mentary processes there never appear transition matrices 
which do not obey reversibility or inversibility; there- 
fore, reversibility or inversibility can be considered to 
be a sufficiently general rationalization of the bilateral 
normalization. 

The principle of detailed balance, Eq. (1.3), is 
sometimes resorted to in problems other than the 
H-theorem. It is therefore worthwhile investigating its 
limits of validity. From’ Boltzmann’s classicai work, it 
is clear that this principle is intimately related to inver- 
sibility. Our investigation will show that if the physical 
system has inversibility, the theorem of detailed balance 
can be re-established in a broader sense with the help 
of hypotheses of elementary disorder (or simply chaos 
hypotheses) with regard to the physical quantities of 
what will be called the “minus class.” To the minus 
class belong regular tensors and first “kind pseudo- 
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> tensors of odd ranks and second and third kind pseudo- 
tensors of even ranks. This classification of tensors will 
be discussed in detail in Sec. 2.4 
“his result clearly explains that the principle of 
detailed balance in classical physics is bound to utilize 
the chaoss hypothesis with regard to the positions of 
molecules, which are minus variables. Heitler-Coester’s 
so-called theorem of semidetailed balance® is also a 
variant of the above-mentioned general rule. Rather 
inexactly. expressed, this theorem of semidetailed 
»balance means that we can re-establish the detailed 
balance by averaging over the spin directions of par- 
ticles. Since spin is a minus variable, it is natural that 
a chaos hypothesis is necessitated with regard to this 
variable. Although this theorem is particularly con- 
venient for considerations in the perturbation theory, 
its domain of validity should not be overestimated. In 
order to apply this theorem, we have to describe the 
particles only by plane waves and to describe the elec- 
tromagnetic field, not by the field strengths, but by its 
sources. Otlferwise, we need further averaging or chaos 
hypothesis regarding other variables of minus class. On 
the other hand, it is also not a general rule that each 
time the particles have an “internal” freedom, averaging 
or chaos hypothesis is needed regarding this freedom. 
Boltzmann already noticed that even if the detailed 
balance does not hold, i.e., if the system in state S; 
does not return to S; after a double transition, SiS; 
—S; there will be a chain of transitions, SSS: 
—.-+-95;; by which the system will come back to the 
original state. This “cyclic balance,” or “closed cycle of 
corresponding collisions” as Tolman’ calls it, can be 
considered as a generalization of detailed balance. 
Although lacking proof, Stiickelberg? pointed out that 
the. mechanisM of his H-theorem is connected with 
cyclic balance. It is obvious that cyclic balance is a 
manifestation of the ergodic nature of physical phe- 
nomena. : 
In the-last section of this paper, we shall give a sim- 
plified version of the ergodic theorem, using only the 
hypothesis of bilateral normalization of transition prob- 
abilities. ‘This will provide a general (though schema- 
tized) basis for the theorem of cyclic balance, without 
referring to collision processes. A 
Fhis simplified ergodic theorem cannot be directly 
applied to the actual physical problems, because of 
various simplifying conditions which will be explained 
at an appropriate place. For instance, the number of 
states is assumed to be finite, which is not permissible 
for the applications in classical physics. We also ignore 
Ae Wane Phys. Chem. Res. (Tokyo) 39, 
Rev. 84, 1°08 (1951). 


Feitler, lecture notes, École d'Été 
F. eocen Phys. Rev. 84, 1259 


4S. Watanabe, Sci. Pap. Inst. 
157 (1941); S. Wafanabe, Phys. 

5 W. Heitler, reference 1; W. 
de Physique Théorique, 1952; 
(1951) 


ER. C. Tolman, The Principles of Statistical Mechanics (Oxford 


University Press, London, 1938), p. 114. : 3 er 
ER Professor W. Pauli, who in a priva 
A E Ae ized the importance of cyclic balance. 
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the important notion of “macroscopic cell” on the 
energy shell. But, on the other hand, our ergodic 
theorem has the advantages not only of being very 
simple and mathematically rigorous, but also of ex- 
hibiting all the essential assumptions necessary for the 
deduction of the ergodic nature of transition prob- 
abilities. It will be shown that the simplifying conditions 
being admitted, the bilateral normalization is the neces- 
sary and sufficient condition for the “ergodicity.” The 
term ergodicity will be defined in accordance with the 
general ergodic theorem in physics. If reversibility or 
inversibility is taken as the foundation of the bilateral 
normalization, we can attribute ergodicity to the rever- 
sibility or iaversibility of the. physical laws. As in other 
versions of the H-theorem and the ergodic theorem in 
quantum physics, here also, the noncommutability of 
the Hamiltonian with the operators defining the states 
plays an essential role. 

It is intended in Part If to examine reversibility, 
reflectibility (symmetry in space) and inversibility of 
quantum field theory,? and to discuss their bearings on 
the interaction types and other allied problems. In this 
paper, these symmetry properties are formally defined 
and, assumed to exist. The classification of physical 
quantities into four “kinds” is explained in a fashion 
which may seem unduly elaborate. But this will prove 
to be instrumental not only for the discussion of the 
principle of semidetailed balance but also for the dis- 
cussion of reversibility, reflectibility, and inversibility 
in general. 


2. “KINDS” AND “CLASSES” OF TENSORS 


In this section, we shall give the mathematical defi- 
nitions of the four “kinds” and two “classes” of tensors. 
In the next section, we shall first introduce a formal 
method to determine the kinds of tensors representing 
various physical quantities, and then clarify the physical 
implication of this determination. 

We consider the entire group of congruent trans- 
formations of coordinates, ae 


x'”= ar., (u, v=1, 2, 3, 0) (2.1) 
which leave invariant 
rutt = gutt = (H (EH (PP a). (2.2) 


It is essential in the investigations involving inversions 
to use only the real coordinates, lest the connectivity 
of the Minkowski space may be altered. H 

The tensors /#”** of the “regular” kind are definéd by 
the transformation rule, 


{ers gt ay +1", (regular) (2.3) 


p 


s As far as reversibility is concerned, the subject is fairly fully 
covered in the second paper quoted under referéhce 4. See also 
the earlier works on this subject: E. P. Wigner, UE Nachr. 
546 (1932); S. Watanabe, Le Deuxième Théorème de ; Thermo- 
dynamique el la Mécanique Orxlulaloire (Hermann et Cie, Paris, 
1935); S. Watanabe, Sci, Pap? Inst. Phys. Chem. Paeahi 
(Tokyo) 31, 109 (1937). + x 3 me 
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TABLE I. The kind to which a product of two tensors belongs. 


Reg. 1 2 3 
Reg. Reg. 1 2 3 
1 1 Reg. 3 2 
2 2 3 Reg. 1 
3 3 2 1 Reg 


which we write for simplicity as 
PIRE (regular) (2.4) 


The pseudotensors #”-` of the first, second, and third 
kinds are defined, respectively, by* 


V=oTt, (1st kind) (2.5) 
V=o,Tt, (2nd kind) (2.6) 
and =o, it, (3rd kind) (2.7) 
where 
OG hac /2 e 0) 
E (2.8) 
ONELA A) 
Ox? 7) Ax"° 
o1=0,0=— / |——]}, (2.9) 
dx? dx? 
Oe 2008) 7) 0 (a7! 2’? 08) 
o1=001= : / (2.10) 
0 (21,227,234) O(a! x? x) 


The usual definition of ordinary tensors includes 
regular and second kinds, and that of pseudotensors 
includes first and third kinds, since only the “ortho- 
chronous” transformations are considered in their 
definition. 

It is obvious from the foregoing definition that the 
kind to which a product (with or without contraction) 
of two tensors belongs is determined by the following 
tules: (a) The product of two tensors of the same kind 
is a regular tensor. (b) The product of a regular tensor 
and a pseudotensor of a given kind belongs to the last- 
named kind. (c) The product of two pseudotensors of 
different kinds is a pseudotensor whose kind is different 
from either one of the two factors (see Table I). These 
rules are symmetrical regarding three tensors involved 
in the operation of multiplication. 

The antisymmetric tensor’ 7”-* of the rank n which 
is “complementary” to an antisymmetric tensor? -> 
of the rank (4—n) is defined by 


si r= (1724) ema, (2.11) 

i Tu= (1/6) Ewart, (2.12) 
Tur=* (1/2) Ewart, (2.13) 

ag = coal, (2.14) 
- -o Turd Ewart, (2.15) 


. 


aie 9The modifier “antisymmetric” apnlies only to ranks higher 
__ than one. 
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where the tensor ¢ is completely antisymmetric, and its 
component €wa(=— °>) is +1 or — 1 in any coordinate 
system, accordingly as (u,v,«,4) is an even or odd per- 
mutation of (1,2,3,0). To satisfy this definition, e must 
be a pseudotensor of the fourth rank of the first kind. 

This complementary relation is reciprocal with regard 
to rand 40 The kinds of r, t, and e (1st kir.d) are related 
by the product rule. Thus, the second line of Table I 
will give the kind of r as dependent on the kind of 4, 
The scalar defined by Eq. (2.11), as complementary to 
e itself, is a regular scalar and has the value —1 in any 
coordinate system. This ep» should not be confused 
with a regular tensor nw» Which is +1 or —1 accord- 
ingly as (y,v,x,4) is an even or odd permutation of 
(1,2,3,0) in a particular coordinate system. Such a 
tensor changes the signs of its components by a trans- 
formation in which c=—1. The complementary scalar 
to 7 is then a pseudoscalar of the first kind. 

If a physical quantity is to be expressed as a tensor 
component, we have to determine (a) the rank of the 
tensor, (b) the component which represents it, and (c) 
the kind of the tensor. We assume in this work thet 
(a) and (b) are already determined by the Lorentz 
transformations in the narrow sense, except for the 
ambiguity due to the possible complementary repre- 
sentation. 

To determine the kind of a tensor, it is sufficient to 
examine its behavior for time-reversal (hereinafter 
reversion), : 


(2.16) 


and its behavior for total space-reflection (hereinafter 
mirage), 


Soh, de eee, Gi, ar Sa 


(2.17) 
For reversion or mirage, a component Q of any 

tensor will retain or change its sign, but its absolute 

value remains unchanged. We write for reversion 


ae, ES a, Se, Se? 92°, 


Q’=prQ, pr=+i1 or —1, (2.18) 
and for mirage 
Q’'=pm0, pw=+1 or —1. (2.19) 


The four possible combinations of the values of pr and 
px will lead the classification into four kinds.- 


TABLE II. The sign of pr for various components as 
dependent on the kind. 


Rank Component Reg. ist and 3rd 
Scalar . = = + 
Vector Space + — — + 

Time — + + E 
Tensor, 2nd rank Space-space 1 

Time-time: + = = aF 

Space-time — + aF = 


1 A little caution must be taken regarding the sign. For instance, 
r= but #2=—7%, also r= but t= —r!™ according to the 
above definition. 7 
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According to the definition of the four kinds of 
tensors, pr and pm are directly determined by the kind 
of the tensor and by the nature of the component in 
coasideration. Tables II and III list pg and py for 
various components up to the second rank. In the 
designation of the nature of components in these tables 
“space” means y= 1, 2, 3 and “time” means p=0. i 

For the combination of reversion and mirage (herein- 
after total inversion or inversion), 3 


e t,o r, a, L, — 2, (2.20) 
we have | f 

Q'=p10, 2.21) 
with Oe 
Pr=PprpM. (2.22) 
For total inversion, the transformation matrix T of 

Eqs. (2.4)—(2.7) becomes simply ™ 
A T=}I or —I, (2.23) 


as the rank of the tensor is even or odd. In Eq. (2.23), 
I means the identity matrix. Here the o’s are 


eo. |) o=+l, (2.24) 
Definitions (2.4)-(2.7) show that the coefficient pr is 
then simply@he product of T in Eq. (2.23) and one of 
the o’s in Eq. (2.24). Thus we obtain a simple rule for 
pr which depends only on the rank and the kind of the 
tensor: pr is positive for regular tensors and first kind 
pseudotenfors of even ranks and for second and third 
kind of pseudotensors of odd ranks. For regular tensors 
and first kind pseudotensors of odd ranks and for second 
and third kind pseudotensors of even ranks, pr is 
negative. All the quantities of the former group will 
form the “plus class,” and all the quantities of the latter 
group the “minus class.” See Table IV. 


o=—1, o=—1. 


3. DETERMINATION OF THE KINDS OF 
PHYSICAL QUANTITIES 


It is asbasic assumption of this entire work that any 
physical quantity can be represented as a component 
of a tensor ofa certain kind. We are now going to intro- 
duce a set of formal prescriptions by which the kinds 
of various physical quantities can be determined ac- 
cording to the definition of each quantity. Admittedly, 
“definitions” of physical quantities and “physical laws” 
involving those quantities are hardly separable in many 


cases. As a result, one may raise an objection to the 


TABLE. IIE. The sign of par for various components as 
H dependent on the kind. 


——————— 


Scalar c + = + = 
Vector Space = + = SR 
e Time che = =E = 

Tensor, 2nd rank Space-space 
Ue ee Time-time + = + PA 
Space-time = $ = P 
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Tage IV. The class of a teasor determined by its rank and kind. 


Rank Reg. Ist 2nd 3rd 
Even + “> = = 
Odd = oe sfa ai 


“proof” of reversibility, etc., to the effect that the kinds 
of the physical quantities are determined in such a way 
that the reversibility, etc. may hold automatically. The 
point is, however, that the same physical quantities 
appear in various physical laws, and that it is meaning- 
ful to verify that there is no internal contradictions 
among these laws. We shall use as elementary a defini- 
tion as possible to determine each physical quantity’s 
kind. The basic rules serving this purpose are as follows: 


(a) The attributes of elementary particles, i.e., rest- 
mass, electric charge, mesie. charge, and magni- 
tude of spin, are regular scalars. 

The proper-time differential ds is a pseudoscalar 
of the second kind. 


~ 
= 


Rule (b) means that the sign of ds is determined by 
the sign of the time differential dt. In addition to these 
rules, we notice that the transformations (2.16) and 
(2.17) do not change the sign of the operation 


ff [irar 


This means that, as far as the signs of pr and pu are 
concerned, a physical quantity and its density behave 
in the same manner. 

To begin with, we note that four-velocity dxt/ds is a 
second kind pseudovector, since dx” is a regular vector 
while ds is a second kind pseudoscalar. By the regular 
invariance of intrinsic mass, the momentum of a particle 
md:v*/ds then becomes a second kind pseudovector. This 
warrants the positive-definite definition of energy, since 
pr=pmu =1 for the time component of a second kind 
vector. Force md?x#/ds? must be a regular vector, for dx“ 
is a regular vector and ds stands here squared. Since the 
total charge of an elementary particle is a regular scalar 
the current-density vector should be a second kind 
pseudovector, the only kind of vector whose time com- 
ponents (charge-density in this case) has pr=pm= 1. 
The orbital angular momentum, being the product of å 
position-vector (regular) and a momentum-vector 
(second kind), should be represented by the Space 
space components of a second’kind pseudotensor. From 
Tables II and III, we see that the space-space com- 
ponents of a second kind pseudotensor have the same 
values of pr and px as the space components of a first 
kind pseudovector. This suggests that the spin-densisy 
of a particle, if expressed as a vector, should belong 4o 
the first kind. The magnetic moment depsity*cah be 


pictured as the product ofwharge (regular) and angular ~ 3 


momentum (second kind); hence, it must be repre- 
> » > 
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TABLE V. Classification of various quantities into 
kinds and classes. 


Rank Kind Class Quantities 


Intrinsic mass, electric charge, mesic 


Scalar Reg. + ; 
charge, magnitude of spin, Lagrangian 


Ist aR Magnetic pole strength 
Vector Reg. = Position-time, force 
Ist = Spin density 
2nd + Four-velocity, momentum-energy, 


current-charge density, electromag- 
netic potentials, linear polarization of 


photon 
Tensor Reg. + Energy-momentum density 
2nd — Angular momentum (space-space), 


electric field (space-time), magnetic 
field (space-space), electric moment 
density (space-time), magnetic mo- 
mert density (space-space), circular 
polarization of photon (space-space) 
All the quantities listed under 2nd 
class pseudotensor become 3rd class 
pseudotensors by the interchange of 
space-space components and space- 
time components 


3rd = 


sented as the space-space components of a second kind 
pseudotensor. If it is represented as space-time com- 
ponents of a tensor, this tensor must belong to the 
third kind in virtue of the theorem of complementary 
tensors. By the definition of the electric field as the 
force on a charge, the pr and px of the electric field 
must be the same as those of the space components of 
a regular vector which is force. From Tables II and III 
we see that the electric field, if represented as the space- 
time components of a tensor, must belong to the second 
kind. If we know, from the Lorentz transformation in 
the narrower sense, that the electric and magnetic fields 
build a tensor, the magnetic field should then be repre- 
sented as space-space components of a second kind 
tensor. An alternative representation of the electro- 
magnetic field is, in virtue of the theorem of comple- 
mentary tensors, such that the electric and the mag- 
netic fields are respectively represented as space-space 
components and space-time components of a third kind 
pseudotensor. If the magnetic pole strength is defined 
as the ratio of the force to the magnetic field, it must 
behave like a first kind scalar, since we have pr= 1 and 
pa=—1 for force and pp=—1 and p= 1 for magnetic 
field. 

The kind to which energy-momentum density tensor 
belongs can be determined by the requirement that its 
space-time components and time-time component 
should behave like the momentum-energy vector which 
is avsecond kind vector. This classifies the energy- 
momentum density tensor as a regular tensor. From the 
relation between the energy-momentum density tensor 
and the Lagrangian density, it follows that the latter 
has to behave like the diagonal elements of the former. 
In otker words it is a regular scalar. 

-Tke ranks and kinds of various physical quantities 
~ being thus determined, their, classes immediately follow 
rom the rule which is tabulated. in Table IV. All these 


` versed state, 
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results together with some results which can easily be 
inferred are listed in Table V. 

The determination of the kinds to which various 
“internal” variables (i.e., other than position-time zaq 
energy-momentum) of a spinor field belong, requires a 
further discussion in quantum field theory, which will 
be given in Part II. Only some of the results will be 
given here. The pr of these quantities have already been 
determined in a previous paper" while the par of these 
quantities can be shown to be the same as in the 
c-number theory as determined by the transfo:mation 
properties of spinors. Table VI lists the classification 
of these quantities, assuming that the two spinors 
appearing in each expression belong to the same trans- 
formation rule for reversion, and mirage." 

The kind of the pi-meson field is the same as the kind 
of*its source, since the differentiation operator, if 
involved, is a regular vector. If the spinors representing 
the nucleons before and after the emission dr absorption 
of a pi meson are of the same kind, then the above table 
(under q-number theory) will immediately give the 
kind of the pi-meson field-? An inspection of Takle VI 
will tell that a combination of scalar and vector types 
of interaction and a combination of pseudovector and 
pseudotensor types of interaction are ot allowed. 
This “exclusion rule” of combination arises not from 
mirage but from reversion due to the change of pr in 
q-number theory." This change of pe is exactly what is 
required to give to these quantities their respective 
physical meanings.!! (See for instance that spin wtysy,.W 
becomes, as it should, a first kind vector.) 

We now proceed to introduce the notions of “re- 
? “miraged state,” and “inverted state.” 
The determination of kinds of the physical quantities 
given above is based essentially on a comparison of the 
two descriptions of the same physical phenomefon 
referring to two different coordinate systems related to 


TABLE VI. The kinds and classes of various tensorial quantities 
built with two spinors of the same kind. The resulis for two 
spinors of different kinds can easily be inferred from this table (see 
reference 11). 


c-number theory q-number theory 


Kind Class Kind Class 
vy 2nd = reg. + 
ip yw 2nd 4b 2nd + 
Ww yyw 2nd — 2nd = 
WW yey 3rd — 1st a 
i ysya 3rd = ist = 
WYS Yu yy 3rd — 3rd oa 

eee 


4 The second paper quoted under reference 4. These topics will 
be discussed in detail in Part II, Secs. 9 and 10. 

» This is true for neutral pi mesons. The situation is more com- 
plicated for charged pi mesons. See Part II. 

13 Thus; this is a direct consequence of the results of the second 
paper quoted under footnote 4. See also G. Liiders, Z. Physik 133, 
325 (1952). For a more general discussion, see Sec. 10 Part II of 
the present paper. We shall see that the above “exclusion rules 
hold also in cases where the two spinors involved belong to dif- 
ferent transformation laws. 
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» each other by Eq. (2.16) or Eq. (2.17). The physical 


insight. into the meanings of “kinds” can be obtained 
more easily by an alternative interpretation of the 
trénsformation (2.16) or (2.17), namely by considering 
two phenomena connected by this transformation 
described by the same coordinate system. 

Two phenomena are said to be reversed phenomena 
of each.ather if, by suitably choosing the coordinate 
origin, all the space coordinates involved in one phe- 
nomenon at any instant /=4 become the same as those 


, involved in the other phenomenon at —1. It is hereby 


understood that the corresponding coordinates refer to 
the same physical entities, e.g., the particles of the 
same attributes. The two states of the physical system, 
one referring to a phenomenon at /, the other referring 
to its reversed phenomenon at —/, are said to be 
reversed states of each other. : 

From this definition it follows that in two mutually 
reversed states, the same particles have the same posi- 
tions but the opposite velocities. Thus the current in 
the reversed state should have the opposite sign, re- 
sulting in the opposite sign of magnetic field, etc. The 
rest of the argument then follows the same pattern as 
in ‘the preceding determination of kinds of tensors. We 
can confirm? in this manner, that the invariance or 
change of thé signs of the physical quantities in the 
reversed state is exactly the same as pr determined in 
the foregoing. The reversed state Spr of a state S can 
now be redefined as a state in which all the physical 
quantities with prp=1 have the same values as in S 
and all the quantities with pp=—1 have the same ab- 
solute values but with the opposite signs. 

Two phenomena are said to be miraged phenomena 
of each other if, by a suitable choice of the coordinate 
origin, all the space coordinates involved in one phe- 
nomenon at / are the negative of all the coordinates 
involved in thé other phenomenon at the same instant 
t; these coordinates are supposed to refer to the same 
physical,entities. The states of the physical system in 
such two phenomena at the same instant are said to 
be the miraged states of each other. 

Comparing the consequences of this definition of the 
miraged states with the preceding determination of the 
kinds of tensors, we can redefine the miraged state Sar 
of a state S as a state in which all the physical quan- 
tities with pm= [1 have the same values as in S and all 
the physical quantities with pw=—1 have the same 
absolute values but with the opposite signs. 

Two phenomena are said to be totally inverted phe- 
nomena of each other if, with a suitable choice of the 
pace-coordinates involved in 
one phenomengqn at i are the negative of the corre- 
sponding space-coordinates-involved in the other phe- 
nomenon’ at —/. The two states compared here are 
totally inverted states of each other. The totally ie 
verted state Sz of a state S can be defined as a state in 
which all the physical quantities with pr=1 have the 


space-time origin, all the s 


v 
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same values as in S and'all the physical quantities with 
pr=—1 have the same absolute values but with the 
opposite signs. 


4. REVERSIBILITY, REFLECTIBILITY AND 
INVERSIBILITY 


Every closed system of physical laws-must include a 
time-dependent law from which it is possible to deduce 
predictive statements. Such a theoretical system should 
be capable of answering questions of the following type: 
What is the probability P(.S—S’; ) of finding a physical 
system in state S’ at the end of a period of time 4 if the 
system was found in state S at the*beginning of this 
period? Such a probability will be simply called transi- 
tion probability from S to S’. 

If the description of the system by states S and S’ 
is maximal, i.e., as detailed as allowed in principle, the 
prediction may be called a micrescopic or dynamical 
prediction, while in other cases it is only statistical. 
If the transition probability refers to a “statistical” 
prediction, we shall use the symbol W instead of P. 
We shall deal only with the dynamical probability P in 
this section. In classical physics, P is either 1 or 0, 
while in quantum physics we have only 

5 


0O<P<1. 


In classical physics, a state is maximally defined if 
the values of all the independent physical quantities 
are furnished. In quantum physics, a maximally defined 
state is a “pure state,” or a quantum state, in contrast 
to a “mixture” (Gibbs ensemble or density matrix). 
Such a pure state may be considered as an eigenstate 
of a set of mutually commuting operators, representing 
a group of physical quantities, although in some cases 
these operators may be quite complicated. 

Covariance for reversion, or reversibility, means that 
a process and its reversed process have the same prob- 
ability, i.e., the transition probability from S to .S’ 
during £ is equal to the transition probability from the 
reversed state Sp’ of S’ to the reversed state Sz of S 
during time /. Symbolically: 5 


P(S>S'; )=P(Sr'>Sr; t). (4.1) 


If a state S is characterized by the values Q of physical 
quantities, Sr is characterized by pzQ. We write for 
brevity $ 


S={Q}, Sr={erQ}, S'=(Q'}, Sa’={0n0'}. (42) 


Covariance for mirage, or.reflectibility, then reeans, 
in a similar symbolism, 


P(SS'; ))=P(Su-S a; D (4.3) 


4 According to this usage of words, the ordinary transition prob- 
ability in quantum physics from one quantum state to another 
should be qualified as microscopical or dynamical and not statis- 
tical. The “statistical” transition probability in quantum physics 
then refers to a transition of a system known to be in a Hilbert — 
subspace to another Hilbert Subspace, where the dimensions of — 
subspaces are more than cae. r 


< AS 
` 


s s 
` ` 
: 3 , % 

Collection, Haridwar 


Fide 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


32 SATOSI 
with 
S={0}, Su={eu0}, S'={0'), Sw =(P). (44) 


Finally, covariance for total inversion, or inversi- 
bility, means 


P(S>S'; ) = P(S >Sr; Ì (4.5) 


with 
S={Q}, Sr={P}, S'={0'}, Sr’={pr0"}. (4.6) 


There is a simple theorem which follows directly 
from this definition, because of (2.22). 

Theorem: If a physical system obeying a certain set 
of physical laws has any two of the three kinds of co- 
variance, reversibility, reflectibility, and inversibility, 
then it also has the third one. 

For instance, suppose that a system enjoys reversi- 
bility and inversibility. First, by reversibility, we have 


P({Q {09} 5 0=P({erQ}—{erQ} 54), (4-7) 
and second, by inversibility, 
P({erQ}—{prQ} ; )=P({orerQ}—{orp2Q'} 58). (4.8) 
Combining these two we obtain Eq. (4.3), for 
$ PM=PIPR. (4.9) 


It should be noted, however, that it is quite possible 
that a physical system possesses only one of the three 
kinds of covariance. Physical systems which obey re- 
flectibility but not reversibility (hence not inversi- 
bility) are familiar to us. It may also very well happen 
that a physical system does not obey Eqs. (4.1) and 
(4.3) with the right signs of pr and par but does obey 
them with wrong signs of pr and px for some of the 
quantities, leading however to the right signs of 
pr=prpm. This statement is true for any permutation 
of pr, pu, and pr. 

In classical physics, all the physical laws are written 
in terms of tensorial components. Therefore, if the 
physical quantities appearing in each equation belong 
to the same kind, reversibility and reflectibility (hence 
also inversibility) are automatically guaranteed, since 
the existence of a solution representing a process will 
imply the existence of another solution representing the 
reversed or miraged process. As this situation is well 
known, we shall limit ourselves to some remarks of 
general nature. 

The mechanical laws are covariant for both reversion 
and mirage as far as the force md?x#/ds? is equated to a 


_ regular vector. The space‘components should then have 


pr=t1, pu=—1. The frictional force, e.g., 
—k(v/|v|)v? (Newtonian), has pm=—1 but pp=—1; 
thus, it satisfies reflectibility but not reversibility. 
However, the Lorentz force e(E+[vxH]) has the 
right signs. Fhe Maxwellian equations are covariant for 
bathe reversion and mirage, since the electromagnetic 
ne tensor and its source, aire current vector, belong 
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this, Ohm’s law, I=cE is not covariant for reversion; 
the left side has pr=— 1, pw=—1 while the right side 
has pp=+1, pxr=—1 (see Tables II, II, and V). 

In quantum physics, the physical laws are ‘aot 
written in tensorial expressions. Therefore we have to 
examine whether we can construct the whole theory in 
such a way that the expectation values ‘(including 
eigenvalues) of all the relevant physical quantities 
behave for reversion and mirage as their respective 
kinds will dictate. The kinds of the physical quantities 
which have classical analogs can easily be determined 
by classical physics. The purely quantum mechanical 
quantities such as spin, magnetic moment, etc., can also 
be determined by their relations to known classical 
quantities. For instance, from the conservation law of 
total angular momentum, we have to assume the same 
p and pw for spin as for orbital angular momentum. 
This is what has been done in Sec. 3. The question as to 
whether quantum physics in its entirety z can be for- 
mulated in a covariant way for reversion and mirage 
will be studied in Part II. 

In the remainder of this paper it is assumed that the 
physical laws governing the physical systems under 
consideration obey reversibility, reflectibility, and in- 
versibility. This assumption may be comzidered to be 
warranted as far as atomistic laws are concerned. In 
particular, it is understood that these covariance prop- 
erties exist Independently of how the states S and S’ 
are defined, in so far as they are maximally defined. 


5. DETAILED BALANCE, SEMIDETAILED BALANCE 
AND AVERAGED BALANCE 


The theorem of detailed balance, literally taken, 
would mean 


P(SS’ ; 1) = P(S’>S; i)? (5.1) 


This type of theorem holds only in the first-order per- 
turbation theorem in quantum theory and, of course, 
is not of a general validity. , 

The so- -called theorem of detailed balance in classical 
physics by no means claims Eq. (5.1), which is a dy- 
namical or microscopic law, but it does represent a 
statistical law in which the state of a system is charac- 
terized by a distribution function in velocities (or 
momenta). It may be written, to exploit the distinction 
between P and W, as 


W(S>S'; 1) =W(S'>S; À, (5-2) 


where S is defined only by a certain distribution in 
velocities. This theorem is based essentially on the 
interesting fact that, as far as the linear momentum 
and energy of particles are concerned, a state and its 
totally inverted state are identical, since pr=1 for all 
the four components of momentum-energy vectors 
which are pseudovectors of the second kind. (See 
Table IV.) 

' This indicates that the basic fact underlying the 
theorem of detailed balance is the theorem of total 
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ainversibility. This theorem which can be written as 


P(S—S'; ù= P(S7'S7; Ù, (5.3) 


% 
means that the (dynamical) transition probability from 
state S to state S’ is equal to the (dynamical) transition 
probabilitys from the inverted state S7’ of S” to the 
inverted state+S; of S, whereby the inverted state is 
constructed from the original state by keeping the 
values of all the quantitigs of the “plus” class and 
changing the signs of all the quantities of the “minus” 
class (sec Table V). It should be noted that since the 
position 1s a minus quantity, comparison must be made 
between a quantity at x in S and the same quantity at 
—x in Sz. For instance the electric field (a minus 
quantity) at x in S must be equal in magnitude and 
opposite to the electric field at —x in Sz. This is in 
agreement with the situation created by the source 
point of this electric field placed at x in S and the same 
source point placed at —x in Sy. 

Let us now disregard the electromagnetic field 
strengths aņd the notion “force.” Then, if we con- 
sider only particles without spin, the only difference 
between a‘state and its inverted state lies in position 
coordinates and angular momenta (see Table V). This 
means that Cdordinates should be miraged; the change 
in sign of angular momentum ensues automatically. 
This mirage of coordinates (x——x) involves not only 
the mirage of positions of particles but also the mirage 
of shapes ot particles and mirage of the boundary. 

It is now clear that, if the molecules are spherical 
(or points) and spinless and the boundary issymmetrical 
with regard to mirage, then the classical theorem of 
detailed balance in Eq. (5.2) can be deduced, without 
discussion of collision processes, simply by the as- 
sumption that*the distribution function is independent 
of position. This shows that the classical theorem of 
detailed balance is based on the “chaos” hypothesis in 
regard to the positions of molecules. (Chaos in angular 
momentais a result of the chaos in positions.) 

Tf the molecules are not spherical and/or the boundary 
is not miragesinvariant, the chaos hypothesis regarding 
positions does not guarantee Eq. (5.2), since we still 
have miraged molecules and the miraged boundary in 
the inverted state. If the particles have at least one 
plane of symmetry, the miraged shapes of the particles 
can be reached by some rotations. If the boundary has 
at least one plane of symmetry, the miraged boundary 
can be considered as a rotated position of the same 

In. this case, we can further introduce a 
chaos hypothesis in regard to the orientations of the 
molecules. This will secure a type of relation in Eq. 
(5.2), but the right hand side will still refer to the 
rotated position vf the boundary. Only if the boundary 
has a symmetry with regard to mirage, can we have 
the classical theorem of detailed balance on assumption 
of two kirtds of chaos, one regarding positions, the other 
regarding the orientations of molecules. This situation 
explains why tke usual illustrations of the breakdown 


boundary. 


PHYSICAL 


LAWS. PART I 33 
of the theorem of detailed balance concerns either non- 
spherical molecules'® or some irregular boundary. The 
first categories of cases of breakdown can be remedied 
by the assumption of chaos with regard to the orien- 
tations of molecules if they have at least one plane of 
symmetry. It is possible that this hypothesis should be 
sufficient even if the molecules do not have a plane of 
symmetry, but in such a case we shall have to discuss 
the collision process in more detail. In case of a bound- 
ary which does not have a mirage-invariant shape we 
had better resort to a long-range balance, or cyclic 
balance, on which we shall touch in the last section. 

The above derivation of the classical theorem of 
detailed balance suggests an immediate generalization. 
We can always establish a statistical balance of type in 
Eq. (5.2), by assuming as many chaos hypotheses as we 
need quantities of minus class to describe the state. 
This general rule will be called the “principle of aver- 
aged balance.” 

In classical physics as well as in quantum physics, we 
can limit the number of necessary minus quantities to a 
certain degree by suitably choosing the employed 
variables. For instance, we can use the electromagnetic 
potentials (plus quantities) instead of electromagnetic 
field strengths (minus quantities) ; we can even describe 
the electromagnetic field by its source alone. In more 
elementary examples, “force” can be replaced by 
“potential.” 18 

In quantum physics, there is further a simplifying 
situation due to the existence of noncommuting quan- 
tities. If we characterize particles by their linear 
momenta (plus), we must disregard their positions 
(minus) and angular momenta (minus). Hence, for the 
quantum mechanics of spinless particles, we have 
the detailed balance in the strictest sense (5.1) if we 
adopt the plane wave description; the total Hamiltonian 
is assumed to satisfy inversibility. 

In case of the particles with spin, we can only derive 
the statistical balance of Eq. (5.2) by one chaos 
hypothesis regarding spin. This can be done by de- 
scribing the electromagnetic field by its sources and 
describing these charged particles by linear momenta. 
From Table V, we see that the remaining physical 
quantities of the minus class are only spin and electro- 
magnetic moments. Therefore, averaging over spin 
directions will yield the averaged balance of Eq. (5.2). 
This is what Heitler® calls the principte of semidetailed 


16 R. C. Tolman’s textbook (quoted in reference 6), p. 119. 
Heitler’s lecture note (quoted in geference 5). J. M. Blatt and 
V. F. Weiskopf, Theoretical Nuclear Physics (John Wiley and 
Sons, Inc., New York, 1952), p. 530. 

16 E, H. Kennard, Kinetic Theory of Gases (McGraw-Hill Book 
Company, Inc., New York, 1938), p. 57. 

u We should keep in mind that in some cases the chaos hy- 

othesis regarding one variable automatically entails the chaps 
EOE regarding another variable. For instance, the chaos 
regarding position will result in chaos regarding angular momen- 
tum. The same is true for spin and magnetic moment. ° 7 + 

18 Thus, this situation is taken care of by the positional chaos; 
“force” changes its sign, but “attraction” and “repulsion” are 
invariant notions. 3 aa s 
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balance and it is a special case of our principle of 
averaged balance. 

The theorem of semidetailed balance is obviously 
very convenient when applied to the usual perturbation 
theory in which plane waves are taken as unperturbed 
states, i.e., eigenstates of the noninteracting Hamil- 
tonian. Moreover. the spin iWtysyav (a=1, 2, 3) in the 
direction of propagation of the plane wave and the 
magnetic moment ityayy(c,b=1, 2, 3) in a direction 
perpendicular to this direction of propagation commute 
with the noninteracting Hamiltonian in the Dirac 
theory as well as with the momentum operator. The 
characterization cf a state by the momentum and one 
of these “internal” variables is therefore suitable for 
discussions in the perturbation theory. But this is only 
one of the possible modes of description. For instance, 
the total angular momentum, which is a minus variable, 
is also a constant of motion of the noninteracting 
Hamiltonian. It should also be noted that the inverted 
state of a diverging wave is a converging wave, there- 
fore it is also outside the scope of this theorem. 

There is no reason to limit the initial and final states 
to the eigenstates of the noninteracting Hamiltonian. 
If, for instance, the probability of existence of particles 
is more or less loczlized (wave packet), position variables 
(minus) will intervene in the description of state, thus 
we shall again need a hypothesis of chaos regarding 
these “external”? variables. On the other hand, we could 
avoid internal variables of the minus group by using, 
for instance, the spin-orbit interaction energy instead 
of spin (or magnetic moment) itself. If we consider 
two charged particles in interaction, the magnetic 
moment is reversed in the inverted state; however, the 
magnetic field strength, due to the other particle, also 
changes its sign in the inverted state at the point, so 
that the magnetic interaction will remain unchanged. 
Since this description involves more or less localized 
particles, we shall have to use a chaos hypothesis for 
“external” variables. We are faced with a kind of com- 
plementarity; to avoid one kind of chaos, we have to 
introduce another kind of chaos. It should also be 
recalled that there are also internal variables of the 
plus group (see Table VI). 

We thus see that we have to be very cautious in 
application of Heitler-Coester’s principle of semi- 
detailed balance. A general rule, which does not fail, 
is that we should-first determine the classes (plus or 
A of all the variables used in the description of the 
system, and, if we want to use a theorem of the type of 


- Eq. (3.2), we should assúme chaos hypotheses for all 


the minus class variables (principle of averaged balance). 
We sometimes encounter, in the existing literature, 
statements to the effect that every time particles have 
ar-internal freedom, we have to perform an averaging 
with régard to this freedom. But, the coincidence of 
external?and internal variables with plus and minus 
variables in the case of He'tler-Coester’s theorem is 
only accidental. _ Py Pa 
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We have used the words averaging and chaos in the 
above exposition without precise definitions. Now we 
should like to provide such definitions. We take a 
mixture M, of state S and its inverted state S; with 
equal weights. Then the average probability (average 
over S and Sz) of transition of a system in ‘this mixture 
to S’ or Sy’ is given by 


Wi=3{P(SS") + P(SS1! 
+P(S72S’))+P(Sr>Sr)} , (5.4) 


Due to inversibility in Eq. (5.3), this probability is 
equal to the average transition probability of a system 
in a mixture M2 of S’ and Sr’ with equal weights to S 
or Sr. 


Wa=3{P(S’S)+ P(S’>S7) 
+P(S1'S)+ P(S1'—S7)} =W? (5.5) 


is the exact meaning of Eq. (5.2). 

It should not be understood that this general con- 
sideration provides any justification for the chaos 
hypothesis (equal weight of a state and its invertéd 
state). In fact, if a system in mixture M, had the same 
average transition probability to S’ and tg Sy’, i.e., if 


2{P(SS')+ P(S1>S)} 


=3{P(SS7')+P(Sr-Sr)} (5.6) 


were true, then a “chaos” would remain a “chaos” after 
transition. By no means is Eq. (5.6) guaranteed by 
inversibility. One general way of justifying the chaos 
hypothesis is to take a mixture of all the possible states 
with equal weight; then we can expect that a system in 
this general mixture has an equal average transition 
probability to S’ and to Sz’. In this case, the equal 
weight of a state and its inverted state holds as well 
before as after transition. This property implies that 
2: Pi;=>): Pix where S; and Sy are inverted states: of 
each other. This condition is guaranteed by tht inverse 
normalization which is always true whenever there is 
inversibility or reversibility, as we shall see in the next 
section. : Taek 

As far as the theorem of averaged balance is con- 
cerned, averaging is supposed to be made over each 
pair of states S and Sz, or {Q} and {prQ}. But, in case 
of space coordinates, pairing of x and —x with regard 
to a particular coordinate system does not have an 
invariant meaning for translation; averaging all the 
values of coordinates is usually required. 

Our derivation of the theorem of averaged balance 
using inversibility shows that we can also introduce 4 
second theorem of averaged balance by considering 
reversed states instead of -inverted states. Thus, We 
can obtain a type of relation, as Eq. (5.2), by averaging, 
or introducing chaos hypotheses, with regard io all the 
variables involved which have pr=—1 (instead of 
pr=—1). : R 


` 


_CÇ-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


ba 


\ 


RRE i 


ba 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


SYMMETRY OF 


6. BILATERAL NORMALIZATION OF TRANSITION 
: PROBABILITIES 


First we shall consider the condition of bilateral nor- 
malization of microscopic transition probabilities P, 
and then consider the same condition in regard to statis- 
tical transition probabilities W. 

For a giver maximally defined state S, we think’ of a 
series of maximally defined states S; (i=1, 2, 3, ---) 
such that S is one of them, say, S;. Let us take the 
physical ‘system in any condition, and represent the 
probability of finding it in state S; by p;. If 

Pil (6.1) 
we speak of a “complete” set of states. While in quan- 
tum physics there are more than one such set, in clas- 
sical physics there is only one complete set. In classical 
physics, the number of possible values of 7 are usually 
continuously infinite, and even multi-dimensional. In 
these variables (i.e., in the phase space) p; will be a kind 
of 6 function. Thus Eq. (6.1) should be understood as 
asschematic simplification of the situation. 

Take two such complete sets (which may be the same 
or different) of states S; and S; and consider the transi- 
tion probability: 

P(SS;3 1). (6.2) 


The definition of P (Sec. 4) results, in virtue of (6.1), 
in the norfnalization regarding the final states: 


z £ P(S—S;; )=0. (6.3) 
7 

However, the normalization with regard to the initial 

states: 

o > P(SS;;)=1 (6.4) 
is not guaranteed by the definition. We shall see that 
if ¢he physical system obeys reversibility or inversi- 
bility, irrverse normalization, Eq. (6.4), follows from 
the first normalization, Eq. (6.3). In the discussion 
which immediately follows, we shall speak only of 
reversibility, but the word “inversibility” can always be 
substituted for the word “reversibility.” 

The basis of the demonstration is the fact that if S 
belongs to a complete set S;, its reversed state Sp also 
belongs to set S;. Because of reversibility, if SS” is a 
solution of the dynamical law, Sr’—Sz is also a solu- 
tion, implying that if S is a possible state, Sp is also a 
possible state. In classical physics, there is only one 
complete set of states, therefore, this means that if S 
is a member of the set, Sp is also its member. 

In quantum physics, a pure state S={Q} can be 
considered as an eigenstate of a family of Hermitian 
operators.which, though complicated at times, represent 
some physical quantities belonging to one or other of 


the four kinds. Therefore, each of them has a definite- 


sign of pr. If the reversed state is a possible state (which 
is the case her), this state must also be an eigenstate 
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of this family of physical quantities, for it is charac- 
terized by {prO} of the same physical quantities. 
Hence, Sz belongs to-the same complete set as S. We 
can thus conclude that « complete set of states is 
composed of self-reversed states and pairs of mutually 
reversed states. 

This being the case, the summation over all che S;’s 
and the summation over all the S;r’s must mean the 
same operation. First, by reversibility, 


E PSS); )=L PSS; b) (6.5) 
which, due to the above remarks, equals 
(6.6) 


Z P(SjrS:5 i), 


which is on account of the first normalization condition 
(6.3) equal to unity. Hence Eq. (6.4), Q.E.D. 

It is true that the physical meaning underlying the 
unitarity of transformation matrices in quantum 
physics is connected with the “completeness” of repre- 
sentation. But here we have derived the bilateral 
normalization without utilizing specifically quantum- 
mechanical relations. 

We now pass to the bilateral normalization with 
regard to the statistical transition probabilities W. 
It is usually the case that states S defined only statis- 
tically exhibit also “completeness.” In other words, we 
can consider any nonmaximally defined state S as a 
member of a series of nonmaximally defined states S; 
(i=1, 2, 3, ---) such that the probability w; of finding 
the system in a state S; of the series obey a normaliza- 
tion condition: 

DE w:=1. 


i 


(6.7) 


For example, in classical physics, after averaging over 
all space-coordinates, the state of each molecule of a 
gas is characterized only by velocities v. Then the 
velocity space can be divided into small volume ele- 
ments, which certainly have the property of complete- 
ness in the sense of Eq. (6.7). We can also apply this 
consideration to a pair of molecules, as is usually done 
in the discussion of collision processes. In case of Heitler- 
Coester’s mode of description, after the averaging over 
the spin directions, the possible values of the momenta 
will constitute a complete set. a. - 
Now, if such a set of nonmaximally defined states 1 
so chosen that the theorem of averaged balance in Ea 
(5.2) is true, then the inverse normalization, `= 


E W(SS;;)=1, (6.8) 
follows immediately from the first normalization, 


E W(SS,3)=1, * (69) 


"es 


which is a consequence of fq. (6.7). The reversibility 

relation, Eq. (6.5), ot the correspendin; g inversibility 
x a 

4 . 5 - 
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relation, cannot be adapted in the same form to the 
W’s for arbitrarily chosen sets of nonmaximally defined 
states. f 

It should be recalled that even in classical physics, 
the w’s and JV’s are not limited to zero and unity, which 
is the case for the p’s and P’s in this form of physics. 


7. ERGODIC PROPERTY OF TRANSITION 
PROBABILITIES 


It is well known that the Markoff chain exhibits a 
particular property which may be called “ergodic.” 
The usual exposition of this subject is too mathematical 
in nature and often overly simplified by the assumption 
of detailed balance: P;;= P;;. 

This section intends to point out that the bilateral 
normalization of transition probabilities is just neces- 
sary and sufficient to derive the “ergodicity” of the 
Markoff chain; the section also hopes to clarify in what 
sense we can speak of an ergodic theorem. In the fol- 
lowing, we shall discuss the subject in terms of the 
microscopic, or dynamical probabilities P, but we shall 
soon find that the main body of argument also applies 
to the statistical probabilities W7. 

We take a complete set © of maximally defined states 
Sa and the indices 7, j, etc., of the S’s are supposed 
always to refer to this same set. We limit ourselves to 
the cases where there are a finite number of states in 
the set: 


t=1,2,3,---,7. (7.1) 
In classical physics, there are usually a continuously 
infinite number of the S’s. In this case, the entire 
argument that follows offers only a mathematical model 
which may approximate the real physical situation. In 
quantum physics, Eq. (7.1) does not imply a real 
limitation, since we need actually consider only a 
limited region of energy values (microcanonical shell) 
and we can also assume the space domain to be limited. 
Then the number of quantum states will become finite. 

Among these states S; (i=1, 2, ---, 7), some will be 
disconnected from one another due to various conser- 
vation laws. For instance, two states belonging to dif- 
ferent values of the total angular momentum will allow 
no transitions from one to the other. Thus, the entire 
set of S; will be divided into subsets, in each of which 
the states are “connected.” Hereinafter such a subset 
will be called a “subshell.” A more rigorous definition 
of subshells will be given soon. 

Thé theory of Markoff’chains pertains to the “‘re- 
peated” transition probability P;;™ which is defined by 

Pj= PP =D PPO (7.2) 

A k k 
where >, a 
a P= P= P(S-S;; 7). 


a a 


(7.3) 


classical physics, we have Pij=1 or 0, therefore 
m= for 0.°In quartum-vhysics, 0< P;; <1. 
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In both cases, we have 


E Pyma 1, (7.4) 


j 
which follows from Eq. (7.2) in virtue of the first 
normalization: r 


D, Py=1. (7.5) 
i 
In the same manner, the inverse normalization: 
È Py=1 TUA 
will result in : 
2 Pay =1. (7.7) 


The classical physics is characterized by the fact that 
Pi =P(SS;; nr), (7.8) 


which means that the physical system is not disturbed 
by observation. In quantum physics this is not the case, 
in general, unless the operators defining S; commute 
with the exact Hamiltonian. In quantum physics, tke 
repeated transition probability in Eq. (7.2) acquires a 
physical meaning only on assumption that the system 
is observed every r seconds with the operators defining 
S: In other words, starting with a pure state S;, we 
observe the system after r seconds, and the result is 
statistically represented by a mixture (ensemble or 
density matrix) composed of various S; with the weight 
P;;. By repeating this process at each interval of 7 
seconds, we obtain after 77 seconds a mixture of S,’s 
with the respective weights P;;™, This means that 
although we start with the microscopic transition prob- 
abilities P;;, we have to interpret the repeated transition 
probabilities P;;~™ in quantum physics in terms of 
“mixtures.” The ergodic theorem discussed in this 
section thus refers to a chain of repeated observations 
and should not be confused with the more important 
ergodic theorem!® which refers to two observations, one 
at the initial instant and the other at the final instant. 

Closely related to P;;, and physicallysand mathe- 
matically more significant than these are the quantities: 


n 


25%= 5 Pym /n. 


(7.9) 

m=l1 f 
They are physically important since @;; represents 
the “time average” of transition from S; to S; during 
the time 77 seconds, while P;;" represents the transi- 
tion probability at the instant uz seconds after the 
initial instant. Indeed, the main concept in an ergodic 
theorem, in physics, is a comparison of “average in 
time” with “average in microcanonical ensemble.” 
Furthermore they are mathematically useful since 2 
.° J. von Neumann, Z. Physik 57, 30 (1929); W. Pavli and M. 
Fierz, Z. Physik 106, 572 (1937). For an interpretation of Neu- 
mann’s ergodic theorem in terms of initial and final observations, 


see an article by S. Watanabe in the mono h, Louis de Broglie 
(Albin Michel, Paris, 1952), p. 385. ae \ 
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has a better convergence than P™ for nx. Although 
P® is zero or unity in classical physics, 2 is not 
necessarily so: 0<0,,; <1, 

From the first normalization, Eq. (7.5), folloy 


WS: 
4 De Qj = 1, (7.10) 
9. 5 7 
and from the‘inverse normalization, Eq. (7.6), 
E I =1, (7.11) 
t 


Now the “ergodic theorem” which we are going to 
prove can be enunciated as follows: The time average 
Q; ™ (n—=>%) of the probability of finding a system, 
which started from any S; in a state S; of the same 
subshell is equal to the a priori probability of a state 
in the subshell, i.e., equal to 1/s if s is the number, of 
states in the subshell. This statement is certainly a 
faithful adaptation of the general ergodic theorem to 
our simplified case, since the microcanonical ensemble 
represents a mixture of‘all the states on an energy shell 
with equal ‘weights. The main purpose of this section 
is to show that the inverse normalization is the neces- 
sary and sufficient condition for this simplified ergodic 
theorem, the first normalization being always assumed 
for the Markoff chain. 

First we shall enumerate, without proofs, some of the 
elementary theorems and definitions regarding the 
Markoff chain which can be found in any exposition of 
the subject. We shall denote by © the original 
complete set of states in Eq. (7.1). Only the first 
normalization is assumed in the following theorems. 

Theorem I. The sequence 


QUD, Qy, ---, (Si SjeS) (7.12) 
converges to 2 limit: 
à lim Qy ™ = Q2. (7.13) 
of course, we have 
(7.14) 


osa Sl, Sarl 
- 


In the set © there can be some states S; such that the 
average transition probabilities 2;,° to them vanish for 
any arbitrary initial state S;. 

Definition I. The “vanishing” part B of © is the set 
ofall states S; such that 


0;;°=0, (SiS, SB). (7.15) 


For the rest of the original set: G’= G—B (which can 
easily be shown not to be empty), we have the following 
theorem: 


2 Theorems I-IV are given in Husimi’s textbook (Husimi, 
reference 3, p. 280), but their physical applications in physics, 
including Theorems V and VI, are not given there. Husimi’s ex- 
position ie based on K. Yosida and S. Kakutani, Topological 
Mathematics (Iwanami, Tokyo, 1939, in Japanese), Vol. 2, p. 20. 
See also, W. Feller, Introduction to Probability Theory and is 
Application Goha Wiley, New York, 1950), p. 307 f. 
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Theorem IT. If 2;7°>0, then 


Q> 0, (Sa S5’). (7.16) 
In other words, if 2;=0, then 
R= 0, (Si, SiS’). (7.17) 


Using Theorem II, we can divide, S’ into subsets 
(“subshells’”) such that 2;;% is zero if S;and S; belong 
to different subshells, and Q,,°>0 for S; and S; 
belonging to the same subshell. 

Definition IT. 


S'=€,+Gr+---+G,, (7.18) 
R: =0, (SiG, Sj), (7.19) 
2;7>0, (SopS). (7.20) 


A subshell Œ is disconnected from the vanishing part B 

and from another subshell ©’ not only in terms of {2;,? 

[see Eqs. (7.15) and (7.19) ] but also in terms of Pij. 
Theorem III. 


Puj=0, (SG, SB) | 
Pa=0, (S€, SE) +. (7.20) 
Pij=0, (SieG', Sj&) 


Obviously the inverse of this theorem is not true. It can 
happen that P;,=0 even for S; and S; belonging to the 
same subshell, i.e., in spite of 2,,7>0. 

We now pass to study the properties of those ®, ;” 
whose initial and final states belong to the same 
subshell Œ consisting of s states S;: 


i=1, 2,3) 2-8 (7.22) 
In virtue of Eq. (7.21), we can derive from Eq. (7.4) 


Z Py=1, (Si, SjeG). 


j=l 


(7.23) 
Similarly, because of (7.15) and (7.19), we have 


E =l, (Sz E) C728) 

The relations (7.23) and (7.24) show that the first 

normalization Eqs. (7.4), (7.14), remains unchanged 

when the initial and final states are limited to a subshell. 
We are now prepared to introduce an important 

theorem. ah te 
Theorem IV. O57 (Sis SjeS) is independent® of the — $; 

initial state S;: AEE A 

Q= “(Si, SE) 


Of course, we have, due to Eq. (7.24), 


= 


Z Qg=1. 

j=l 
The discussion up to this po 
normalization. We now in 


Èi 
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the inverse normalization. We shall consider the con- 
dition, 


> P= 1, OR SjeS), 


i=l 


(7.27) 


which exhibits a symmetry to Eq. (7.23). Because of 
Eq. (7.21), the ‘summation with regard to S+ in 


[XK SS PP, (Si, SieE; SeS), 
k 


actually extends only over S,¢€. Hence 


j 2 P= 1, O3 Sj), 


(7.28) 
and, by Eqs. (7.9) and (7.13), also 
SS Q= 0,;°=1, (Sa, S;e®). (7.29) 


We now propose to show that the inverse normaliza- 
tion as in Eq. (7.27) is equivalent to the condition that 
Q” is not only independent of the initial state S; 
(Theorem IV) but also independent of the final state S;. 
This last condition can be written, in view of (7.26), as 


Q.j°=1/s, (Si, SiG). (7.30) 
Theorem V. The necessary and sufficient condition for 
Eq. (7.30) is Eq. (7.27). 
Proof: From the definition of 2;;° in Eqs. (7.9) and 
(7.13), we can easily obtain 


O° =P) Ua Pra (Si, SC; SueeS). (7.31) 
k 


Because of Eqs. (7.15) and (7.19), the summation over 
Sr, in reality, extends only over S+e®. If Eq. (7.30) is 
the case, Eq. (7.31) becomes 


il il 2 
Sa Paz, (Sx, SjelS) (7.32) 
S Skt 

showing that Eq. (7.27) is a necessary condition for Eq. 
(7.30). 

Next we shall show that Eq. (7.27) is also a suf- 
ficient condition. 

If Eq. (7.27) is true, then we have Eq. (7.29), which 
in view of Eq. (7.25) means 


Z UP ==, (Si SG) 
=I s 
a J 


(1.33) 


3 


Q.E.D. (1.34) 


, , oe S 


Thebrem ¥ is obviously equivalent to the ergodic 
theorenrwe stated at the beginning of this section. 
£ It should be noted that our inverse normalization in 


Eq. (7.27) is not necessarily equivalent to the inverse 
a s 2 oO = 
Ae 


bac , Ns Ki In Public Domain. Gurukul Kangri Collection, Haridwar 


normalization with regard to the entire set: 


D Pw=1, (Si, SjeS). 


(7.38) 


a 


It is, however, easy to see that if Eq. (7.35) is true then 
Eq. (7.27) is also true, and that if Eq. (7.27) is true 
and if the entire set © has no vanishing part %, then 
Eq. (7.35) is true. Actually, in Eq. (7.27), S; and S; 
can be extended, without any additional assumption, 


~ 


to all the states belonging to 5'= S- Y: 


D Pj= 1, (Si, SjeS'), (7.36) 


on account of Eq. (7.21). The summation in Eq. (7.36) 
extends to all the states in ©’, Equation (7.36) is 
equivalent to Eq. (7.27). 

Now, if Eq. (7.35) is true, we shall have Eq. (7.11), 
with S; S;«©. But this contradicts the existence of a 
vanishing part in Eq. (7.15). Hence, if Eq. (7.35) is 
true, then G= ©’, and Eqs. (7.36) and (7.27) ensue. 
On the other hand, if we have Eq. (7.36) as a given 
premise, then the conclusion of Eq. (7.35) can be drawn 
only with the help of an additional condition G=G’. 

From whatever state one may start, ultimately there 
will be a vanishing probability of having the system in 
a state belonging to the vanishing part as in Eq. (7.15). 
Furthermore starting from a state in any one of the 
Œs, we have a vanishing single transition probability 
Pi; landing in a state in the vanishing part as in Eq. 
(7.21). In physical problems, an initial state is, after 
all, the final state of another chain of observation. We 
may justifiably exclude states of the vanishing part 
also as initial states. In any event, symmetry of the 
physical laws in time (reversibility or inversibility) 
results in the inverse normalization, Eq. (7.35), which 
implies nonexistence of the vanishing part. 

The ergodic theorem is sometimes expressed as a 
statement regarding the eventual return to the initial 
state. For actual physical problems in classical physics, 
a rigorous return to the initial state is not to be expected, 
but the return to a state infinitely close to the original 
state (the so-called quasi-ergodic theorem) is sufficient. 
However, in our simplified theory, a rigorous return to 
the initial state can be concluded in classical physics. 

Theorem VI. If the values of P;; are limited to zero 
and unity, then there exists a value of 1 such that 


P;,M= il, (7.37) 
except for S; belonging to the vanishing part: 
Proof: Taking i= j in (7.20), we have 
2:°>0 (7.38) 


Hence, for large enough values of n, we have 
2 (n>n). OD 


. 2;;5%>0, 
Comparing Eq. (7.39) with Eq. (7.9), we see that there 


ts 


ge 


be 


ing creme 
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ə must be a value of (indeed there must be an infinite 


number of such ws) for which 


> Pii™>0, (7.40) 


If Pij is zerọ or unity as we assume, then P;;™ is also 
limited t@ the values zero and unity. Then Eq. (7.40) 
means that there is a value of » for which 

P= íl- (7.41) 


Taking the smallest value of such ws, we can further 


e. . . N 
infer, in virtue of Eq. (7.2), 


1= PaM Pum Puen. 7 (7.42) 


showing a cyclic return to the initial state.2! This repre- 
sents the fundamental fact upon which Boltzmann’s 
theorem, which Tolman calls “cycle of corresponding 
collisions,” is based.° Our proof of the cyclic balance as 
seen in Eq. (7.42) is more general than Tolman’s argu- 
ment, since he (1) assumes without verification the 
ergodic nature of physical phenomena, (2) utilizes, 
throughout the chaos, hypothesis regarding the position 
of molecules, and (3) limits his discussion to collision 
processes. Admittedly, our proof is conditioned by the 
assumption That the number of possible states is finite. 

It is not surprising that the proof of Theorem VI does 
not utilize the inverse-normalization, since, in classical 
physics, exclusion of vanishing part immediately results 
in inverse’ normalization. Indeed, states belonging to 
S= G— are connected in this case by a one-to-one 
correspondence.” 

The theorem of cyclic balance, in Eq. (7.42), can be 
considered as a generalization of the theorem of detailed 
balance, which is a special case of Eq. (7.42) for n=2. 
Indeed, from a 

G] P;=P ji, (7.43) 
follows i 
(j 


Px = PaPri =X (Pa) (7.44) 
k k 


In classical Physics, only one of Pa (k=1, 2, ---) is 


21 Taking the smallest common multiple of the ’s for various 2, 
(7.37) and (7.42) will become valid for all the7’s. — a: 

2 This means that P;; is actually a permutation, and it is 
obvious that a finite number of repeated permutations results in 
the identity transformation. The author’s thanks are due Pro- 
fessor S. Kakutani for reading this section before publication 
and for pointing out various interesting facts pertinent to the 
subject matter, including the point mentioned in this footnote. 


CC-O. In Public Domain. Gurukul Kangri Collection, Hari 


PART I j 39 


different from zero and equal to unity. Thence, 


Py = (7.45) 
In quantum physics also, we have Eq. (7.40), but it 
is not of particular interest. Probably another generali- 
zation of Eq. (7.43) may be more useful. 
In classical as well as quantum physics, we have 


UP =S, (7.46) 


which is an obvious consequence of Theorem V. This 
means that the time average of transition probability 
from S; to S; is equal to the time average of transition 
probability from S; to S;. This is also equal to the time 
average of probability of return to the original state: 
Q: or Qj”. The theorem presented in Eq. (7.46) may 
be called the theorem of “long-range balance.” 

In the entire, foregoing discussions, we used chiefly 
0;;™ instead of P;;™, but it is evident that if P,,“ 
(n—~) has a limit, this limit is the same as 2;,;7. 

It should also be noticed that we can apply all the 
foregoing discussions to W(.S;—S;; i), if Eq. (6.8) holds. 
Even in classical physics this quantity is not limited to 
the values zero and unity. Therefore, what has been 
stated above, with regard to quantum theoretical P’s 
applies, mulalis mutandis, to the W’s. 

For applications of our results to quantum theo- 
retical problems the following remarks should be kept 
in mind, as should the comment with regard to chains 
of repeated observations in connection with Eq. (7.2). 
If the S’s are defined by operators which commute with 
the exact total Hamiltonian, then P,;=6,;, and the 
subshell will reduce to one quantum state. In this case, 
the entire argument loses its physical interest. Therefore, 
the essential point in the discussion of ergodicity lies in 
the tacit assumption that the operators defining the 
states S; do not commute with the exact, total Hamil- 
tonian. In fact, this assumption is adopted, explicitly 
or implicitly, in any version of H-theorem or ergodic 
theorem in quantum physics." In applications to ther- 
modynamics, it is necessary to introduce the idea of 
macroscopic cells on the macroscopically defined energy 
shell.” Our derivation, which does not make use of this 
concept, should therefore be considered as a simplified 
model which serves only to clarify the mathematical 
gist underlying more elaborate formulations. 

The author would like to thank Dr.,Cecile M. DeWitt 
whose instructive seminar talk partly motivated the 
author to undertake this work. 
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field theory. 


Symmetry properties of physical laws with respect to space-inversion, time-reversal and charge-con- 
jugation are investigated in detail in the framework of the quantized field theory. In most cases, the require- 
ments of invariance for these transformations are automatically satisfied; in a few other cases they can 
impose certain conditions on the ways in which different types of interaction are to be mixed. The ideas of 
space-parity, charge-parity and superselection rules are coherently derived from the general formulation of 


1. INTRODUCTION 


HE aim of this paper is to give a systematic 
exposition of the q-number theory of space- 
inversion (mirage), time-reversal (reversion), and 
charge conjugation. As such, the present article is a 
review report on this field of problems which, in recent 
years, has attracted increased attention from theo- 
reticians. This paper is also the first attempt of exposi- 
tion which, from the outset, is based on full recognition 
of the fact that Hermitian or unitary operators exist 
which, without being c-numbers, can commute with 
all the known physical quantities. The existence of such 
operators was previously pointed out by the author, and 
the operator of double reversion was shown to be an 
example of such operators. This fact was used by Wick, 
Sightman, and Wigner to introduce what they called 
the superselection rule.’ In this paper, the superselection 
rule is not only given thorough investigation and 
natural generalization, but its basic idea is invoked at 
various stages of the paper. Although the idea of super- 
selection is indispensable for a clear understanding of 
symmetry problems, and its discovery originated from 
a consideration of symmetry problems, the super- 
selection rule itself can be proven without the help of 
space-time or charge symmetry. 
In Part I, reversibility (invariance for reversion), 
4 reflectibility (invariance for mirage), and inversibility 
(invariance for space-and-time inversion) have been 
formally defined znd assumed to hold whenever neces- 
sary. This Part II provides the proofs of these invariance 
principles, together with the proof of charge-conjuga- 
* tion-invariance. In some’particular cases, these inva- 
,  riance principles are conversely used to determine the 
"correct prescription for mixing interaction types. Com- 
binations of two kinds of symmetry, such as reversion 
aad charge-conjugation, are also given due consideration. 
he so-called “charge-symmetry,” which consists of 


= 

. Watanabe, Phys. Rev. 81, 1008 (1951). This paper will 
r. be referred to as (R). Sze, in particular, the discussion 
ection with Eqs. (4.25) and (8.15) of (R). See also, Wick, 
ghtman, and Wigner, Phy”. Pev. $8, 101 (1952). 


2 CC-0. In Public Domain. Akui Kangri Collection, Ħaridwar 
r 


interchange of neutrons with protons is not considered 
in this paper. 

In Sec. 2 of this part, the spinor is introduced ac- 
cording to Cartan’s method, since it is certainly the best 
suited for the mathematical analysis of the congruént 
group of the Minkowski space. This short introduction 
regarding the spinor is intended to fW}, theoretical 
lacunae left in the current expositions on spinors. The 
ten different “kinds” of spinors previously introduced 
by the author are also explained systematically. It will 
presently become clear that this classification of spinors 
is no longer of great importance in the field theoretical 
application of spinors. This is essentially due to the 
fact that the physical content of the theory is left 
unchanged by a gauge transformation of all the spinors 
involved. Specification of the “kind” of a spinor has 
sometimes been associated with the law of conservation 
of heavy particles.2 In this paper, however, we take 
the viewpoint that the “kind” of a spinor does not 
have much physical importance and that the physical 
Jaws such as conservation of heavy particles should 
rather be associated with the superselection rules. 

As a result of our conformity with Cartan’s method, 
our formulas will be written with the’ help of the 
E-matrices instead of the customary y-matrices, which 
may be a little obnoxious to some readers. However, 
is it aesthetically more acceptable, to have E,y and 
E,E,E3y for time-reversal and 3-dimensional space- 
inversion, respectively, or to have yryzysv and ys for 
these transformations? In any event, a dictionary of 
translation from one language to another is attached 
in Sec. 2. 

In Sec. 3, the unitary operators R, M, I, and C, 
corresponding ‘to reversion, mirage, space-and-time 
inversion, and charge-conjugation, are introduced and 
their properties are investigated in great detail. Section 
4 shows how the c-number field theory has the correct 


> , 
_| As many pertinent references as are known to the author are 
Cited in each individual section in the following dealing with a 


specific problem. However no bibliographical completeness has 
been contrived. x 
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behavior for space-mirage but the wrong behavior for 
time-reversion, and how the q-number theory can 
correct the reversion property retaining the mirage 
psoperty, and at the same time correctly formulate 
charge-conjugation. 

Sections 5, 6, 7, and 8 demonstrate the existence of 
the unitary operators for the various transformations, 
thus proving the invariance of the theory for these 
transformations. These operators will be given explicit 
operatorial expressions. Section 9 discusses the trans- 
formation properties of various tensorial quantities 
formed with spinors, thus substantiating some of the 
results anticipated, without proof, in Part I. Sections 
10 and 11 deal with the various types of nucleon-pion 
interaction and nucleon-lepton interaction. It will 
become clear through these discussions that, except in 
the case of a photon or a neutral pion, it is meaningless 
to speak of space-parity or of charge parity of a single 
elementary«particle. 

For the nucleon-pion interaction, the mixture of the 
scalar and,vector types and the mixture of the pseudo- 
vector and pseudotensor types are forbidden by both 
feversibility and charge-conjugation-invariance. The 
mixture of regular tensorial quantities and pseudo- 
tensorial @uantities is also refuted from a general 
ground. For: the nucleon-lepton interaction, there are 
two general families of types. The first family is charac- 
terized by coupling of the same tensorial or pseudo- 
tensorial quantities originating from nucleons and from 
leptons. The other family is characterized by the 
coupling of tensorial quantities of nucleons with pseudo- 
tensorial quantities of leptons, and vice versa.’ Both 
families are equally justifiable, but it is forbidden to mix 
them. Moreover, charge-conjugation and reversibility 
require certain phase-factor relations among the inter- 
action constants in each of the families. 

Sections 5-9 will show how the concepts of reversion, 
mirage, and charge-conjugation are useful in deter- 
mining the eigenfunctions of each field and also in 
establishing relationship among eigenfunctions. The 
angular momentum representations for photons (Sec. 6) 


` and for ele¢trons (Sec. 8) may prove to be instrumental 


for problems other than the formal ones of symmetry. 
As an illustration of the selection rules originating 
from space-symmetry and charge-conjugation, Sec. 13 
will discuss the two-photon decay of a positronium and 
of a neutral meson. 
` It was pointed out in a previous paper by the author 
(paper R in reference 1) that reversibility as such does 
not determine the commutation rules of charged fields, 
but, rather, it is charge-conjugation that does this deter- 
mination. It is not intended to emphasize this point any 
more in this paper, but Secs. 4 and 5 will give a sketch 
of the underlying facts. If one had the impression that 
reversibility had the power of determining the statistics- 
type, it-is only because the combination of what we call 
reversion in this paper and charge conjugation was 
used for time reversal. This fact should not be inter- 
t 


preted as meaning that one view point is “correct” and 
the other is “incorrect.” What we should call time- 
reversal is more or less a matter of taste.* The important 
fact is that we have as many laws of invariance as we 
have modes of transformation allowed by the theory, 
although some of the invariance laws can be derived 
from other invariance laws. It is also important to 
recognize this fact in regard to space parity. The author 
showed in a separate paper that we can define many 
different mirage operators leading to many different 
parity values of a given state. There is no room for 
argument as regards which one of the different defini- 
tions is the “correct” one. Even after we have imposed 
upon the mirage operator M the conditions that M?=1 
and MW yuc=Wyac, ‘there still remains ambiguity of 
parity value of a state in which the number of charged 
particles is odd. [See Eq. (8.28). ] Any conclusion that 
can be drawn from either value of parity is “correct.” 


2. MATHEMATICAL PRELIMINARIES 
A. Rotations and Reflections 


The mathematical entity now called spinor was first 
introduced by Elie Cartan? in 1913, i.e., some fifteen 
years before the discovery of the relativistic wave 
equation of the electron. Cartan’s method is based on 
the fact that any congruent transformation can be con- 
sidered as a product of simple reflections, although the 
way of this decomposition involves certain arbitrariness. 
For this reason, Cartan’s method is best suited for dis- 
cussions pertaining to inversions, while it brings about 
the same results pertaining to rotations as in the ordinary 
method. 

By the simple reflection with respect to a plane 
passing through the origin and having the normal vector 
a" (u=1, 2, 3, 0), an arbitrary, regular vector x# is 
transformed into 


ac'# = x — 2a” (a,x?) /(a,a*). (2.1) 


The metric tensor g,, of the Minkowsky space is given 
in (1.2.2)4 We assume hereafter that the normal 
vector a# is always normalized to 1 or — 1, and we speak 
of a space-like reflection or time-like reflection according 


as 
G,0=+1 or a=]. (2.2) 


The sign of a* itself is arbitrary, hut the sign of a,a* 
has a vital meaning. 
Let the number of space-like reflections, the number 


of time-like reflections, andsthe total number ofreflec-_ 


tions of both kinds in a decomposition of a given con- 
gruent transformation be denoted, respectively, by vs 


va and v. The parities of these numbers are determined 


* The definition used in this paper is more convenient only in 
the sense that it has a direct correspondence with the 
reversal in classical physics. ial e 

3 É, Cartan, Bull. Soc. Math. de France, 41, 53 £1913), See al 
É. Cartan, Le théorie des spixcurs (Hermann et Cie, Paris, 19. 

+ (1.2.2) mean formula (2.2) ôf Part I of this paper. 
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when the congruent transformation is given, and we 
have 
T= (= 1)”, ov (—1)", o=(— D (2.3) 


where oa ca and o are given in (I.2.8), (1.2.9), and 
(1.2.10). It can be shown that, by a suitable choice of 
simple reflections, v can be made equal to or less than 4. 
The general cońgruent transformations can be clas- 
sified into four categories’: (M) o,=+1, o=+1; 
(8) «=—1, «=—-1; © o=-1, «=+1; (D) 
o,=+1, o,=—1. The invariance of the theory for (20), 
Lorentz transformations and space-rotations, is well 
established. All the transformations belonging to class 
(Œ) can be obtained by multiplying all the transforma- 
tions of class (X) to any single transformation of class 

(©), say, 
(x, V, 2, )—>(—x, 3) Sa t). (2.4) 


All the transformations of class (Q) can be obtained 
by multiplying all the transformations of class (X) to 
any single transformation of class (D), say, 


(a, J, 4; i) (x, Y, 2, =) (2.5) 


All the transformations of class (X) can be obtained by 
multiplying all the transformations of class (X) to any 
single transformation of class (8), say, the product of 
(2.4) and (2.5). Thus the study of invariance for the 
general congruent transformations reduces to that of 
the transformations given in (2.4) and (2.5). 

Here we shall insert illustrations to show how a 
spatial rotation and a Lorentz transformation can be 
decomposed into simple reflections. Take two purely 
spatial unit vectors, a and b, where a?=b=0. We 
perform reflections first with regard to a and then with 
regard to b: 


x’=x—2a(a-x), 


2.6 
x= x'—2b(b-x’). ee 


Putting 
Ae. 
[aXb]= ae c, 


a 


where c is a unit vector, we can easily deduce from Eq. 


(2.6) 
x"=x-+ (1—cosy)[eX[eXx]]+sing[eXx]. (2.8) 


As a simple geometrical consideration will show, this 
transformation Eq. (2.8) is nothing but a rotation by 
angle ọ about the axis c. Suppose we perform further- 


(a-b)= coss, (2.7) 


more «1 simple reflection with regard to c. Then we 


obtain 
x" =x" — 2c(c- x”) 
a= —x— (1+coso)|cX[cXx]]+sing[cxx]. (2.9) 


This reduces to (2.4) for y=r, agreeing with the fact 
that the total mirage is the product of a simple reflec- 


5 See the Appendix in (R), reference-1 of this paper. 
a: CS ere 
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tion with regard to any direction in space and a rotation 
by 180° about this direction. 
In a similar way, let us take two space-like unit 
vectors, q 
a= (cosha, 0, 0, sinha), 


b= (cosh, 0, 0, sinh), 


(2.10) 


and perform simply a reflection first with regard to a! 
and then with regard to b”. Then, the product of these 
two simple reflections becomes 


x= cosh gx!+sinh gx, 
oe | (2.11) 
g =sinhgx!-+ cosh gx, 
with 
g=2(B—a). (2.12) 


Eqfiation (2.11) is merely an ordinary Lorentz trans- 
formation with a relative velocity in «’-direction of 
magnitude v: : 


sinhg=0/(1—2")! (2.13) 


The interchange of a” with b” changes the sign of th 
relative velocity. A 


B. Spinors 


G. 


The basic matrices E, (u=1, 2, 3, 0) are defined by 
JEJA. E, En = D gigs (2. 14) 


They must have at least four columns and four rows to 
satisfy Eq. (2.14). The “contravariant”’ E’s are defined 
by 


E*gy= Ey. (2.15) 
The matrix E4, defined by 
E,=1E=—ik, (2.16) 


will occasionally be used, but usually the index u will 
be supposed to run over (1,2,3,0) instead of (1,2,3,4). 
It should also be noted that Es defined by ` 


FE, E.E3E)=1Es (2.17) 


anticommutes with the four Z’sand E;?=1. The change 
of factor 7 into —z in Eq. (2.17) will change what 
Eddington would call chirality, but it does not affect 
the ordinary four-dimensional theory. 

There exist two important matrices, J and K which 
transform the basic E’s respectively into their Her- 
mitian conjugates and their transposes.® The matrix J 
is a necessary instrument to construct the “adjoint” 
spinors, while K is closely connected with the charge- 
conjugation. Their properties are, more precisely, 


PEIZ =, RJ- J= (2.18) 
KEK- E, INR aie, (O50) 
$ The J used here is ¢-times or minus 7-times the J used in (R)- 


For the derivation of the theorems concerning K and J, see 
W. Pauli, Ann. Inst. Henri Poincaré, 6, 137 (1936). 


ts 


$ 
f 
Í 


t. 
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where the bar means the Hermitian conjugate, and T 


means .the transpose. Between J and K there exists a 
relation: 
e = 
K=-J'K-Y, (2.20) 
There are'sets of E’s such that Ey, Eo, Ez, E,(= —iEo) 
and Es are all Hermitian (£o: anti-Hermitian) : 


E, =E Es= Es. , (Hermitian system) (2.21) 
In a Hermitian system, we can take 
J=J(H=fa=ky= —iEo, (Hermitian system) (2.22) 


and we have = 
K=K~. (Hermitian system) (2.23) 
Furthermore, among the Hermitian systems, there exist 
such sets of £’s, that Ey is antisymmetric, three out 
of Ey, E», E», Es are symmetric, and the remaining one 
is antisymmetric. If, for instance, we take E, as this 
antisymmetric one, then we have the following rule: 


ə E, Ez, Eo, Es have real elements, 
(special) (2.24) 


"Ez has imaginary elements. 
a 
In such a system we can take 
J=—iE); K=E,E;. (special) (2.25) 


If we take Es as the antisymmetric one, we have the 
following rule: 


E, Es, Es, Eo have real elements, 
(special) (2.26) 
Es has imaginary elements. 


In such a system, we can take 


J=K=—iky. (2.27) 


eAs will be seen presently, we must adopt a Hermitian 
system ih order to make the normalization of spinors 
in the ordinary 3-dimensional space feasible. We shall 
not use particular representations of J and K, except 
in the last stage of calculations, in order to keep the 
physical implications of J and K clear. As to the special 
systems in (2.24) and (2.26), they are mentioned only 
to compare the E-system to the usually adopted repre- 
sentations of the a’s and the y’s, and will not be used 
in’ this paper. 

We now introduce a 4—4- matrix representation of the 
entire congruent group. Any congruent transformation 
can be expressed ag the result of a series of simple reflec- 
tions characterized by normal vectors: aqm”, a2)", `+, 
a", whereby the order in which the reflections appear 
must be respected. Corresponding to this transforma- 
tion, we éntroduce a matrix: 


It should be noted that 


AA=03(E,E,+E,E,) = Ou. (2.30) 
Therefore, 
S=: “Ayi (2.31) 


It is easy’ to show that the .S’s defined by Eq. (2.28) 
are faithful representations of the congruent group, but 
they are by definition two-valued since the sign of each 
a” is arbitrary. From the defining properties of J and 
K follow 


JAS =0,5-, 
K“SK=0,S7. 


(2.32) 
(2.33) 


The spinor ¢ is defined as the representation vector 
of the .S’s, i.e., 
= SE. (2.34) 


The adjoint spinor has to be defined by a transforma- 
tion rule which contains S~, so that it may cancel with 
the transformation of £. This could be done, in view of 
Eqs. (2.32) and (2.33), either by the use of J or X. 
However, in order to provide the possibility of positive 
definite normalization of spinors, we have to choose the 
former alternative. Thus, the adjcint spinor £% is 
defined by 

=EL (2.35) 
where the bar means the complex conjugate in the 
c-number theory and the Hermitian conjugate in the 
q-number theory. The star will be reserved to designate 
exclusively the complex conjugate. In virtue of Eqs. 
(2.32) and (2.34), the transformation rule of £x becomes 


(2.36) 


EXE oe 


Now, the relation in Eq. (2.33) can be exploited to 
derive a physically important theorem—namely, if & 
and &* transform according to Eqs. (2.34) and (2.36), 
then £ and £* defined by 


f=h*K, =K! (2.37) 


transform, because of Eq. (2.33), again according to 
the same rules as seen in Eqs. (2.34) and (2.36). That 
t: and £X, defined by Eq. (2.37), are connected by Eq. 
(2.35) can be shown by Eq. (2.20). We shall see that 
transformation in Eq. (2.37) provides the possibility 
of charge-conjugation. 

The “normalization” of spinors is usually done by 
requiring * a 


f PGE tdxdydz= +1, (2.38) 


which is possible in a Hermitian E-system, because,on 
account of Eq. (2.22), Eq. (2.38) becomes Ha 


©“ @.6 


2 S=A,---AsAi, (2.28) pean Aree 
: Z= A * 
an Sena eee (2.29) J page epee Ce 
f Da Ae s “ 
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We shall have to use quantities of the type XO% 
where O is some operator with Dirac indices. It should 
always be kept in mind that if , 

Q=0f=i0'E, (2.39) 
this means that 
=-1iE,O or O= —iE,O’. (2.40) 


If O’ is Hermitian, then Q will be real in the c-number 


theory. 
It may be well to introduce here a set of simplifying 


notations: i 
Ep= (1/2) P(@,8)EaEs=(E,E,— E,E,)/2, 
Epx= (1/6) P(@,8,y) EakgEy, 
Eyan=(1/24)2) P(a,8,7,5) EalsE,Es, 


(2.41) 


where the P’s are sign-functions such that, for instance, 
P(a,8,7) is +1 when (@,8,y) is an even permutation of 
(u,v, x), and —1 when (a,8,7) is an odd permutation of 
(u,v,x), and otherwise zero. 

We shall now give the transformation properties of 
various tensorial quantities that can be built with the 
two spinors, y and o, obeying the same transformation 
rules of Eqs. (2.34) and (2.36): 


A=W*Ese, T=/*¢, 
A,=1WE,¢9, T=W*EsE,g, 
Ap= V*EsEwe, Tu= WE wo. 


(2.42) 


The factor z is inserted in four of those quantities so 
that the complex conjugate of each quantity becomes, 
in c-number theory, identical with the original quantity 
with y and ¢ interchanged. It is easy to show, with the 
help of Eqs. (2.34) and (2.36), that the “kinds” of 
tensors, defined in Part I, of these quantities are as 


follows: 
A, Ay, Ayr, 


second kind 
| (2.43) 
F, Ty, Pw. 


third kind 


Because of Eq. (2.17), A,, and T,» are complementary 
to each other. See (1.2.13). The assignment of kinds 
given in (2.43) will no longer be valid in the q-number 
tery See Table III, Sec. 9. 

- To avoid repetition, let us prove only that A, is a 
second kind pseudovector. Taking an arbitrary regular 
vector, x“, we built a matrix 


Com fe 
X=HE,. 


If x" undergoes the transformation of Eq. (2.1), X 


will undergo the transformation, 


XaalE,= o Ep— 2010" Ey (ax) 


es att =X— a (AX+XA)=— AXA, 
A is given by Eq. @ 29) More generally, if x” 
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undergoes a series of simple reflections, X will obviously 
transform according to 


X’=(-1)'oA,- 
=S XS. 


-AXA +A, ; 
(2.44) 


Now take the product of A, with «* and call it C: 
CENNE (2.45) 


then this will transform, as a result of Eqs. (2.34), 
(2.36), and (2.44), so that 
C'= W ARa gx! = oop% Te, gx", (2.46) 

showing that C is a second kind scalar. Then, in virtue 
of the quotient rule of tensor calculus and of the 
product rule of Part I, we conclude that A, is a second 
kind pseudovector. 

Equation (2.44) can also be written, with the help 
of a#., (1.2.1), as 


C—O ME ome (2.47) 
Since 2” is arbitrary, we obtain 
Ea". =0 SE, S. 5 (2.48) 


The relation in Eq. (2.48) enables us to compare the 
E-system with the customary a system and y system. 
More precisely, the transformation rules for class 2 
alone are not sufficient to determine the unique corre- 
spondence between the E-system and the y system. If 
we assume, for the y system, the transformation rule 
(including inversions), 


Ya» =Sy,S, (2.49) 
then we obtain the following correspondence: 
Eı=iysy1= — 102038 =F 1ps i 
Es=iy5Y2= — iazab = T23 
E;z=iysy3= — 101028 = T 3p3 (2.50) 


Eo=1E1= —ysY4= — 10102038 = —ip2 
Es=y5= tæa; = — p1=ip2p3 
M=tHiEs, Yz=iE2Es, y3=1E3Es, 
j ya=EEs, ys=Esp (2-51) 
m=}, œz=EE as=EE; B=EEs 
EX= —itEo= —i$tys, t= Eya =1EXEs 
A=yio, 


Ay a ip ty, P, 


(2.52) 
T=— itys 

L=iiyyse T(m v=1, 2,3, 4). (2.53) 
Aw=—iptywp, To=ibtyyyse H 


< From this correspondence, it can be seen that the 
Hermitian 2’s result in Hermitian a’s and Hermitian y’S- 


ta 


nakama ian 


ta 


- 
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ə Furthermore, the customary representation of a’s in 


which only œz has imaginary elements corresponds to 
the special system as seen in (2.24) and Eq. (2.25). 
TIe y system, in which only y4 and ys have imaginary 
elements, corresponds to the special system of (2.26) 
and Eq. (2.27). If we translate the charge-conjugation 
of Eq. (2.37) into the language of the y system with the 
help of ‘Eqs. (2.50) and (2.52), we get 


s f= iE tESK. 

Comparing this with Schwinger’s definition’ of charge- 

conjugation matrix C: 
f= — fC (2.54) 
we get C=iy5K and, in particular, in the system of 
(2.26) and Eq. (2.27), ‘ 
We shall, now discus’ briefly the spinor transforma- 
tions corresponding to the illustrations given in Eqs. 
(2.8) and (2.11). The transformation matrices S cor- 
responding to Eq. (2.6) are, according to Eq. (2.29), 


x CEN; 


Aaa Pa, Ay=b%Ez (a=1,2,3). (2.56) 
Then the S for Eq. (2.8) will be 
S=A,Aa=—> cies 
eee 2 (2.57) 
(a, B, y: cyclic in 1, 2, 3). 
In particular, when =r, Eq. (2.57) becomes 
S1= — (E1263 + E2361 + E102). (2.58) 
The simple refiection with regard to c is 
© So= Eyer+ E2024 Exes. (2.59) 


The product of these two, which is the mirage, is given 


S3=9251= — EE ok. (2.60) 


These S1, S2, Ss will play a certain role in the angular 
momentum representation of a spinor field. It should 
be noted with regard to Eq. (2.57) that if we con- 
tinuously increase p up to y=27, S becomes —1 
instead of +1, which is another manifestation of the 


two-valuedness of S. ' | : 
Next for the transformations given in Eq. (2.10), we 


have 2 © A 
aR A= E; cosha+ Eo sinha, 


"A= E; cosh8+Ep sinh£. 


The result of these, which is a Lorentz transformation, 
is j 


(2.61) 


S= A2A1= cosh(8—a)—E1Eo sinh(8—a). (2.62) 
7 J. Schwinger; Phys. Rev. 74, 1439 (1948). 
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The transformations considered in Eqs. (2.56)~(2.62) 
pertain to the spinors which do not depend on the 
position in the space-time. Usually the spinor represents 
a certain field, and therefore should undergo the trans- 
formation of the argument (x,y,2,l) of £ in addition to 
the spinor transformation considered above. As is 
always the case in the problems of this kind, two 
“pictures” are possible: either the vectors are fixed 
while the coordinates are shifted or the vectors are 
shifted while the coordinates are fixed. Taking the first 
picture, let us denote the transformation in question by 
an abstract symbol S. The same space-time point will 
be given coordinates x” and x’# in the old and the new 
frames of reference, in such a way that x'= Gx. At the 
same time the field quantity F will become SF =F". 
Therefore the field quantity F in the new description is 


F' (x!) = SF (2) = SFS N. (2.63) 


As an illustration, let us apply this formula to the 
transformation of Eqs. (2.8) and (2.57): 


g 2) 
(x)= (- Ds sinsctBay + 05- ) 
Y 2 2 


XE(x’+ (1—cose)[ex[exx’]] 
sinele Xa. (64) 


The sign of ¢ is opposite to that used in Eq. (2.8) since 
we have to take © instead of S in the argument of &. 
In particular, if ¢ is oriented in the z-direction, 


_¢ g 
El yN (- sin Ecos) 
2 2 


é(cosex’+singy’, —singx’-+-cos¢gy’, 2’). 
Ss ? 


For the infinitesimal rotation dy, we then obtain 
[dé(x’,y',2/)/de} ono 
=y (d&/dx")— 2 (dt/dy!)—YEuk 
=—i(Lst hoa)é, 


where L; and øs are the z-components of the familiar 
operators of orbital angular momentum and spin 
angular momentum, i.e., : 


L3=—i(x0d/dy—yd/x), o3=—iEE2. (2.66) 
C. Various Kinds of Spinors i hy 


(2.65) 


Just as the four kinds of tensors (Part I, Sec. 2) are = 


equally justified representations, the four kinds of 5 


spinors, ® ais 
H=SE,  1'=0Sn, PRONE =i 
fe: =o1S§, | o=o So, wigs a s 
3 See the Appendix of reference (R). ie ee 
Sg e = T 


2 Sc ee 
be 


a oak 
ae 
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are equally justified spinors: The tensorial quantities, 
Eq. (2.42), made out of two spinors belonging to the 
same kind have the same transformation properties as 
have been given with regard to the é spinor. If y and ¢ 
of Eq. (2.42) belong to different kinds, then the trans- 
formation properties of the tensorial quantities are 
modified by the o’s involved in y and ø, just like the 
product rule of Part I, Sec. 2. 

However, Eq. (2.67) is not the only possible analog 
of pseudotensors for spinors. Actually, the three groups, 
each with two elements, (¢=1, c=—1), (@:=1, 
o:=—~1), and (¢,=1, cs=— 1), are respectively iso- 
morphic to the three factor groups, (A+8, C+D), 
(I+, BHD), and QI+D, B+ ©) of the congruent 
group since o°=0; =0;=1. The spinor representation 
is basically two-valued, therefore, —1 as well as +1 
can serve as the identity transformation. Thus, we can 
equally well take (@)?==1, (¢)!==+2), ((¢,)}=+1, 
(c) =i), and ((¢,)!=-+1, (¢,)!=-47), as the repre- 
sentations of the above factor groups. This considera- 
tion leads to the following new kinds of spinors: 


w=(0)'Sp, »'=(01)'Sv, x’=(c,)!Sx, 


where the square root of —1 can be either +7 or —i. 
Thus, Eq. (2.68) actually contains six different kinds 
of spinors. The possibilities of Eq. (2.68) were first 
indicated by the author? in connection with a 5-dimen- 
sional theory, but it obviously applies to the 4-dimen- 
sional theory. It can easily be seen that Yang-Tiomno’s 
A- and B-types” belong to our Eq. (2.68) and C- and 
D-types belong to our Eq. (2.67). Our classification is 
more detailed than Yang-Tiomno’s because classes § 
and D are taken into consideration. 

We should not spend more space here for these dif- 
ferent kinds of spinors, because, in field theory, we 
shall be able to deal with the possible phase changes 
in reversion and mirage without specifying, in advance, 
the kinds of involved spinors. 


(2.68) 


„D. Transpose Operators 


In the main body of this paper, we shall have to deal 
constantly with transpose operators. The transpose O7 
of an operator O can of course be defined in the matrix 
form, 

07 ;;= O;;= (0;;)*. (2.69) 

The operation of transposition itself is not an invariant 

operation for an arbitrary unitary transformation. We 

shall ste in the next section that, in spite of this, a 
_ Certain operation involving transposition acquires an 

- invariant meaning. 
For instance, the operator O=0/dx has the matrix 
elements, : 
3 * (x lOl) = (x — z"). (2.70) 
pu A 
9S, Watanabe, Sci. Pap. Inst. Phys. Chem. Research (Tokyo) 


157 (1941). 
Se Yang dnd J. Tiomno, Phys. Rev. 79, 495 (1950). 
+: Ogee 
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Its transpose is given by 


(a! |O7 |”) =0' (4 —2') = —8' (a! — 2”). 


(2.71) 


Therefore, O=0/dx is an antisymmetric operator Sn 
the x-representation. Equation (2.71) shows that here 
OT means the differentiation of a function standing to 
the left of this operator. This agrees with tke more 
elementary definition of the transpose operator, 


of= for. 


(2.72) 


In the same way, pz-= —10/0« is also an antisymmetric 
operator in the x-representation, because it is Hermitian 
and has imaginary matrix elements. We shall often use 
an operator Q, which is defined by 


0, f(«)= f(—2). (2.73) 

For this operator, we have 
(a’|Q.|2"”) = (a"|Q,7 |x") =8(a'+a""), (2.74) 
i.e., in the «-representation, : 
Q= (== (2.75) 


For an operator which involves both space-time coor- 
dinates and the Dirac indices, we have to take the 
transpose with respect to both of these variables. For 
instance, if O is given by” 


T i=l a (a=, 2, 3), (2.76) 
La=—La" = —il%p(9/dx,)— xy (3/318) | 
(a, B, y: cyclic in 1, 2, 3), (2.77) 
then we can write, with the help of Eq. (2.19), 
MESEI (2.78) 


E. Redundancy of State Functions 


It is well known that the state function W has an 


inherent arbitrariness of phase factor, i.e., ¥ and 
ein (2.79) 


represent physically the same state, where œ is an 
arbitrary real constant. However, it seems to be 
generally overlooked” that a can be a particular func- 
tion of the occupation number of spinor particles 
involved in Y. For instance, we can take as exp(ia) 
the following function: Fors 


= yl 0 

ee AA AT AT o ) 

i i = 2.80 

with 9, pee 
A= 


uP. A. M. Dirac, The Principles of Quantum M echanics (Ox- 
on University Press, London, 1947), third edition, p. 267, Eq. 


£ This fact was noticed b 5 3 zas. (4.25) 
and (8.15) in reference R the author in relation to Eqs. ( 


a 


ce 


ee Z 


t4 


o~- 


ee 


OO EE EE EE ee 
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ə The index i should run all over the spinor eigenstates, 


If Y is known to represent a state with an even (odd) 
number of spinor particles, then AW will be ty (=Y) 
Bat, in general, we have to leave A as an operator, If 
A commutes with any arbitrary physical quantity, Y 
and AW will represent the same state because these two 
state functions will have the same expectation values 
for all the physical quantities. 

i The commutability of A with any physical quantity 
is a consequence of the fact that any physical quantity 
must involve an even number of spinors. In Sec. 3 of 
Part I of this paper, we adopted the viewpoint that 
all physically observable quantities are tensorial quan- 
tities of some “kind.” If this postulate is maintained, 
the spinor analysis will tell us that any physically ob- 
servable quantity must be an expression quadratic or 
bilinear in spinors or a product of such expressions. It 
usually has the form y*Og, but in some cases (e.g., pair 
creation inferaction energy) Y*O¢* may also happen, 
where yX is linear in creation operators g and ¢ is linear 
in annihilation operators g. The forms of g and g are: 


HG ae 
Bee eo 


or their transforms by. an unitary transformation. The 
factor represented by II is the well-known Wigner- 
Jordan factor. We now have 


A AgA=—g, AgA=—g. (2.82) 


As a resulteany expression which contains an even 
number of spinors commutes with A. 

Later we shall see that A is not the only unitary 
qperator that commutes with all the known physical 
quantities. In general we shall write W for any operator 
that has this property. A more general consideration 


" - on this line will be given in the section on the super- 


selection rule. 


3. OPERATORS FOR REVERSION, MIRAGE, INVERSION 
AND CHARGE CONJUGATION 


We use the interaction “picture” throughout this 
paper, in order to have at hand a formula applicable 
for both the Heisenberg and Schrédinger pictures 
rather than for the purpose of actual calculations of 
specific problems. in this picture. We can obtain the 
formulas for the Heisenberg and Schrödinger pictures 
from those of the interaction picture only by including 
the entire Hamiltonian either in the “free” Hamil- 


tonian ôr in the interaction Hamiltonian of the inter- 


action picture. 
The time development of th 
governed by ‘the interaction 


e state function, W(é); is 
Hamiltonian H(/) and 


expressed by 


W (tz) = U (to,ty)V (ti), (3.1) 
with > 
dU (boy) /dla== — iH (t2) U (to,41), l 
(3.2) 
dU (tots) /dh=-+iU (tay H(h), | 
and 


U(tyh)=1, Unt) =Ü (tah) =U (tst) (8-3) 


The time-development of the physical quantity, OH), 
is governed by the free Hamiltonian Ho and expressed 
by 
Q (to) = UT” (to,t1)Q (h) Vol ta,tr), (3.4) 
with 
dU o(to,t1)/dlo= —iHoU (to,t1), 


Ula fi=t. (3.5) 


According to the definitions of reversed phenomenon, 
miraged phenomenon and inverted phenomenon given 
in Part I, Sec. 3, they are characterized by 


Q'(—)=pr2(), Q'(—x)=puQ(x), 

Q'(—x, —)=p10(x,t). (3.6) 
See (I.2.18), (1.2.19), and (1.2.21). In quantum physics, 
we have to reinterpret Eq. (3.6) int terms of expecta- 
tion values. Thus, the reversed phenomenon Wp(Z), 


miraged phenomenon W,,(/), and inverted phenomenon 
W(t) of an original phenomenon ¥ (/) should be defined 


by 
(Wa(—4), O(x, —)¥nr(—2)) 


=pr(©(t), (xv), (3.7) 
(War), O(—x, DW) 

=pul(¥(!), Qx, NYH), (3.8) 
(Yr(—), O(—x, —)¥r(—2)) 

=pr(%(), OG"), (3.9) 


where pr, par, and pr are given in Part I, Tables II, 
III, IV, and V. 

We add to these sign functions another sign function 
pc for the charge conjugation. For all the “mechanical” 
quantities pc is +1 and is —1 for all the “electro- 
magnetic” quantities. Among the various quantities 
mentioned in Part I, Table V, the following are elec- 
tromagnetic”: electric charge, magnetic charge, current, 
electromagnetic potentials, electromagnetic field 
strengths, electromagnetic smoment. All the mst are 
“mechanical.” Using pc, thus defined, the charge-con- _ 
jugate phenomenon V¢(/) should be defined by ., 


(Vel), O(x,)¥cM)=pcYO, ONY). (3.10) 


We can now define reversibility, refiectibility, -etc. 
as follows: If Vr(1), Yu), Y(t), or Vo(!), defined in 
Eq. (3.7) through Eq. (2.10), is a solution of Eqs. (3.1) 
and (3.2) on conditién that W() is a solution, then we 


? Ree 
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speak of reversibility, reflectibility, inversibility, or 
charge-invariance. Actually, we gave, in Part I, a slightly 
different definition of reversibility, etc. According to 
this definition, reversibility holds if the transition prob- 
ability from an arbitrary © to an arbitrary , say, is 
equal to the transition probability from the reversed 
state of 9 to the reversed state of ©. We shall presently 
see that the definition given above automatically entails 
this last definition. 

We now introduce four time-independent, unitary 
operators which are useful tools to prove the four 
invariances in question. Reversion operator R, mirage 
operator M, inversion operator J, and charge-conju- 
gation operator C are defined by 


Q(x, —)=pe(RAO(%,)R)7, (811) 
Q(x, d) =a MENM), (3.12) 
Q(=x, —)=pr(I“0(x,)?, (3.13) 
Q(x, )=pc(CO(%,C>). (3.14) 


In the quantized field theory, the field strengths ap- 
pearing in Q are q-numbers, and the transposition T 
refers to the Q as aq-number. For instance, 

(RAVYOYR)7=RYTONWTR, (3.15) 
where T on O has the meaning of transposition dis- 
cussed in Sec. 2, D. The left-hand side of Eq. (3.15) 
should not be equated to the negative of the right-hand 
side on the ground that y and yX anticommute. The 
anticommutability of Y and yX is implicitly taken care 
of by the matrix representations of y and yX. 

Tt is true that the operation of transposition is not 
invariant for a unitary transformation. For this reason, 
one may think that the definition in Eq. (3.11) has no 
physical meaning. However, this is not the case, for we 
have not yet determined the transformation rule of R 
for a unitary transformation. Take a time-independent 
unitary transformation V, by which Q and W are 
transformed into 


Q'=V-0OV, W=V-y, (3.16) 
Then we shall have again 
O'(x, —/) =pr(R’10' (x,)R)?, (3.17) 
in the primed system if we take 
ae R'=VRv*, (3.18) 


“where-V*= V7. Thus, we assume Eq. (3.18) to be the 
_ transformation tule of, R for the time-independent 
unitary transformation V. We have the same rule for I. 

_ We shall now show that R, M, I, and C, if they exist, 

bles us: to, build We, Vu, Yz, and Wg, defined in 


183A time-Hependent unitary transformation will be described 


er s section. r 
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Eqs. (3.7)-(3.10), from a given Y: 
Wr(—D=V* OR, Farhi) =MVY(i), 
W(—f)= wv* (I, Wo(1)=CW(2). 


(3.19) 


We can, of course, insert arbitrary phase fectors in 
these equations, but, for the moment, we essume them 
to be included in R, etc. To avoid repetitions, let us 
prove only that Ypg defined by Eq. (3.19) satisfies Eq. 
(3.7). By the use of Eq. (3. 19), the left-hand. side of 
Eq. (3.7) becomes 


(RTW* (0), O(—O)RTW*) = (W*(), R7Q(—DR™V* (Ù). 
Because of Eq. (3.11), this is equal to 
< pr&*(), Q OI* O) =pek@, QOY ()*. 


This is equal to the right-hand side of Eq. (3.7). The 
proof runs in a similar way for M, J, and C. 

It should be noted that if W-and Wr undergo the 
transformation of Eq. (3.16) and R undergoes the trans- 
formation of Eq. (3.18), then Eq. (3.19) again holds in 
the primed system. 

It is now clear that reversibility, reflectibility, inver- 
sibility, and charge invariance will be established if we 
can prove that Vr, Y x, Wr, and Ye defined in Eq. (3.19) 
are solutions of Eqs. (3.1) and (3.2) on condition that 
W is a solution. We shall now show that this is in fact 
the case. Take two transformation functions U (t,0) and 
U(0, —#) which are, according to Eq. (3.2), given by 


dU (1,0)/dt= —iH()U (1,0), 
dU (0, —1)/d(—) =+1U(0, —)H(—), 


3.20 
with se 
U(0,0)=1. 
Their transposes are then determined by 
dU" (t,0)/dt=—1U7(t,0)H7(), 
dU" (0, —1)/d(—i) =+iH?(—)U7(0, —d, 
\ ) (3.21) 


with 
U7 (0,0) =1. 


Applying R to Eq. (3.20), we get (on account of the 
time-independence of R) 


IRU (.0)R/dt= — iH? (—) RU (1, OR, 
dR“U (0, —1)R/d(—1) = +:R-U (0, —) RH"), 


with (3.22) 
RU (0,0)R=1, 
because (pr=-+1 for energy) : 
H(—)=(RAH()R)?. (3.23) 


he 
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Comparing Eq. (3.22) with Eq. (3.21), we obtain 
R1U(1,0)R=U7(0,-—), R-U(0, -)R= UT (1,0). 
Š (3.24) 


, Combining the two equations of (3.24), we get'* for 
U(t, —‘)= U(1,0)U (0, =1); l 


UT (t, —)=RU (t, —OR. (3.25) 
In a similar way : 
i U(t, —)=MU (t, —)M—, 
UT, —) =U (HI, (3.26) 


U(t, —)=CU (t —)C. 


It should be noted that the two equations of (3.24) are 
equivalent to each other if R’R™ commutes with U. 
We shall discuss this condition in more detail later. 
Once Eq. (3.25) is established, it is an easy matter to 
show that if a 

3 (=U, —)¥(-), 


Wr(N=U(t, —)Yr(— i). 


In fact, because of Eqs. (3.19) and (3.27), we have 


(3.27) 


ethen 
(3.28) 


Ua(Q=V*(-)R=W*(NU(, —)R, (3.29) 
which, in virtue of Eq. (3.25), equals 
W*()RUT(t, —)=U (t, —i)¥r(—t). (3.30) 


Q.E.D. The proof runs the same way for the other 
operators. y 
It has been shown that if R exists to satisfy Eq. 
(3.11), reversibility is guaranteed. We shall show in the 
following sêctions that such an R in fact exists. 
Incidentally, by the use of 


5 R7Uo(t, 0)R=Uo7 (0, —¢), etc., (3.31) 
which is a consequence of 
ž Ho= (R*H)R)’, (3.32) 
we can rewrite Eq. (3.11) in a simpler way: 
Q(x,0) =pr(RQ(x,0)R)”. (3.33) 


‘The derivation is similar to our argument regarding 
Egs. (3.20)-(3.24). 
Another important consequence of Eq. (3.25) is the 
alternative definition of reversibility used in Part I. 
Suppose there are given two arbitrary states, © and Q. 
In one process, we consider the transition probability 
from © at f= —h, to @at /=+h. In the other process 
we consider the transition probability from the reversed 
state Qe of Q at f=—h to the reversed state Or of O 


at t= 4: OR MAER en 
in the corresponding Eq. (4.27) in (R). 


u There is a misprint 


The first transition probability is determined by the 
transition matrix 


(9, U(h, —=4)0) A 
and the second by 
(Or, U(t, —h)On)= (0% RTU (by, —h)RIO*), 


On account of Eq. (3.25), we can rewrite this last ex- 
pression as 
(0, Ült, —)2)*= (2, U (h, —4)9). 


Thus, nl 
(2, Ul, —)O)=(Or, UU, — Qe). 


* (3.35) 


From this it follows, to use the notation of Part I, that 


P(O—2Q, 21) = P(Qi—O pn, 21). (3.36) 
In a similar way, we obtain 
(2, Ut, —)O)=(Qu, UL, —YOw) 7 
| (3.37) 


= (Or, U(t, —)%) 
= (Qc, U(t, — Oc). J 


It should now be noted that the faur operators defined 
by Eqs. (3.11)-(3.14) have two types of basic arbi- 


trariness: 
(3.38) 
(3.39) 


Se, 


SWS, 


and 


where S stands for any one of the four operators, and 
W is supposed to satisfy 


Wow7=0 


for any known physical quantity Q, as has been ex- 
plained towards the end of the last section. For our 
study of symmetry properties of physical laws, how- 
ever, we need to pick any one S that satisfies the de- 
fining equation; the arbitrariness expressed by (3.39) 
can be relegated to a separate study ef operators 
satisfying Eq. (3.40). We shall see in Sec. 12 that we 
can assume, without loss of generality, that Y under 
consideration belongs to one or another of the Hilbert 
subspaces, within each of which the effect of W on Y 
is equivalent to multiplication by a phase factor. 

Suppose then that one such R satisfying Eq. i 
obtained and that its phase factor in the sense of 
(3.38) is fixed arbitrarily. Then Eq. (3.19) will be write 
more generally as A ang 


Wr(—D =U (OR, | 


(3.40) 
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Now, if Yp(—2) is the reversed state of (4), then 
V(—2) must be the reversed state of Vr(/). Therefore, 
we shall have 


(=)= PVR" (NR. (3.42) 
Combining Eqs. (3.41) and (3.42), we obtain 
P) =e -IRTRY (i) (3.43) 


for any ¥. Equation (3.43) represents the effect of a 
“double reversion.” 

Equation (3.43) shows that RTR- is either a c-num- 
ber, e99), or a q-number for which W is an eigenfunc- 
tion corresponding to eigenvalue, e*-. Such a 
g-number must obviously be of the type as seen in Eq. 
(3.40), exemplified by A in Eq. (2.80), since otherwise 
W and R?R“Y would be physically distinguishable 
from each other. 

Whatever RTR- may be, we cannot exclude the 
possible existence of self-reversed processes in Nature. 
The simplest example is the vacuum state. For such a 
state a and 6 must be the same, and we obtain 


Weerr (t) = R7RWV cite (i). (3.44) 
Tf RTR- is a c-nuraber, then (3.44) gives 
RIRS=1 or RT=R. (3.45) 


If RTR is, for instance, a multiple of A, i.e., if 
RTR=aA, then Eq. (3.44) shows the existence of an 
eigenstate of RTR for which the eigenvalue is +1. 
Since A has eigenvalues +1, a must be +1. In par- 
ticular, since A operated on the vacuum-state Wyac 
must give +1, due to the definition in Eq. (2.80): 


AWyac= WV vac (3 46) 


and since Wy, is a self-reversed state, we have to choose 
a=1, or 
RTRI=A. (3.47) 
It should be noted that the transformation in Eq. 
(3.18) leaves the condition in Eq. (3.45) unchanged. It 
also leaves the condition in Eq. (3.47) unchanged pro- 
vided V commutes with A. This commutability may be 
assumed as quite a general rule since A must be unob- 
servable before as well as after the transformation. 
The transformation in (3.38) can be considered as an 
especially simple case of Eq. (3.18). The transforma- 
tion of (3.39) transforms RTR- as follows: 
RTR7—RTR“CRW RW) (3.48) 
j.e., if R=WRW*, then RTR- remains unchanged. 
We shall see later that for the spinor fields we actually 
get Eq. (3.47) and not Eq. (3.45), at least insofar as we 
remain in the framework of the accepted theory of 
quartized spinor fields. No transformation W is dis- 


-covered that makes RW7R“W—1=A, so that the trans- 


tion in (3.48) would result in Eq. (3.45). For 
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the boson fields, we obtain Eq. (3.45) in a quite natura] 
fashion. 

A by-product of this consideration is that Eq. (3.44) 
is satisfied only if A=+1, i.e., the self-reversed states 
must involve even numbers of spinor particles. It 
should, however, be kept in mind that this: conclusion 
is derived in a purely field theoretical point, of view, i.e., 
the entire physical system is inclusively represented by 
Y. 

As for the operators M and C, if we perform a double- 
mirage and a double-charge-conjugation, in a similar 
way to Eqs. (3.41) and (3.42), we obtain 

WY=OMY(), VOY=e PCW. (3.49) 
This shows that M? and C? must be a c-number or a W, 
in the sense of Eq. (3.40). If the former is the case, we 
have to put M?=a, and C?=), where a and b are some 
constants of absolute value unity. But this ø and b can 
be transformed easily into unity by the transformation 
in (3.38), which does not change anything essential 
except that the eigenvalues are renamed. Therefore we 
can, in this case, write í 


(3.50) 
(3.51) 


re = we Ske 
MSM — Vi O— Ca — IC: 


In contrast to the case of RTR, the conditions of 
Eq. (3.50) on M? and C? determine the numerical 
factors of M and C. We shall see that we can find ina 
quite natural way M and C which satisfy Eq. (3.50) in 
both spinor and boson cases. However, it should be 
noted that the transformation in (3.39) leads to 


W(WMWM4)M?, 
2 (WCWC)C?, 


(3.52) 
(3.53) 


the right sides of which are not necessarily c-numbers 
even if the original M and C obey Eq. (3.50). As a 
matter of fact, we can easily find a W which commutes 


with M and C and whose square is not a c-number. - 


Under these conditions, the right-hand sides of (3.52) 
and (3.53) become W? which is again an operator of 
the type of Eq. (3.40). Actually, if we use the general 
expression of M satisfying Eq. (3.12), we are led to the 
expression of M? which is not a c-number, and this fact 
was used to discover a new W.!5 However such M can 
be brought back to an operator satisfying Eq. (3.50) 
by a transformation of the type of (3.39). Therefore, 
we can always assume Eq. (3.50) and study tne W’s 
of Eq. (3.40) as a separate problem. 

Coming back to Eq. (3.49), if we take as Y a self- 
miraged state or a self-charge-conjugate state, We 
obtain, under the assumption of Eq. (3.50), 


(3.54) 


16. Watanabe, Proc. Internatl. Nuc. Phys. Conf. Glasgow 
(1954) (to be published), S Z 


ia— {1 
2 


ee=+1. 


te 


ye 


— 


Se 


ta 
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In other words, an operator satisfying Eq. (3.50) will 
have gigenvalues +1 and —1, corresponding to which 
there will be two groups of eigenfunctions, dividing self- 
raraged or self-charge-conjugate states into two groups. 
This is the origin of the even and odd parity with regard 


‘to space-symmetry and charge-interchange. 


We will how briefly discuss the combinations of two 
operators, introduced in Eqs. (3.11)-(3.14). As a repre- 
sentative of the various combinations, let us first study 
the time-reversal considered by some authors,!° which 
is acttially the combination of the reversion in our sense 
and the charge-conjugation. According to this point of 
view, the reversed state Vp’ is defined by 


(Fr (0, O(x, —)F r (0) 
=prpe (T (i), O(x,)v(), 


instead of Eq. (3.7). Such a state can be considered 
either as tlfe reversed state in our sense of the charge- 
conjugate state or the charge-conjugate state of the 
reversed ssate in our sense. Accordingly, the operator 
R’ playing the role of R in this point of view, i.e., the 
operator such that Wp’(—1)=*()R’, will be given by 
elther 


(3.55) 


P RCR or R'=RCT. (3.56) 
Since these two operators must physically have the 


same effect, we have to write 


CAR=WRC?, (3.57) 
where W may be a c-number or an operator of the type 
in Eq. (3.40). If we adopt R'=C~R and C*=1, the 
double time-reversal in this modified point of view will 
be represented by 


RTRA=RIRIOW, (3.58) 
where W is the one used in Eq. (3.57). 

As a.matter of fact, exploiting the arbitrariness still 
disposable within the conditions of Eqs. (3.47) and 


. (3.50), we gan adjust C and R in such a way that W in 


, 


Eq. (3.57) becomes an operator with a wide range of 
arbitrariness, including W = c-number as well as W= A. 
As a result, R’, in contrast to R, can satisfy RTRA=A 
as well as R’7R’=1, among other possibilities.” 
Another important combination is one of reversion 


16 Called standpoint (I) in paper (R). See, for instance, J. 
Sines Bien 82, 914 (1951). Our standpoint in this 
present paper, which was labeled (IL) in paper (R), is in ac- 
cordance’with the standpoint adopted originally by, W igner and 
the author’s older papers. See. E. P. Wigner, Gottinger Nac a 
546 (1932); S. Watanabe, "Le deuxième Théorème de la Thermo- 
dynamique et la Mécanique Ondulatoire (Hermann et Cie, Pai 
1935); S. Watanabe, Sci. Pap. Inst. Phys. Chem. Researc 
(Tokyo) 31, 109 (1937). Dr. R. H. Good showed, in a private 
communjcation, that this standpoint applied to the Dirac : eory 
of electrons can be smoothly connected through Pauli’s on 
ponent theory to the time-reversal of the Schrödinger theory o: 


T r connected with Eqs. (6.5) and (6.19) in 


paper (R). 
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operator and mirage operator, which should result in the 
total inversion operator T: 


[=MR or I=RM?. 
Since R’ and J can thus be obtained from the basic 
three operators, R, C, and M, we shall not discuss R’ and 
J any further in this paper. If reversibility and charge- 
invariance hold, then the invariance for the modified 
time-reversal will hold. If reversibility and space- 
invariance hold, then inversibility will hold. 
Before closing this section, let us briefly consider the 
effect of a time-dependent unitary transformation’ R: 


UW=VIQOOVO, WV H=V7Ov). (3.60) 


We can re-establish Eqs. (3.11) and (3.19) in the primed 
system by taking 


(3.59) 


R' =V ORV (3.61) 
We can pass from the interaction picture to the 
Schrödinger picture by putting 


V= (i) = Uo (4,0). (3.62) 
The reversion operator in the Schrödinger picture then 
becomes 


R'= U(t 0)RU 1(0, —H=R, (3.63) 
on account of Eq. (3.31). This shows that we can use 
the same R in the Schrödinger picture. 

In a similar way, we can show by the use of Eq. 
(3.24) that the reversion operator in the Heisenberg 
picture is the same as in the interaction picture. By the 
same token, M, C, and J remain unchanged in all the 
three pictures. 


4. PASSAGE FROM C-NUMBER THEORY TO 
Q-NUMBER THEORY 


Before discussing the symmetry properties of physical 
quantities in the q-number theory, it may be well to 
survey the situation in the c-number theery and to see 
in what respects the c-number theory encounters dif- 
ficulties. As is well known, the c-number theory Is 
analogous to the Heisenberg picture in the sense that 
the time-development is attributed solely to the time- 
dependence of the physical quantities. 

The problem in the c-number th<ory lies in the fol- 
lowing situation. The field variables v (which may be 
tensorial or spinorial) are supposed to have certain 
transformation properties. Lhe physical quantities Q,. 
such as spin-density, electric current, etc., are expressed 
in terms of field variables u. By the transformation 
properties attributed to the ws, we can therefore deter- 
mine the sign function p’ (for reversion, mirage, etc.) of 
each Q. On the other hand, due to the physical con- 
sideration given in Part I, each Q has a definite p. The 
problem is then to see whether the p’ determined by the 
transformation properties of the field variables s 


-m we} 
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actually coincide with the p determined by physical 
requirements. We shall presently see that this is not 
always the case. 

In reference to Eq. (2.63) of Sec. 2.B, let us consider 
© as representing reversion’, mirage Wè or inversion 
Q. In these cases, we have: G@'=G, or €*=1. Of 
course, for spinors the effect of © has an ambiguity of 
sign; as a result, G*=-+1. But, the physical quantities 
being of even order in spinors, this ambiguity does not 
affect our argument and we can take G’=1 without 
loss of generality. For the field variable v, Eq. (2.63) 
{ becomes 


(4.1) 


This can be interpreted as follows: Let u(x) and w’(x) 
represent the original process and its transformed (i.e., 
reversed, etc.) process. Then w(x) and s(x) are con- 
nected by Eq. (4.1). 

Now each physical quantity Q(«) has an expression 
in terms of the u(x) and some operator which may also 
depend on v. Therefore we can write 


j Q(x) =OLu(x) ; £]. (4.2) 


For the transformed process, we have to take a’ at Gx 
and transform the explicit x in Eq. (4.2) into Gx. Thus 
we have to compare with Eq. (4.2) the following quan- 


w (x!) = Sulo’). 


tity: 
Olu (Sx); Sx], 
which, in virtue of Eq. (4.1), becomes 
Sul); Sx]. 


The operation © has been shifted from the argument 
x to the tensorial or spinorial function #. The p’ which 
was in question in the foregoing must then be given by 


OLSu(x) Sa] =p’OLa(x) ;x1], a 
OL Su( Sx), a ]= uS); S] 


The second equation is obtained from the first just 
renaming ‘Gs: as x. The first one shows that p’ can be 
determined simply considering the tensorial or spinorial 
transformation of u and the transformation of the coor- 
dinates explicitly involved in Q. 

We shall soon see that the p’ thus determined is not 
necessarily the same as the p given in the tables of 
Part I. The physical quantities can be classified into 
“mechanical” and “electromagnetic.” In the field 
‘theory, mechanical quantities are primarily derived 
= from «ne “free” Lagrangean density, by the well-known 
procedure. For instance, if the Lagrangean density is a 
regular scalar, the energy-momentum tensor T” and 

e angular momentum, tensor M” are regular tensors 
according to this procedure. We have shown in Part I 
the enezgy momentum tensor should belong to 
e, regular kind. As to the angular momentum tensor, 
T e that the complementary vector, in the sense 


= teal becomes a first kind pseudo- 


‘ 


coe 


a - a a 
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vector. We have explained in Part I, Sec. 3, that the 
angular momentum (including spin), if represented as 
a vector, should behave like a first kind pseudovector, 
This shows that if the Lagrangean density is a regplar 
scalar, all the rest of the mechanical quantities will 
become what they should be. In a similar way, if the 
current-density appearing in the interaction Lagrangean 
density is a second kind pseudovector, all the rest of 
electromagnetic quantities will behave as they should. 
For this reason, we shall limit our discussion to the free 
Lagrangean and the current-density. This wiil be the 


case also in our q-number discussion. If it is desired, ` 


one can check the results for individual physical quan- 
tities, but we shall ignore such a discussion in this 
paper. See, however, (R) and Sec. 9 of this paper. 

Let us first consider the free Lagrangean of tensorial 
fields v, i.e., pion-field and electromagnetic field. This 
term depends on v through a combination of the type 
dv, or v, and it contains an operator which is a 
regular scalar, such as (0/dx,)(0/0x”). Now, whatever 
the kind of the field v may be, the combination of the 
type dv or d,v" is a regular scalar on account of the 
product rule, Part I, Sec. 2. Therefore, the free La- 
grangean of tensorial fields is a regular scalar. The 
current-density of a tensorial field is typified. by 


s# (w) =iel_(00(x)/Ox,)0(a) — B(x) (O0(~)/Oxy)], (4.4) 


which refers to a scalar or pseudoscalar pion field. 
Since dv is a regular scalar, quantity (4.4) is a regular 
vector. This is, however, not the desired transformation 
of the current-density, which should be a second kind 
pseudovector. In other words px’ is all right, but pr 
is wrong. This is also true for the current due to a vector 
or pseudovector pion field. 

For a spinor field, the free Lagrangean has the form: 


c= 
&(x)=— A OOOD TE Ont iE 
Now, since YE, and YXEs belong to the second kind, 
Eq. (2.43), and 0/0x, to the regular kind, £(«) (4.5) 
is a second kind scalar. In other words, par’=pu but 
pr'=—pr, in the cnumber theory. On the other hand, 
the current density due to a charged spinor field: 


s# (a) = ei (a) BAY (x) (4.6) 


is already a second kind pseudovector, (2.43). 

As a conclusion, we can say the following in the 
c-number theory. As far as mirage is concerned, par 3S 
exactly what it should be, pir’=pw, for all the quan- 
tities. For reversion, the mechanical quantities of 
tensorial fields and the electromagnetic quantities of 
spinorial"fields have the right signs, pr’=pr, but the 
electrical quantities of tensorial fields and the mechan- 
ical quantities of spinorial fields have the wrorg signs, 
pr’ =—pr. 

Regarding the charge-conjugation, we can see from 


is 
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ts 


le 
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Eq. (4.4) that the interchange of v and 7 will result ina 

reversal of sign of the current. Thus, we can define the 

c-number transformation for charge conjugation by 
ei, Jey, (4.7) 

A 

It should bẹ noted already at this state that by (4.7) 

a reprêsentative term in the free Lagrangean will be 

transformed like 


» > (00/dx,) (30/ðx)— (d0/dx,) (35/31), 


showing that, if we adopt the Fermi statistics in 
q-number theory, the transformation of Eq. (4.7) will 
result in an undesired sign-change of energy. 

For the charged spinor field it is suggested by Eq. 
(4.7) that y may probably be transformed to a multiple 
of yX. Since we have seen that y and ¥*K have the 
same transformation rule for the congruent group, it 
would be a natural choice to take 


© yik, Peyk, (4.8) 


"as the c-number transformation rule for charge con- 
jugation. However (4.8) would transform Eq. (4.6) into 


© eip K OEK Y= eip bX (4.9) 


on account of Eq. (2.19). Equation (4.9) is, in the 
c-number theory, equal to Eq. (4.6). Thus Eq. (4.9) 
does not change the sign of the current-vector. How- 
ever, if we adopt the Fermi statistics in the q-number 
theory, Eq. (4.9) will become the negative of Eq. (4.6), 
“except for a c-number additional term.” We shall 
later see that we can drop this last restrictive clause by 
adopting the so-called Heisenberg prescription. 

Now, ineanticipation, we shall sketch what will be 
done in the q-number theory to remedy what was wrong 
in the c-number theory and to retain what was right 
<in the c-number theory. Take, for instance, Eq. (3.12), 
where-the matrix M is supposed to operate on the field 
variables. The left-hand side of Eq. (3.12) means, to 
use the notation of Eq. (4.2), 


O (Mx) = Ou (Ma) Mz], 
and the right-hand side means 
OL Mux) M], 
ve Eq. (3.12) is equivalent to 
i Of Mu(x)M- jx] =p OLu Mx) Mx]. (4.10) 


Since pm =p, the second equation of Eq. (4.3) be- 
comes : 

OLMtu Nex) a ]= p40 lu (Max), Mx], (4.11) 
having the same right side as Eq. 4. 10). Therefore all 
we need do in the q-number theory 1s to put 


Mu(x)M =Mu (Mz). (4.12) 
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Thus, for instance, since we know that the effect of 
Wet on a spinor is essentially multiplication by Era we 
get, from Eq. (4.12), Š 


Myx DM- = "Erp (xt). (4.13) 


If we want to be more general, we could also put 
Mp (x,) MA = Eel Vp (— x,/). 


However, this is nothing but the combination of Eq. 
(4.13) with a transformation of the type 
WyWo=Wy, WOW'*=0, (4.14) 
where W” is again an operator of the type in Eq. (3.40), 
which can be studied separately from space-symmetry. 
As for reversion, we have to note that the right side 
of Eq. (3.11) involves the transposition. As a result, the 
effect of R on u must contain the transposition (u—u") 
besides the transformation that could be inferred from 
a comparison with the c-number theory. First, regarding 
the tensorial field, pr’ is wrong only for the electromag- 
netic quantities. We can remedy this by assuming that 
the effect of R is the combination of the cnumber 
transformation for Jt and the c-nuraber transformation 
for © as in Eq. (4.7): 
Rul) R=RCuT" (Rx). (4.15) 
As © does not change the sign of mechanical quantities, 
the correct sign pr’ of the free Lagrangean will be 
retained by Eq. (4.15). Moreover, since © in (4.7) 
interchanges once v and @ and, since Eq. (3.11) also 
interchanges v and #@, as can be seen from Eq. (3.15), 
the transformation in Eq. (4.15) will serve the purpose 
without any specific assumption as to the statistics. 
Suppose we apply the same procedure as that of Eq. 
(4.15) to the spinor field, viz., let us assume that the 
effect of R is essentially the combination of the e-number 
transformation of y for reversion and the charge-con- 
jugation of (4.8). Then, on account of the c-number 
reversion, £(x) changes the sign, but the effect of 
(4.8) will be such that (4/dz,) is interchanged to 


_ (d/dx,). Thus, it will change the sign of the first two 


terms of Eq. (4.5). Since K7E;K=E;", while KOE,K 
=—E,7, the effect of (4.8) will be such that YE 
changes the sign, in contrast to Eq. (4.9). Thus, such 
a combination will correct the sign pr’ of Eq. (4.5). 
On the other hand, (4.8) does not change the sign of 
Eq. (4.6); therefore, the right sign of the current will 
be retained. In this argument, it is understood thac, 
since Eq. (3.11) involves the interchange of the factors 
y and y* and (4.8) also interchanges these factors, we 
need not invoke a specific assumption regarding the 
statistics.!® This shows that the desired effect of R 


—— -2 


en 
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would be 


RIY(ÒR =e" Eo KY*T(— i), (4.16) 
where Ær stands for the c-nuniber transformation for St. 
As regards charge-conjugation, we have already seen 
that (4.7) and (4.8) give the correct signs provided we 
adopt Bose statistics for tensorial fields and Fermi 
statistics for spinorial fields. Thus, we should take 


CuCC = Qu, (4.17) 
where © is (4.7) or (4.8). 

The pion nucleon interaction plays a unique role in 
determining the kind of pion field. Tensorial fields in 
general appear in a quadratic or bilinear form in the 
free Lagrangean. Their sign in no way affects the 
mechanical behavior of their fields. However, as has 
been seen in Part I, there seems to be a sound physical 
reason to believe that the electromagnetic field is 
“observable,” in the sense that its sign-change actually 
changes the physical situation. Thus we could meaning- 
fully assign the second kind to the electromagnetic 
quantities. This determination, in essence, has been 
done through their interaction with the charged field, 
by assuming that, in a reversed state, a particle of the 
same charge is performing a kinematically reversed 
motion. 

In contrast to this, there is room for doubt as to 

whether there is any physical meaning in assigning a 
kind to the pion field. If this assignment is to be done, 
it must be possible only through the pion-nucleon inter- 
action, which is the only term where the mesic field 
strength is standing in the first power. Indeed, we could 
determine the interaction type by experiments, and 
require the regular invariance of the interaction La- 
grangean. The sign-change of the source involved in 
this interaction is not necessarily uniquely determined. 
For the neutral pion, the source is represented by a 
quantum jump from a proton state to another proton 
state, or from a neutron state to another neutron state. 
Therefore, the source has a definite sign-change for W? 
and Jè for the neutral meson. For a charged pion field, 
the source represents a transition from a proton state 
to a neutron state, or vice versa. Therefore the sign- 
change of the Source depends on the relative phase- 
change of proton spinor and neutron spinor for M 
and N. So far there is no reason to believe that there 
should he any physical mean‘ng in such a relative phase- 
change of proton and neutron. Therefore, in conclusion, 
we could say something regarding the neutral pion field, 
but we cannot say anything definite regarding the 
changed pion field. See Sec. 10 for more details. 


cally assigned transformations of the field variables, and the 
»q-number transformation” means the one consistently corrected 
according to the consideration of this section. Thanks are due 
Dr, R. H. Géod who has pointed owt this Jather misleading use of 
terminology. A e l 

E? 
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In passing, it should be noted that we have to include 
in Eqs. (4.15) and (4.17) the arbitrary phase factor 
exp(ia) of (4.7) and (4.8). Even if we obtain Eqs, 
(4.15) and (4.17) with a fixed phase factor in a certain 


gauge reference, an arbitrary phase factor exp(ia) will 


reappear by an arbitrary gauge transformation, for 
these transformations connect a field variable to its 
Hermitian conjugate. The situation is different for M, 
since a gauge transformation will leave Eq. (4.12) or 
(4.13) invariant. This situation is connected with the 
fact that the condition, M@?=c-number, actually deter- 
mines the phase factor involved in Eq. (4.12), e.g., 
exp(ia) in Eq. (4.13), while the conditions on C? and 
RTR have no bearings on the phase factors in (4.7) 
and (4.8). This situation will become more clear when 
we discuss the concrete cases. 


5. PION FIELDS 


We can describe the pion field alternatively by three 
Hermitian field components 2° (p= 1, 2, 3). or by one 
complex component v and a Hermitian component 1), 
with É 

u® = (v+d)/V2, u®= (v— 7) /VZi. (5.1) 
a 
However, the gauge transformation (with constant 
phase), 


Aa JE *°5, (5.2) 


is equivalent to 


u— cos yu — sing), 
(5.3) 


usinou + cosou® 


resulting in mixing of u® and «©. There is no obser- 
vational ground for separating the charged pion-fields 
into u®-field and u®-field. The effect of a transforma- 
tion ©, expressed in terms of independents u® and 
u® will then involve the danger of losing from sight 
the physically pertinent general features. 

We shall first discuss a scalar « (of unspecified kind) 
and later briefly survey the case of a vector u (of un- 
specified kind). The free Lagrangean density and the 
Lagrangean density of electromagnetic interaction are, 
in the case of a scalar u, 


Lu=— (00/0x,) (3v/ 3x) — Kv 
—1(du®/dx,)(du®/du)—4eu@u®, (5.4) 


Lau=A,s*—A,AXs, _ 65 
where s# is given by Eq. (4.4) and s by 

S= en. i (5.6) 
The pion-nucleon interaction will be discussed: sepa- 
rately in Sec. 10. ` 


The defining equations of R, M, and C are Eqs. 
(3.11), (3.12), and (3.14) applied to &., s” and s. Since 
£u and s must belong to the regular kind and s” to the 


be 


l 
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second kind, we should have 


pr=pm=pc=1 for Lu, 

—pr=—pu=—pc=1 for s% (a=1,2,3) | 
pr=pm=—pc=1 for s°, [ on) 
pr=pm*pc=1 for s. 


From (4.7) and Eq. (4217), we get 


CoC eiag, CIC tHe, Cu@CI=+u8, (5.8) 
It is obvious that Eq. (5.8) satisfies the requirements 
in Eq. (5.7) for £u only if v obeys the Bose statistics. 
As for s*, if itis written in the form given in Eq. (4.4), 
the assumption regarding statistics is necessary in 
order to satisfy Eq. (5.7). However, the c-nurnber 
theory can just as well lead to the expression of s#: 


“mid o (dv/dx,) 0]. (5.9) 


If we use this expression for s#, the transformation in 
Eq. (5.8) satisfies the requirement in Eq. (5.7) for s“ 
without an assumption regarding statistics. It is to be 
noted thas Eq. (5.8) is compatible with the supposed 
properties of C: C?=1, CC=1 [see Eq. (3.51) ]. The 
transformation in Eq. (5.8) also leaves the usual com- 
mutation rules unchanged. 

Since the interchange of a factor in v and a factor in 0 
results, because of the D-function in the commutation 
relation, in an additional c-number term, it is desirable 
to replace an expression of Q by 


Q—310+pcCQOC+], 


or at least tS require that the expression of Q must be 
such that it remains invariant by the transformation 
in (5.10). Then wé no longer need be concerned about 
the additional c-number term. If the quantities are 
written’according to (5.10), charge-invariance is auto- 
matically satisfied. If we apply (5.10), to £u, then the 
expression ‘will become such that it will vanish for the 
Fermi statistics. 

A similar prescription to (5.10) was first introduced 
by Heisenberg in relation to positron theory to avoid 
the zero-point charge, and (5.10) has the same effect 
for the v-field.!? For instance, the expression of Eq. (4.4) 
for the current does not obey the Heisenberg prescrip- 
tion, but Eq. (5.9) does. It is well known that Eq. 
(5.9) leads ‘to the vanishing zero-point charge. It is 
easily understandable that if the theory is exactly 
charge-invariant, then’ no zero-point charge should 
appear. For, if there would be any zero-point charge, it 
should as well’be positive as negative. 

Next, following Eq. (4.15), we can write down the 


(5.10) 


19 W. Heisenberg, Z. Physik 90, 209 and 92, 692 (1934). See 
also W. Soa Modern Phys. 13, 203 (1941), in particular, 
pp. 208, 224. 
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relations that R should satisfy: 
(R-o(i)R)? =e 0(—0, 
(REOR) =e (— t), z (5.11) 
(Rn (OR) =u ®(— i). J 


This automatically satisfies Eq. (5.7) for £u. If s” is 
expressed as Eq. (4.4), then Eq. (5.11) automatically 
satisfies Eq. (5.7) for s”. If it is written as Eq. (5.9), we 
need an assumption regarding statistics to satisfy Eq. 
(5.7) for s#. Since the c-number theory provides no 
reason for preference between Eqs. (4.4) and (5.9), we 
cannot claim that reversibility has the power to deter- 
mine the statistics. 

It should be noted that Eq. (5.11) is compatible with 
the anticipated properties of R: RT=R [Eq. (3.45) ], 
f&=R". The transformation in Eq. (5.11) also leaves 
the commutation rule of v and « unchanged. 

The combination of C and R, considered in connection 
with Eq. (3.56), has two expressions: 


C(R OHR) C =e, | 


: (5.12) 
(R€CI(OC-R) =e nli | 
The difference between these two, expressed by the 
unitary transformation W in Eq. (3.57), is essentially 
a gauge transformation of v, which will be discussed 
separately. 

Analogously to (5.10), in order to have the “exact” 
reversibility, it is desirable to require 

OAO Hor(R O AR)T]. (5.13) 

For instance, the expression formally desirable for the 
current would then be 


s*= diel (d0/dx,)v— T (dv/dx,) 


+0(d0/dx,)—(do/dx,)5]. (5.14) 
Obviously this does not bring about anything new. 

The above argument shows that the requirement of 
charge invariance determines the statistics-type of the 
field® but the requirement of reversibility does not. It 
is of some interest to note that we can restate this situ- 
ation saying that the compatibility of (5.10) (charge- 
invariance) and (5.13) (reversibility) determines the 
statistics. For instance, the compatibility of Eqs. (4.4) 
and (5.9) requires the Bose statistics. We can generalize 
this statement for other quantities and fields. However, 
the underlying fact is that if we suitably express the 
physical quantities in the forms allowed by the c-num- 
ber theory, reversibility alone can always be satisfed 
without a specific assumption regarding statistics. 

If we translate the transformations in Eqs. (5.3) and 


2 W. Pauli and F. J. Felinfante, Physica 7, 177 (1940). 
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iC 11) into the language of the Hermitian uw, we obtain It is easy to show that a K and M can be expressed 
eee h in the operator forms as follows”: 
CuMC = cosau -sinau ©, p 


OE nu cosa o oe =(—1)*(+1)°, 7 
: ira 5. i a | 
(Ru (OR)? = cosBu (—1)-+sinBu (—d), b=3 TE (VE ay J+ (k) g-(k) | 
(Ru (OR)? =sinBu™ (—1)—cospu(—2), —e~‘ag_(k)g,.(k) ], -(5.24) | 
= showing that it is meaningless to speak of a trans- ©=} NV;3(k), te 
© formation rule for # alone or 7) alone. We can write i: ! 
= &y in two terms referring separately to wu and u®, dy 
= but the equivalence of such an expression to Eq. (5.4) ; R=e*(—1)¥(—1)® i : 
= requires the Bose assumption. with je 
Finally the mirage operator is given, in accordance, _ ea l 
with Eq. (4.12), by a N), l 
i Molx)M = ei(—x), v=; iz 2 [Ni (k)+N4(—k) 
š Mo(x)M7=e"*70(—x), (5.16) — G.(k)gs(—k)—gs.(—k)g..(k)], ¢ (5.25) 
Mu® (x) M= +1 (—x). O=4 ZW: (k) +N3(—k) 
k t 
ese relations are compatible with the relation M~ 2 mel Veet ae OK AJ. 
M and the commutation rules. However, in order to (=)gs(k)gs(—k)— (+)g:(—k)gs(k)]; 
tisty M= c-number, we have to adopt y such that MECE 1) | 
e't¥=+1=¢-77, (5.17) with ‘ | 
* = 
This situation makes a contrast to the case of charge- ®=% D [N+ (k)+N+(—k)— (+) GW g4(—1) 
Co ugation in which C?=c-number does not deiere 3 z s 
‘the phase factor in Eq. (5.8). The double-sign in Eq. = (E)9+(—k)g+ (kK) + N_(k)+N_(—k) 
(5.17) and the one in the third relation of Eq. G. 16) are T E EN a i 
E eaae He- (+) (6.26) 
Expressing the u-feld by its Fourier expression, O=5 INE) I Ns(—k)— (+)/93(k)g3(—k) | 
> Í = 2y. —} = y 
) X( w) Lg(k) exp(+ —) —(+)’93(—K)ga(k)]. | 
| 


+9-(k) exp(—+)], (5.18) The arbitrary signs of the entire C and M are so chosen 
)=> (2Ve)-Lg3(k) exp(-++—) that CWyac= MY vac= Y vac: % 

k The summation symbol with a prime, 57’, over. k 

+93(k) exp(—++)], (5.19) means a summation such that k and —k would cover 

i vd ; j the entire momentum space. For Eq. (5.25), it is 

a (+—)=exp(+ik-x—iel), (5.20) assumed that g and g os ee in a usual repre- 

© exp(—++)=exp(—ik-x+iw/), ; sentation in which they have real matrix elements. In 

“a Eq. (5.26) (+) and (+) refer to the same double-signs 


E o= |+], in Eq. (5.23). i 
Eqs. (5.8), (5.11), and (5.16) The vector pion field (of unspecified kind) can be 
Eek described by a l tor v#(x,f) and a Hermitian 
Ce (k)C eisg. y a complex vector v 
e) ORAS) vector u#(x,/) with u=1, 2, 3,0. The transformation 
ac iag, (k), (5.21) Tules corresponding to Eqs. 6. 8), (5.11), and (5. 16) 


will become 
CC eiti, Cy OCHE, (5-27) 

Rin ()R)7= CRA 0), 
(R14, (AR) T= -u«(—2), 
Mo, (x) M= (+)o#(—x), ig 
Mu, (x) M= (4)/u#(— x). í 


method W here i is foken irom L. Wolfenst 


(5.22) (5.28) 


ee 


to 


mE 
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It should be noticed that in Eqs. (5.28) and (5.29) we 
have cdvariant components on the left sides and contra- 
variant components on the right sides. This will take 
car@&of the vectorial transformations of the field vari- 
ables for Ñ and M. 

The plane wave expansion of the vectorial field needs 
a little caution. The waves will be decomposed into 
transvefsal waves (r=1,2) and longitudinal waves 
(r=3). The transformation rules for Jt and W of the 
annihilation operators are different for the two groups 
of waves. On one hand this will take care of the fact 
that the longitudinal polarization is defined by the 
propagation vector k which changes its sign for Jt and 
IN, while, on the other hand, it will take care of the fact 
that the component u=0 has different transformation 
rules in Eqs. (5.28) and (5.29). The relations corre- 
sponding to Eqs. (5.21), (5.22), and (5.23) will be as 
follows: 


Cos (Cieee, 7=1,2,3, | 
= (5.30) 
Gea(k)t=g;(k), r=1,2,3, J 
(Rtg. (K)R) =e), r=1,2, 
æ =—eri8g,(—k), r=3. 
ee eS (5.31) 
(R1g3,(k).R)? = g3-(—k), =1, 2, | 
=F ¢3,(—k), r=3. 
Mg (k)M >= e+Yg,,(—E), FH, Be 
y=0 Or 7, 
=—etig, (—k), r=3, (5.32) 
Mgs-(k)M= (+)'g3,(—k), 7=1,2, 


=—(+)’gs(—k), r=3. 
It is assumed in these expressions that the positive 


directions of the transversal polarization, 7=1, 2, are 
defined to be the same for both k and —k. 


,6. ELECTROMAGNETIC FIELD 
A. Linear Momentum Representation 


In the case of electromagnetic field, the sign-functions 
pc, pr, and par of the field variables are well-defined. 
‘Therefore, all we need is to show that the C, R, and M 
defined by these sign-functions do not change the sign 
of the free Lagrangean and that they leave the Lorentz 
condition unchanged. The invariance of the interaction 
Lagrangean will be guaranteed, if the other fields are 
so adjusted that the electric current generated by these 
fields becomes a second kind pseudovector. 

By the direct application of Eqs. (3.11), (3.12), and 


(3.14) to the electromagnetic potential, we obtain 
CA,C1=—Ay,, 
(R44, (DR) T=—4"(— À), 
MA, (x) M= — A#(—x). 


(6.1) 
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The transformation of a covariant component into the 
contravariant component takes care of the required 
transformation of a second kind pseudovector. We see 
that Eq. (6.1) is compatible with Eqs. (3.45), (3.50), 
and (3.57) with W =1. The transformations in Eq. (6.1) 
leave the commutation relations of A unchanged. 

The transformations in Eq. (6.1) certainly make the 
free Lagrangean a regular scalar, and transform the 
Lorentz condition, 


(0A#/dx*)¥(L) =0, (6.2) 
into the corresponding equation for the charge-con- 
jugate, reversed, and miraged states. 

In this subsection, we shall use the linear momentum 
representation for simplicity’s sake. In the next sub- 
section, we shall use the angular momentum represen- 
tation, because this representation is “compatible” 
with the parity operator. In the last subsection, we shall 
use a mixed representation mainly for the purpose of 
applications. 

If we expand A#(x) by the plane waves: 


Ar(x, D =E (2Vw) Lek) exp(+—) 
+k) exp(—4)], (6.3) 
|k[ =a, exp(+—)=exp(+ik-x—iwt), 
the defining equations in Eq. (6.1) become 
Cgt(k)C1= — g*(k), } 
(R-"g#(k)R)? = —g,(—k), (6.4) 
Mg#(k)M—=—g,(—k). J 


Equation (6.4) shows that in a representation in which 
g and g have real matrix elements, R and M become 
identical. The explicit expressions of C, R, and M are 
then (the sign being so determined that CY ra= MYP uas 


A A) 
C= TEE i (6.5) 
R=M=(—1)*, 
3 
p=} E’ E LV) +.¥e(-k) +97 e(—h) 
J (6.6) 


+e DeDIHEWO +I) | 
— Pk) e(—k)-9 (—k)g (k) J. 
B. Angular Momentum Representation 
We only consider a pure A field 
A%=0, divA=0. (6.7) 
A physical quantity Q derivable from A can be written 
in terms of a Hermitian operator O operating on A: 
Q=—i[(aA/a1)-OAJ, ` } 
where the brackets mean. the scalar product often 


— = — 


CH. 
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spatial vectors (@A/d/) and OA, and O is in general a 
tensor in the 3-dimensional space. We have to consider 
energy W, orbital angular momentum L, spin angular 
momentum S, and total angular momentum J, which 
are respectively given by 


0 $ 
cen L= —i[[y(8/əz)—s(3/ðy)]l, etc.; 
t 


00 0 100 (6.9) 
S:=|0 0 —i/,etc.; 1=/0 1 0/;J=L-+S. 
lo i 0 omom 


Except for 


[S2)Sy]-=iS,, [L.,Ly)-=iL., [J2Jyl_=iJ,, etc., 
these quantities commute among themselves, with Eq. 


(6.7) and with 


(6.10) 


Calling the eigenvalues of W, J?, L?, J., respectively, 
@, j(j+1), U(/-+-1), and M, we denote the corresponding 
eigenfunctions by 


ule, jl M ](x,9,2,2), (6.11) 


where the brackets indicate their eigenvalues and the 
parentheses their dependence on space-time. To give an 
explicit form to u, we introduce the three eigenvectors 
elu], (u=—1, 0, +1) of S=: 


e[1]= — (1/v2) (eztie,), e[0]=e., 


(6.12) 
e[— 1]= (1/72) (ez-— iey), 


which satisfy 
(e*u ]- e[u) =5(u,n’). 


Next, we introduce the normalized eigenfunctions 
Yı” (0,9) of L. and L? which satisfy 


(6.13) 


fr "Vy sinddédp=6(m,m')s(1,l’). (6.14) 


Now we can write u-with the help of the Clebsch-Gordon 
coefficients”: 


uloj tA] (2,9,2,1) TE blw] 0) x, Cu (jM; m,y) 
mu 


XF” 0, p)eLu], (6-15) 


where b should ‘be determined by Eq. (6.10) and can 
be expressed in terms of Bessel functions of half-odd- 


—_ 


T= oe $ 3 

2 See, for instance, J. M. Blatt and V. F. Weisskopf, Theoretical 
Nuclear Physics (John Wiley andsSons, Inc., New York, 1952), 
pp. 789 ff. y 2 3 
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integer order, Br; (wr): 


Phor) = (1/7) Bu lore! (6.16) 
A 
which can be normalized by 
R a 
lim(1/R) f O/A Bra lor) (1/7) Bry (ordre 
R=>2 0 K 
=d(ww’). (6.17) 


The left-hand side of Eq. (6.17) can be expressed: again 
in terms of Bessel functions,” and assuming asymptotic ` 
expressions of the Bessel functions for large R, we can 
satisfy Eq. (6.17). In particular, since we are interested 
in the solutions in the empty space, we have to take a 
superposition of incoming and outgoing waves, which is 
free-from singularity at the origin: 


Biz3(wr) = (mw) >J lwr). (6.18) 


Using Eq. (6.15), we can expand A as 


A(x,)=(2R) Z 2 2 at 


Xe, j, l ule, j, l, M ](x,f)-+Hermitian conj.] (6.19) 
From the completeness of the ws, we have the inverse 
orthogonality : 


lim > X D Z (1/R)u [e,i M0) 
t M: 


Row w 7 


Xue, j, M] (s/t) =8(x—x)8(a,8). (6.20) 


With the help of Eq. (6.20), we can easily show that 
the commutation rules: 


[sloj lM), gloj MNL Es 


=ò (ww) (j, NEU NM, M’), etc. (6.21) 


lead to the ordinary commutation rules of A. _ 
Next, in order to determine the effect of R and M 
on the g’s, we first have to determine the sign-change 
of the ws for Ñ and M applied to their argument 
(x,t). Using the symbol introduced in Eq. (2.73), we 
get from Eq. (6.15) 
OMe, AVA Sars 
zuu w, 7,1,M | ( 1) ule, j, M], (6.22) 
ulw, j, p M]= (= 1) iMn w, j l M]. ‘ 


The first relation represents the well-known trans- 
formation rule of Y,” while the second relation can be 
derived by noticing: 9 


Cu(j, —M; =m, =p) me (= 1) HAC), (7,M,m,n), 
Yr”r=(—1)PY m, ef —n]=(—1)"e*[x], -(6.23) 
bo] (r, ae t) = lo] (7,1) 2 


2G. N. Watson, Theory of Bessel Functions (Cambridge Uni- 
versity Press, Cambridge, 1922), Eqs. (8) and (11), pp. 134-135, 
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Then, substituting Eq. (6.19) in Eq. (6.1), we obtain 


Cglo, j, M)C == —g(w,j,1,M), 
Nelo, j, M)R)T= (1) glo, j,1, =M), + (6.24) 
` Mglo, hL M)M = (~1)1g(w, j,1,M). j 


The effect .of C in this representation is essentially 
the same as in Eq. (6.4). The important feature of this 
representation is that, except for the factor (—1)"#1, the 
effect of 'M is to transform a g into itself with the same 
(w,7,1,M). This is of course a consequence of the fact 
that M commutes with the quantities defining the 
eigenstates in this representation. 

We can satisfy the third relation of Eq. (6.24) by 


M=+][(— 1)¥ eiA, 
l 


even 


(6.25) 


where the double-sign is not determined even by the 


condition M?=1. However, if we want to have 
e 


MV ync=+Vrvacy (6.26) 


we have to take the upper sign. Any state in this 
representation may be expressed by 

Y=) aiio Yvas (6.27) 
with the g’s used in Eq. (6.19). If the number of g’s 
with J=even in each term of Eq. (6.27) is even (odd), 
the state of Eq. (6.27) is an eigenstate of M corre- 
sponding to the eigenvalue +1(—1): 

MY=r¥, 


moti. (6.28) 


The value of r, is the “space-parity” of the state Y. 
The eigenstates in Eq. (6.28) of M are “self-miraged” 
states in the sense of Eq. (3.54). 

o In particular, if a single quantum is excited, 


P= Frac (6.29) 
(—1) gives the parity of the state, where / is the 
orbital angular momentum of g. Comparing this result 
with Eq. (6.22), we see that the parity of the state in 
Eq. (6.29) is the opposite to the parity of the corre- 
sponding u in the cnumber theory. This is, however, 
in agreement with the usual definition of parity of elec- 


‘tromagnetic field, since the parity of u is the same as 


the parity of the electric field and opposite to the 
parity of the magnetic field, and the parity is defined 


usually by the magnetic field h”: 
h(x)=+h(—x). (6.30) 


The only caution we have to take is that the c-number 
theoretical definition of parity in Eq. (6.30) should be 
reinterpreted in the q-number theory through Eq. (6.22) 


2# J. M. Blatt and V. F. Weisskopf, reference 23, p. 585. 
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and not through 
(W,h(x)) = +: (W,h(—x)v). 


For, Y being a self-miraged state, we have always the 
positive sign in this equation (see Part I, Tables ITI and 
V). If Y is an eigenfunction of photon numbers, then the 
phase of the field is completely undetermined, and its 
expectation values vanishes any way. 

If we introduce a distinction between the “electric” 
radiation and “magnetic” radiation by 


“electric”: j=/+1, P 
us (6.31) 
“magnetic”: j=}, 


we have the rule that the parity of an electric radiation 
is determined by (—1)?, and that of a magnetic radia- 
tion by (—1) #1. 

The charge-parity can be defined by 


C=I(—1)¥, CWvaa=Yracs k] 


CY =r, Te=+1. ere 
The reversion operator can be written 
R=(—1)®, ə 
O= 2 Z L L N oj LM) TN o, j l, =M) | 
oye eA f (6.33) 


FA (= 1) M (w, j, p M)g lo, P L —M) 
ra (= 1) EMG (cy, J; L, —M)glw, j: M)].) 
C. Mixed Representation 


In this representation, linear momentum p and cir- 
cular polarization, i.e., spin angular momentum S, in 
the direction of propagation p are used. For instance, 
for the waves propagating in the z-direction, the circular 
polarization can be expressed by S: Obviously, 5S; 
does not commute with pz and p,, but if p.=0 and 
p,=0, we can still use S, to characterize eigenfunctions. 
Thus, we can write for such waves 


Aes) = OVJ E Eelko] (t) 
-Hermitian conj.] (6.34) 
u[k. u (x) =exp(ik:-Z—ivt)e[u], 
a a (6.35) 


where e[0], [see Eq. (6.12)], should be absent on 
account of transversality as seen in Eq. (6.7). From 


this follows 
Cg (k:,u)C*= —gk.s), m3 a 
(Rg (k.u) R)” = gE kas afr (6.36) 
Mg(k.uw)Mt=—g(—ky we), J T 


For the purpose of a later application, let us~also 
consider the simple reflection Wt. of the z-axis: 
(x,4,2,1)—>(—2, y, z, i% The operator M corresponding 

O 


with 


` > = 
nari Collection, Haridwar 
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to Mt, should then be defined by 


M,A,(x,y,3,)M=—A,(—4, Y, 5, l), 
(6.37) 
MAy, 2(%,9,2,/) Ma = ‘FA v z5, Y: 2, 1). 


Since e[1] and e[—1] differ only in the sign of ez, [see 
Eq. (6.12) ], we obtain 


Maglkau)M a= g(k:, — u). 


One of the advantages of this mixed representation 
is that we can easily discuss the connection between the 
spin und the linear polarization of a photon. The 
operators, in the sense of O in Eq. (6.8), representing 
the linear polarization are 


10 0 0 0 0) 0 0 0 
m= 10 0 Ol, m\0 1 0}, m:=|0 0 OJ, 


0 0 0 0 0 0 © @ i 


(6.38) 


(6.39) 


and the corresponding physical quantities Q are 
ees ii [(0A/di)-reA AV, (a=x,y,2). (6.40) 


We can give more uxplicit expressions to II. by the use 
of Eqs. (6.34) and (6.12). In particular, for a given 
value of k., we get 


TI. (#2) =3LN (kz, HIEN (kz, —1)— 0 (k, — 1) 
Xg (kz, +1)— g (k, +1)g(k., =i, 

(6.41) 
p(k) =3LN (key +1) +N (Bz, —1)+9(:, —1) 
Xg(k:, +1)+9(k:, +1)g(k., —1)]. 


These quantities will be profitably used in our discussion 
of the positronium decay in Sec. 13. 


7. CHARGED SPINOR FIELD—LINEAR MOMENTUM 
REPRESENTATION 


In the case of a charged spinor field, we have to con- 
sider the free. Lagrangean given in Eq. (4.5) and the 
current-density given in Eq. (4.6). When these quan- 
tities are substituted in Eqs. (3.11), (3.12), and (3.14), 
we should have 


po=pr=pu=+1 for £, 
—pc=—pr=—pu=+l for s*, (a=1, 2, 3), (7.1) 
—pc=pr=pmu= +1 for s’. 
These requirements can be satisfied by 
CyCt=—eiKyX, CYC = eig, (7.2) 
a (RY (DR) =e EoKY*(—4, (7.3) 
(RYX()R)T =e *Y(—) KE, 
` MYM =e Evy (—x), 


MYX M+ =—e-y*(—x) Ean. 


(74) 


It can easily be seen, with the help of Eqs. (2.18) and 
(2.20), that these transformations are compatible with 
the unitarity of C, R, and M. The usual commutation 
rules of y are left unchanged by these transformatic.ss. 
The combination of the two relations of Eq. (7.2) 
gives (C)?¥(C7)*=y, agreeing with C?=1 ‘rrespec- 
tively of a. The reiteration of the transfermations in 
Eq. (7.3) gives 


RTR-W()RR™41=— VY), 


implying 
RTRI=A. (7.5) 


The double mirage, according to Eq. (7.4), yields 
May (My =— ey, (7.6) 
which means, among other possibilities," that 


M2=1 for e=, 


(7.7) 


M2=A for e7=+1. 


According to Eq. (3.50), we shall adopt the former 
alternative. s 

We can easily see that the double-time-reyersal in 
the sense of Eq. (3.56) can be a cnumber or A, or a 
more general W. If we apply Eq. (7.2) first and then 
Eq. (7.3) we get 


(RCW (1)C7R)T = — eH Ew (— i). (7.8) 

Tf we invert the order of these two, we get 
(CTR Y (NRC) T = — ee) Egy (— i). (7.9) 
By using Eq. (3.57), we obtain from Eqs. (7.8) and (7.9) 


WYW = 2ile- Py, (7.10) 
which means thatifa=6 ora=p—7 then W=c-number, 
and if a=$-+7/2 then W=A. In the latter case, the 
double-time-reversal in the sense of Eq. (3.56) will 
become a c-number because of Eq. (3.58)..The more 
general case of IV where a and £ are arbitrary will be 
included in our discussion of the superselection rules. 

The relation in Eq. (7.2) satisfies the requirements of 
Eqs. (7.1) for Eqs. (4.5) and (4.6) only on the assump- 
tion that y and yX anticommute. As a result of the appli- 
cation of the commutation relations, there appears a 
c-number additional term. In order to avoid such an 
additional term, it is desirable to write £ and s# in the 
form ? 


£->}(£+pcC2C-) 3 
T LE, (a/ 0x,)—E, (a/ xa) H 2imEs Y 
ae 


—10[E,7 (8/dx,)—E,7(0/dx,)—2imEs? WX, > (7.11) 


Fo sebpcC#C9) = (XEN VE), 
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Naturally, Eq. (7.11) is written so that the requirements 
in Eq. (7.1) are automatically satisfied. If we assume 
the Bose statistics, Eq. (7.11) becomes essentially a 
c-aumber. Š 
We shall now proceed to determine the effect of C, 


‘R, andMVf in the linear momentum representation. This 


actually cérgesponds to what we have called mixed 
representation in the case of photons. In this represen- 
tation, we use the following mutually commuting 
Hermitian operators: ° 


pa=—1(0/dx.), 
H= EoEapat EoEsm, c= EE sEabal 2p, 


(a=1, 2, 3), d=1(0/dt), 7 a 
(7.12) 


with 


p=|(X pal, (7.13) 


where o means the spin in the direction of propagation 
pe. Denoting the eigenvalues of pa, Ho=9, o respec- 
tively by k, w, m, we use the expression: 
. 
Yik onu](x t), o= (k-Hm), (7.14) 


for the corresponding eigenfunctions. The absolute 
value of &is determined by k, but its sign is an inde- 
pendent quantum number. Thus we can expand any 
solution of the wave equation as 


¥(x,)=(V)? 2X pA) (7.15) 


` 


where À stands collectively for (k,w,u). 

Next we investigate the effect of the transformations 
appearing on the right sides of the relations in Eqs. 
(7.2), (7.3), and (7.4) on the eigenfunctions in Eq. 
(7.14). In other words, we shall study the behaviors of 
the functions: 


s KDG), BKPDIG -D aio 


o Ev) |(—x,)), 


towards the operators in Eq. (7.12), as compared with 
the behavior of the original function of Eq. (7.14). 

The transformations from Eq. (7.14) to Eq. (7.16) 
can be decomposed into: multiplication by K, Hermitian 
conjugation, multiplication by J (to obtain y*), and 
multiplication by 


T=1EM: and P = Eo? ruz (Eo = —iE12), (7.17) 


where the nieaning of 2 symbol is given by Eq. (2.73). 
The factor 7 is inserted to make T and P Hermitian, but 
does not have much importance. 

The defining equations for Eq. (7.14) have the form 


Ov=O'v, (7.18) 


where © stands for ‘any one of the operators in Eq. 
(712) and O’ stands for the eigenvalue. From Eq. 
(7.18) it follows, by the application of the above-men- 
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Tapir I. The sign functions p's defined in Eq. (7.20) and used in 
Eq. (7.21) for various operators introduced in Eq. (7.12). 


PI PR Pr PP PIO PsP EPS 
pa H - + - - -- 
ò F = fer me +f 
Ho — — L = $ 
o + + ns A 


tioned transformations, that = 
(K(JHOS)7 K-K Y” (x,t) =O'RY“(x,0), | 
(TKUOJ)TKOT)EK Yx, —l) | 


(7.19 
=0'EoKY* (x, —2), | ) 


(POP) Evo (—x, )=O' Ern (—x, Ù. j 


The functions mentioned in Eq. (7.16) have the eigen- 
values O’ for the operators appearing on the left sides of 
Eq. (7.19), instead of the operator O for which (x,t) 
has the eigenvalue O’. To characterize these operators, 
we introduce the following sign-functions: 
J~10J =p s0= ps0, 
K0K=px0", 
(7.20) 
TOT =p70, [ 
| 
P“OP=p,0, J 


where OT should be understood as the transpose of O, 
as explained at the end of Sec. 2D. With the help of 
these p’s, we can write Eq. (7.19) more compactly as 


OY(x,) =O), 
OK Y*(x,f) =pspcO'Ky*(x,)), 
OE .KW*(x, —1)=p.spxp1rO’ EoKy (x, —1), 
OE way (—x, })=prO' Eva) (—x, t). 


} 
{ 
| 
poran 


The p’s for the operators mentioned in Eq.- (7.12) are | 
listed in Table I. | 

Reading the column for pypx in Table I, we see on 
account of Eq. (7.21), that Ky*(x,!) has the opposite 
signs for momentum, and energy and the same sign 
for spin ¢ as compared with ¥{x,f). Since ¢ is the spin 
in the direction of k, the same sign of o here actually 
means the opposite spin. In any event, if y[A] repre- 
sents a positive energy, Ky{A] will represent a nega- 
tive energy. Therefore, we can write, instead of Eq. 
(7.15), 


¥(x,)=V3 Clove) HP DIME) (7:22) 


An important difference between Eqs. (7 45) and (7.22) | 
is that à in Eq. (7.22) stands only for (ku) whileXin 


=. 


Eq. (7.21) stood for (kwy). Hereafter w= + [E-m]. 
The spinor adjoint ¢f Eq. (7.22) is, due to G*K)J> . 
me nes 


> a =~ 
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=yK-, i; Finally using the explicit expressions of y as plane 


¥*(x,)=V-4 Law IG, ONAN]. (7.23) 


The fact that AyX(=—y*K) has the opposite 
values of p and spin is just compensated in the q-number 
theory by the fact that, in an expression like y*JOp 
=yOy, the amplitudes a, and da) stand in the order 
a, while the amplitudes 5, and 6, stand in the order 
baba= 1— brbn. Therefore, baba will represent the number 
of particles having the same values of \ as daa, only 
with opposite charge. 

Reading the column for pyspxpr, we see that 
E,yKy*(x, —i) has the opposite k and the same value 
of ò, H, and o as W(x,/). Since k is opposite, the same 
value of o actually means the flipping of spin. Therefore 
we can write 


Exky* [k,n I(x, =e'*y[—k, u] (x, —9, 


where u on the left side refers to k and y on the right 
refers to —k. If we solve for ¥y[—k,»], keeping the 
same values of u, k, x, and ż, we obtain, by the use of 
Eq. (2.20), 


EkV(—ku ls, =)= eiku]. (7.25) 


If we rename —k and —/ as k and #, Eqs. (7.24) and 
(7.25) become contradictory. This contradiction is of 
course only apparent because the same value of y 
referring to —k and to k means opposite spins. The 
simplest way to avoid the confusion would be to divide 
the space into two hemispaces ©; and ©» such that 
x and —x cannot belong to the same hemispace, and to 
introduce a new spin-index 7 such that 


7=+1 


7=—1 if the actual spin direction is in Go. 


(7.24) 


if the actual spin direction is in Gy, 
(7.26) 


Then we can write consistently 
E-Ky*(k,7](x,)= n| —k, =r] —1) (7.27) 


suitably choosing e'*. Writing this way, we can now 
freely change the sign of k and ¢. Equation (7.26) is just 
a matter of convention, and we still consistently use the 
relativistic eigenfunctions. 

Next, according to the column for pp in Table I, we 
see that E24 (—x, t) belongs to the same energy and 
the opposite p and y (i.e., the same spin). Therefore we 
can write 


: 1B xewk,7 (x,t) =¥L—-k,7 ](—x, 2). 


Tko factor i is added to secure the symmetry with 
regard to the signs of k and x. It can be easily shown 
that there is no conflict between Eqs. (7.27) and (7.28), 
in the sense that the application of the transformation 
in-Eq. (7.27) first and Eq. (7.28) next gives the same 
result as the application of £q. (7.28) first and Eq. 
C2 a : 


(7.28) 


a 
po 


co 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Waves: 


Y[k,7 J=a[k,7] exp(+ik-x, —iwl), (7.29) 
we can rewrite Eqs. (7.22) and (7.23) as ae 
V(x)=VAY Cle (ky alkz] exp(+—) « 
© $9-k,)eCk,rIK exp(=+)], 
¥%(x,)=V4 X Pa ail al exp(— +) 
+g (k,r)a[k,r]K exp(+—)]. 
Using Eq. (7.27), we can also write for y% 


y*(x) =à 2 Z (—7)[g+(k,7) 


| (7.30) 
| 


Xa(—k, —7)K!Eo exp(— +) 
+g-(k,7)a*(—k, —7) Zo exp(+--)]. 
and using Eq. (7.28), we can wiite for y 

vx)=V4 x L [g (k,7) ((L123)a(—k, DEE), 
+g_(k,7)o%(—k, 7) (¢#321)K exp(—+)]. (7.32) 
Substituting first Eq. (7.30) into Eq. -(7.2), we get 

Cgi (k,7)C1=e'4g_(k,7), 
Cg_(k,7)C =~‘, (k,7). 


Substituting Eqs. (7.30) and (7.31) into Eq. (7.3), we 
obtain 


(R"g+(k,7)R)7 = —e?7g.(—k, —7), 
(R-"g_(k,7)R)? = —e7"rg_(—k, —7. 


(7.31) 


| (7.33) 


| (7.34) 


Substituting Eqs. (7.30) and (7.32) into Eq. (7.4), we 
obtain 
Mg. (k,7)M—=—ie%g,(—k, 7), + 
g+(k,7) B+ laas) 
Mg_(k,7)M— = —ie—“g_(—k, 7), 


which should lead to M?=1 for y=-7/2. The charge- 
conjugation operator C as defined by Eq. (7.33) can 
easily be obtained. Thus, 


C=C==0=]] JI C(k,7), 
KT: 


C(k,7)=1—WV,.(k,r)— N_(k,7) + '9g,.(k,7) 
X g_(k,r)-+e-9g_(k,7)g,(k,7). (7.36) 

Next for the reversion operator R, we have to satisfy 
Rg.(k,+)R=—e#®g,.(—k, —), 
Rg,.(k,—)R=+e*#g,(—k, +), 


where the usual representation of g and g is assumed in 
which g and g have real matrix elements. We can first 


(7.37) 


t 


19 


t9 


™ 


ae 
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get rid of numerical factors by putting 


R=RoR’, 
b =L I (etaa orena (7.38) 
Then Eq: (7.37) becomes 
Rig. (k,7)R'=g,(—k, —7) 


which is obviously satisfied by 


R=TT TI R'+ W, 


(7.39) 


RA =1 -NaH N k) 
tisk Heak tg k, ges), 


where the multiplication symbol II’ with a prime means 
that k showd be taken from one hemispace. Now we 
can confirm that the R, thus determined, satisfies 
RRT=A. Noticing that RoT= Ro and R’7=R’ we get 


(7.40) 


Sp RORT=R'RoOR'Ro. 


From Eq. (7.39), we see that the effect of the trans- 
formation by R’ is such that V..(k,+) and V.(—k, —) 
are interchanged. This amounts to an interchange of 
the factor of the type (—e*'#)* and the factor of the 
type (+-e*#8)¥ in the expression of Ro or of Ro, which 
results in a multiplication by (—1)¥ for all oscillators. 
Thus we have 


RR R'=AR0', 
RORT=A. 


and 
(7.41) 
Finally the mirage operator M satisfying Eq. (7.35) 

can be written as 

M=M'M,, (7.42) 

with 
Mo- Td (tie) 8+?) (Hierin. (7.43) 

Vg E 


‘and 


M’=T TON) —-Na(k, 7) 
k ae 
+9.(k,7)gs(—k, 7) +92(—k; 7)ga(k,7)]. (7-44) 
If we make the square of M, we get 
2=-M r=] Wee) Gee, (7.45) 
kor 
which is unity for y=n/2, and A for y=0 or +q, in 


agreement with Eq. (7.7). To satisfy M?= 1, we should 
take 


M=] ILC pran or M-I T- 1e, (7.46) 


The mirage operator thus defined satisfies 


MV vac = Vesc- (7.47) 


LAWS. PART II 63 
Evidently this is a matter of convention for we could 
as well take the negative of Eq. (7.42) as M. 


8. CHARGED SPINOR FIELD—-ANGULAR MOMENTUM 
REPRESENTATION 


In this representation, the operators used to charac- 
terize eigenstates are, besides H andò as in Eq (7.12), 
the following three operators which commute among 
thernselves and with H and Ù: 


1=LaEatEsks, (a=1, 2,3), (8.1) 
J= Lath E0EsE3, (8.2) 
P= Eo Esx, (8.3) 
where : ) 
Liı=— if x2(3/3x) —x3(d/dx2) |, etc., (8.4) 
and j : 
Qxf(x) = f(—x), (8.5) 
or g 
(x!|Ox|x)=(x!-Ex”). (8.6) 


It is convenient, in this representation, to introduce 
the following operators, X, Y: 
X=(i/r)Eotx, (a=1, 2, 3), 


v= (i/r) (Eyota + Esei Ezz). 


(8.7) 
(8.8) 


It is easy to see that X, Y, and P satisfy the same 
commutation rules as the spin matrices: 


X= V= P=], 
¥P=—PY=iX, 
PX=—XP=i¥.) 


XY=—YX=?P, (8.9) 


These operators, X, Y, and P, are analogs of Sy, Se 

and S:, as seen in Eqs. (2.58), (2.59), and (2.60). 
Now noticing that 

pr=—1(0/d21) = (x/r) (p,+i/r) 

— (co/7")L 3+ (x3/r7) Lo, etc, 
with wee 
pr=—iL(8/dr)+ (1/7) ] 

= (1/7) (epr trpa xapa —i) (8.11) 


we can rewrite the Hamiltonian by a straightforward 
substitution in the form? 5 


Ho= —iEoX%p,— iE Y (1/r)L-+mPO%. (8.12) 


(8.10) 


The eigenfunctions in this representation will be- 
characterized by w, x, m, and p, which are, respectively, 
eigenvalues of H=b, I, Jz, and P. Thus, = 

Hompo]: (8.13) 


It is obvious that if w is an eigenvalue then —w is also 
an eigenvalue. However, in contrast to the case of ie 


235P. A. M. Dirac, The ‘Pzinciples of Quantum Mechanics 
(Oxford University Press? London, 1947),.third édition, p. 268. 
= T% = ~~ E 


aene 
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TABLE IT. The sign functions p’s defined in Eqs. (7.20) and 
(8.32) for various operators used in the angular momentum repre- 
sentation. 


PJ PK PT PP PJPK PJPRKPT PPr 
Hm à + -= +o ë =- +o + 
b oe E 4 
P - a - -H - + + 
I = gp E + 
Js Be E > 


linear momentum representation, |w| is not determined 
by the other quantum numbers. Similarly to the case 
of the linear momentum representation, here too, we 
have four quantum numbers. 

To determine the relationship among eigenfunctions, 
we calculate the sign functions py, px, pr, and pp, 
defined by Eq. (7.20), for the various operators used 
in this representation. The results are listed in Table I. 

We first notice that pypx=—1 for all the operators, 
which means, in virtue of Eq. (7.21), that Ky*(x,/), as 
compared with y(x,¢), has the opposite sign for all the 
quantities involved, including energy. Therefore we can 
expand y (x,t) with the help of Eq. (8.13) and 


Kylo mp], 
while limiting the values of w to positive values. Thus 
VO) =R ALTE ANAT) 

+9-OVYDIG NK], 


where 2 stands for [w,x,,p ]. As before, the fact that 
Eq. (8.14) has the opposite values of à is compensated 
by the fact that the emission operator g, instead of the 
absorption operator g, is standing as its coefficient, 
where g and g anticommute. The commutation rules 
here should read 


(8.14) 


(8.15) 


[gs (lomp), t (o'm p") 1 


=ô (wv) (xx) (m,m) lo, p) (È, +), etc. (8.16) 


Next, we see from Table II that pypxpr is negative 
only for Js. This means, in the light of Eq. (7.21), that 
E,xKy*[\](—i), as compared with y[A](), has the 
opposite value of J; and the same values of H=9, J, 
and P. Therefore we can write 


ExKy*[,x,m,p |(x, ve: t) = elu, K, — M, p] (x4). (8.17) 


„Solving for Y[—m], and tuking the spinor adjoint by 
the help of Eq. (2.20), we obtain 


= EKP To, Kee! p(x!) 


> y =— eiw, x, m, p](x, —1). (8.18) 


a 


The~apparent dissymmetry between Eqs. (8.17) and 
(8.18) can be overcome by noticing that a can depend 


on m. Thus we can adjust a so that both Eq. (8.17) and 


Eq. (8.18) are incorporated in 
EoKy[w,x,m,p ](x, =) 
= (m/ [m| Wo, «, —m, P1). 


One of the advantageous features of thẹ angular 
momentum representation is that P commutes with 
the other operators defining eigenfunctions. This fact 
is reflected in Table II by pp being positive for all the 
operators. Thus we should have, according to Eq. 
(7.21), 


(S219) 


Eryl wm, p](— x,/) Fe ep omp] (x,), 


or more directly, because of the fact that ¥[A] is an 
eigenfunction of P, 


Eosl w, mp] (= X, i) =pto,x,m,p | (x,!) è 


Now we can determine the effect of C; R, and M 
on the q-number amplitudes g’s. Applying Eq. (7.2) 
to Eq. (8.15), we obtain 


(8.20) 


Cga lom p)! = e#"*g+(o,x,m,0), (8.24) 

which is satisfied by 

C=C =0=]] [1-N+A) -N-A Heg Ae) 
Heeg AeA). 


The application of Eq. (7.3) to Eq. (8.15) yields, in 
virtue of Eq. (8.19), 


m 
RE +(w,K,m,p R)" = E alo K, — M, p), (8.23) 
; | 


(8.22) 


m 


which is satisfied by 
R=RoR’, 


z NEO), 
l( ———ez# 
m 


R=TLT INAN mN A, —m) © 


+N iml 
HILA mgA, —m) +9200’, —m)gQ’,m) I, 
where à’ stands for (w,x,p). This R obviously satisfies 


RTRI=A. (8.25) 


(8.24) 


Finally, the effect of M on the g’s is obtained by 
applying Eq. (7.4) to Eq. (8.15) with the help of Eq. 
(8.20). Thus, 


MgO p) M= —ierrpg,(\'p), 
g+(X’,p) ie'”pg (d'p) | (3.26) 
Mg_(\’,p)M- = — ie ipg- A',p) 


and are satisfied by 
M=TI I (Hip inno (Lipetir)¥-O'), (8.27) 
N p 
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7 ; 
where A stands for (w,x,m). We can satisfy the condition 
M?=1, by either one of the following two alternatives: 


Mi=] JI p¥+0)(—p)¥-0"), 
A p 

hS M=] J] (—p)¥+00)p¥-0".0), 
2 Np 


(8.28) 


AN wo 

This jambiguity is not surprising since the deter- 
mination of p itself is a matter of convention; we could 
as well use —P for P. In’ spite of this, Eq. (8.28) can 
give asdeéfinite value of parity when the total number 


5 . . . 
of spinor particles is even. Moreover, we can speak of 


a definite “relative” parity of two states, each having 
an odd number of spinor particles, since the “relative” 
parity (i.e. equality or inequality of parity) has a 
definite meaning no matter whether we use M, or Mo. 
Both M, and Ms satisfy MWyac=Vyac. 

For the purpose of a later application, let us briefly 
consider the reflection with respect to the x-axis: 
(x,9,2,l)—>(—«, y, z, t). The unitary operator M, is 
obviously defined by 


a Myle NM =i Ep(— x, y,2,), (8.29) 
whose adjojnt is, on account of Eq. (2.18), 
Ma*(x,y,2,) Ms = ei — s, y, z, DEn, (8.30) 


where a has to be =0 or +7 in order that W?2=1. The 
right-hand side of Eq. (8.29) is obtained from p(x,y,2,1) 
by applying the operator 


P= Ez. (8.31) 


The comparison of an eigenfunction ¥[\] and P4[A\] 
can easily be done by examining 


P="0P.=pp0, (8.32) 


far various operators. The results are listed in Table II, 
which slows that the only quantity that changes its 
sign is J3. Therefore we can write 


EypLo,x,m,p (—, J, 4, t) 


= ew, Ky — m, p](x,y,2,t), (8.33) 


where e`% can be put =1 without loss of generality. 
Applying Eq. (8.29) to Eq. (8.15), we obtain, with the 
help of Eq. (8.33). 

. Maga m ]M eig, —m], (8.34) 


where \' stands for (w,x,p) and e** must be +1, in order 
that M2=1. M, can be given an explicit expression: 


M= I [1 [1-V=0',m) —N (N, — m) 
+N [ml 
S +(&Vg0',m)ga A’, — m) 


HEVIN, —mge0ym)], (8-35) 
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where (+) is an arbitrary, double sign independent of 
the double sign of the charge. This M, satisiies M Vyao 


=ï 


vace , 
9. TENSORIAL QUANTITIES BUILT WITH SPINORS 


We have established that charge-conjugation, rever- 
sion, and mirage with regard to spinOr fields are en- 
gendered by C, R, and M defined by 

CY HCI =c'p*(x,)K, j 
(RYC DR)"=6"0(x, —)EoK, ee 


Myx, DM =e Erb (—x, i), y= +7/2, 


and their adjoint equations. We have also shown that 
such operators C, R, and M actually exist. These rela- 
tions have been derived from the requirement that the 
free Lagrangean and the current have the correct 
transformation for charge-conjugation ©, reversion Jt, 
and mirage St. Now, we can conversely use these rela- 
tions in Eq. (9.1) to find the transformation rules of 
various tensorial quantities O(x,!) built with spinors 
for ©, R, and M. Their transformation properties are 
determined by 


pe= (CO()C9/0(x), | 
pe=(R“O(x,)R)7/O(x, —2), (9.2) 
pu =(MO(x,)M-)/O(—x,!). J 

The quantities to be considered have the general form 


Q(x) =0*(x,)0 (x) (x). (9.3) 


In the next section we shall investigate the case where 
the two spinors involved in Q are not the same. In Eq. 
(9.3), however, Y% is just YJ, where is the Hermitian 
conjugate of Y, 


Q=9s-0p. (9.4) 


The requirement that Q be Hermitian is equivalent to 
the requirement that J~'O be Hermitian: 


(J10)-=J-10, or ps=(J70J)/Ö=+1. (9.5) 


In the following we shall assume that the operators O 
are written so that Eq. (9.5) is the case. The operators 
considered in the preceding sectiorts are JO rather 
than O. See Eq. (2.40) in this connection. 

To determine pc, pr, and pw, we use the auxiliary 
` . ` Ld 
sign functions px, pr, and pp defined by 


KOK =px0", T7OT=p0, 
P“OP=p,0, JI*QJ =pi0. sf 


See Eqs. (7.17) and (7.20). It is easy to see; by applying 
Eq. (9.1) on Eq. (9.2), that eos a 


° . 
pc=pK, pR=—PKpr, Pu=—pP- (9.7) 
& 


a > w 


6) 
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TABLE III. The various p’s for the tensorial quantities built with 
the same’spinors. 


PN) 1 ths the iEo isa Eso iEa8 iEao 
pJ ap + te paty AP =P ++ 
PK o> tp Sa Ss ote al = = 
pr ie oe os tr ai = + = 
PP + = ch = Ga ae Ste = 
pe SP a mi ar + = = 
PR = ay =% “ip a ar aF = 
pM = a = ie He = al + 
Kind 1 reg. 2 2 1 1 3 3 


Note-that pc=px is based on the anticommutability of 
y and yX. 

As an illustration, let us consider the free Hamil- 
tonian Ho: 


Ho=EvEabatEvEsm, (a=1, 2, 3). 


The corresponding operator O=J Hy is then 
O=iE pati. 


Since J71B,J=—E,, as in Eq. (2.18), and Da= pa, We 
have py=-+1 due to the imaginary unit 7. Compare 
herewith that we had py=—1 for Ho. See Table I. 
Since O anticommiutes with Eo and since O does not 
contain the time variable, we have pr=—1. Next, 
K-£,K=—E,', KE;K=£;7, as seen in Eq. 
(2.19). And, furthermore, since fa"=—pa, from Eq. 
(2.71), we have px=1 for O. Since Q,p.=—p.22, in 
Egs. (2.70) and (2.74), and since E12, commutes with 
th E. and anticommutes with Es, we have pp=—1. 
Therefore, by Eq. (9.7), pc=pr=pu=1. 

We are mainly interested in the O’s which are the 


H basic 2s or their products. There are, as is well known, 
ki six types of quantities up to the second rank: 

ai (S) O=iEs, (PS) O=1, 

i (V) O=iB, (PV) O=iE™,  b(9.8) 


Jae Aig 


, 2) O=E™, (PT) O=1E". 
The imaginary unit 7 is added wherever it is necessary 
to have py=+1. As we can write E°?=7E;39, etc., due 
to Eq. (2.17), the relation between (T) and (PT) is 
just that of complementary tensors as defined in Part I. 
Therefore, we need not investigate them separately. 
The various p’s for the O’s are listed in Table III, in 
which the indexes a and £ refer to space-coordinates, 
a,B=1,2,3. 

m Fron the line for pe, we observe that 2H,, iE and 
Es,» are “electromagnetic” quantities (o¢=—1) while 
all-the others are “mechanical” (o¢=1). From the lines 
for pr and pm, we can determine the “kinds” of the 
quantities which are listed in the last line of the table. 

Comparing this result with (2.43), we observe that 
1,.#2s, and iEs, change their “kinds” by the passage 

_ from the c-number theory to the gq-number theory. This 

— ange is such that par rémains the same while pr 


TEE 


tare 


WATANABE 


changes its sign. It should also be noted that these 
quantities are “mechanical”? quantities. In other words, 
the q-number theory changes the behavior of the 
mechanical quantities for reversion, as was required in 
Sec. 4. These results were mentioned in Part I, Tab! VI. 

It should also be recalled that the various alt~inative 
expressions for C, R, and M pertain to the arbitrariness 
that is allowed without altering the transformation of 
the bilinear expression of the type in Eq. (9.3). There- 
fore, the above-mentioned results are not affected by 
the choices of the operators C, R, and M. 


The transformation rules in Eq. (9.1) are determined ` 


by the consideration regarding £y and s“. But, for the 
neutral spinor particles, s* does not have the meaning 
of electric current, and one may argue that particles 
and antiparticles are physically undistinguishable. If we 
take this point of view, we could define C, R, and M 
by a combination of those C, R, and M given in Eq. 
(9.1) with charge-conjugation ©: yoy*kK, Poyk, 
i.e., by 


Cy =e, 
(RW ()R)? =e" Ew (À, (9.9) 
My (x)M4 =e Eyo3y* ( = x)K. c 


But, such an assumption will immediately encounter a 
serious difficulty. Consider the pion-nucleon interaction : 


N+P, (9.10) 


to which corresponds an interaction Hamiltonian of 
the type 


Ye”ýnx, (9.11) 


where the Dirac matrices are omitted for simplicity. 
Assuming Eq. (9.1) for protons and ¥q. (9.9) for 
neutrons, we get as the result of mirage, for instance, 


vend, (9.12) 


which certainly should not exist in the original inter- 

action Hamiltonian, since it implies a violation of “con- 

servation of heavy particles” : 
T—P+N. (9.13) 


The relations in Eq. (9.9) for C and R encounter the 


same difficulties. We therefore have to assume Eq. (9.1) 


and not Eq. (9.9) for neutrons. 

We can further argue that the neutrino also has to 
obey Eq. (9.1) and not Eq. (9.9). If the neutrino would 
obey Eq. (9.9), the interaction Hamiltonian of the type 


PoN+et+o: (Wye) He), (9.14) 
would go over, by mirage, to 
PoN+etty: (WYW), (9:15) 


where the neutron, proton, and electron are assumed to 
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obey Eq. (9.1). The combination of (9. 14) and (9.15) 
would mean 


vy or pp A), (9.16) 


W whied js hard to accept. 


\ For “ese reasons, we shall assume in the follor wing 


setions that the neutron as well as neutrino obeys Eq. 
(9.1). 


10. NUCLEON-PÌON INTERACTION 


Let us consider the nucleon-pion interaction of the 
type 
L= fYO.7 WF *, (10.1) 
where the index æ designates the tensorial components 
and the arrow means a vector in the 3-dimensional 
Euclidean isotopic spin space. Here, y is actually 
an 8- -component SPEO composed of neutron spinor Yy 
and proton Spinor Yp, which we define by 
» TIN = Fyr, Tapp=— ýr. (10.2) 
The factor F< is a Hermitian quantity derived from the 
pion field and the 7’s are the well-known isotopic spin 
matrices which are also Hermitian. Then, if we take O, 
such that 
J>OJ=0, ice. 


ps=1, (10.3) 


£ will become Hermitian with real f. 

Although we take the “symmetric” formulation, in 
Eq. (10.1), as the standard expression, it will be easy 
to infer the situation with the “charged” or “neutral” 
formulation from our results. As O in Eq. (10.1), we 
can take any one of the operators introduced in Eq. 
(9.8). The Fa for (S) and (PS) is the scalar pion field 
strength itself %. The F< for (V) and (PV) can be the 
vectorial field strength xë or the 4-dimensional gradient 
of, the scalar field 7. The F< for (T) and (PT) must be 
the 4-dimensional curl of a vectorial field 2. 

In this section, we take the viewpoint that yy and 
Wp obey thestransformation rules in Eq. (9.1) and that 
the transformation rules of the u-field are to be deter- 
mined by the requirement that £, as in Eq. (10.1), be a 
regular scalar. There is no reason why the phase angles 
a, 8, and y should be the same for protons and neutrons. 
Therefore, we shall write ap, Bp, and yp for protons and 
ay, By, and yv for neutrons. 

By decomposing Eq. (10.1) in three components in 
the isotopic spin space, we get 


L= f(Wv*Oaet Wr 0a) F OE 
+f(—iby*0 

+f (Yn Oyy 

Or, by ifitroducing, in analogy to Eq. (5.1), 


Gx=(FM2piFO)/\2, Ge= (FM«— iF 22) /y2, 


betipeOn)F* p (10.4) 


YOy p) FO. 
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we can write 


* 


. L=V2f(KaGt+eK G9 + fH FO* (10.5) 
with Kacy Oan, 
Ka=ys” Oar, 7 (10.6) 
Ha= bx’ Oops — ye Opp. j 
In the “charged” formulation, the third term with 


F® is absent. In the “neutral” formulation, the first 
two terms are absent and we have a plus sign instead of 
minus sign in the expression of Ha. . 

Our next step is to determine the transformation 
rules of Ka and Ha by the use of Eq. (9.1). The results 
are as follows: 


CKC = [t Jelena ‘ocKa, | 
ý i " (10.7) 
CH= peH a- J 


(R Kal DR) =e! r Fripp Kal), 


(RH ()R)? = prila(—D), 


MK a(x)MO =e) Gs?) pK alx), | 


(10.9) 
MH. (x)M'=pHa(—x)° Í 


The sign [+ ] is positive if Yy and Yp anticommute and 
negative if they commute. pc, pr, and ps are the ones 
determined in the last section for various O’s. See 
Table IH. 

By the requirement that £ be a regular scalar, we can 
infer the transformation rules for F® from the results 
of Eqs. (10.7), (10.8), and (10.9): 


CRBM«C4 =pch@-«, (ROE @<()R)? =ppFY=(— i), 
MF®2(x)M=puF®2(—x), (10.10) 


hat pc, pr, and psr are those of the operator O used 

n (10.9). Since the Fis the wu itself, or a quantity 
aes from « by an operation of differentiation 
which is a regular tensor, the “kind” of F” is the same 
as the kind of u®. Also, the charge-sign-function pc 
must be the same for F% and u. These properties of 
u® are tabulated in Table IV. 

From these properties, we can conclude an important 
rule concerning the mixing of different types of inter- 
action. Mixing of (S) and (V) for scalar field and mixing 
of (PV) and (PT) for vector field are not allowed. This 
conclusion can be drawn from consideration of the 
“kind” as well as from the consideration of pe. Thi con- 
clusion is also valid in the “neutral foanio nen 


TasLe IV. The transformation properties of ee for various 
interaction ty 


eR 
) A) (1) (PS) erv) “SR 
a E n aii 
Kind reg. 2 2 i $4 ak 5 
Pc + - a— + +. = 
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Contrary to the case of .®, we cannot infer any 
definite conclusion regarding the “kind” of # and u® 
because of the unknown factor such as expliyy—iyp) 
in Eq. (10.9). By the same argument as the one which 
led to Eq. (10.10) we can write 


CoC = cilre) pci, 
(RV()R)T =e!) pxd(—2), 


MoM = er) puo — x), 


(10.11) 


where pc, pr, and px are those of u®) for the correspond- 
ing. interaction type. These relations agree with Eqs. 
(5.8), (5.11), and (5.16). Since yp and yw are limited to 
-e7/2, exp(typ—tyn) as well as par exp(iyp—tyy) is 
limited to +1, which is in agreement with Eq. (5.17). 

The prohibition rules of mixture, explained with 
regard to 2), is still valid for the charged v. For, what- 
ever the factors exp(iap—iay) and exp(i8p—i8x) may 
be, the phase factors in transformation rules for v must 
take the same values throughout the entire Lagrangean. 
Ttis true that due to the so-called equivalence theorem,” 
the (S)-interaction and (V)-interaction, for instance, 
are equivalent. Thus one may say that the mixture of 
(S) and (V) is unnecessary. However, these equivalence 
theorems are proved only in lower approximations and 
we cannot in general replace (S) by (V) or (V ) by (S). 

Before closing this section, a few lines may be spent 
in connection with the contention often raised” to the 
effect that the (S,V,T)-types can be mixed with 
(PS,PV,PT)-types. For simplicity, let us consider the 
mixture of (S) and (PS), in accordance with the 
usually proposed procedure: 


L= f PFI (1/24) fPLwaryyrrt, (10.12) 


where Ewa =+iEs according as (u,v,x,\) is an even or 
odd permutation of (1,2,3,0). See Eq. (2.41). According 
to the c-number theory, Y“Eyaw is a third kind pseudo- 
tensor of the fourth rank, corresponding to the fact 
that YXEs), which is the complementary tensor of 
Y*Ewav, is a second kind pseudoscalar. In the 
q-number theory, ¥*Ewoy is first kind and y*Esy is 
regular. 

Now referring to Eq. (10.12), the first term will be 
a regular scalar if w® is a third kind in the c-number 
theory and a first kind in the q-number theory and v 
obeys the corresponding relation in Eq. (10.11). Then, 
the second term in Eq. (10.12) can be made regular, if 
we assume 7” to be regular in both c-number and 
q-nuthber theories. For, (third kind) x (regular) X (third 
kind)=regular, and (first kind) X (regular) X (first kind) 
=,cgular. 

This regular tensor 7%” is nothing but what we 
Genoted by the same symbol in Part I, Sec. 2, as distinct 
from e>. The appearance of 7” in a physical law is 


— n 

Tas E. C. Nelson, Phys. Rev. 60, 830 (1941); E. J. Dyson, Phys. 
Rev. 73, 929 (1948); K. M. Case, whys. Rev. 76, 14 (1949). 

21 M. Schoenberg, Phys. Rey. 60, 463 (1941). 


highly questionable. For, 7'™, for instance, is +1 in one 
coordinate system, but —1 in another coordinate 
system which is connected to the first by mirage. This 
means that the physical law is different for the right- 
handed coordinate system and the left-handed svStem. 
How can one decide whether he should take r2” = -+1 


or =—1 in his coordinate system? In other words,-we 
can write the second term of Eq. (10.12) in the form 
fav Eryn, (10.13) 


assuming that fə is a first kind pseudoscalar. Is it per- 
missible that a natural constant change its sign by 
mirage or reversion? Can it differ for a process and its 
miraged or reversed process? 

Furthermore, the mixture of (S), (PS), or (PV) with 
one of (V)(Z) or (PT) cannot be made allowable by a 
similar artifice, because pe is different for these two 
groups. 


11. NUCLEON-LEPTON INTERACTION 


The usual nucleon-lepton interaction Lagrangean can 
be written 4 


£= D fa (Wx*O apr) (Y 20 “y) = 
+ De yer (WpXO aWn) WO.) ? (1 1. 1) 


where the “particle,” as distinguished from the “anti- 
particle,” of the y field is positron, and the “particle” 
of the y, field is the neutrino defined as the neutral 
partner of the negative beta-decay. The index a 
designates five independent types of operators men- 
tioned in Eq. (9.8). Because of the existing condition 
in Eq. (9.5), £ is hermitian if fa“ is the complex con- 
jugate of f. 

By applying the first transformation of Eq. (9.1) 
on the first term of Eq. (11.1) we obtain 


ik aC (Wn*OaWp) wy Oy) 
= fa(pvOa yr) (YOT) 


xX gilap mantar ae) (pc), 


7 


(11.2) 


where pc(==t1) refers to Oa, and naturally, pc?=1. It 
may be quite natural to assume that the commutation 
relations between Wy and Wp and the commutation 
relation between y, and ye are of the same type. There- 
fore the q-number part of Eq. (11.2) is identical with 
the g-number part of the second term of Eq. (11.1). 
Therefore we get the relation 


fea IPO, (11.3) 
or z 
i fa=| fale, (11.4) 
with 
a= (—aptoay—aytaet+2nam)/2; 
(na=0, 1, 2: -). (11.5) 
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ə By the same token, from reversion and mirage, we 


obtain (without any assumption regarding the types of 
commutation rules) 


Ja” = faci Cr-Brt8b.) (11.6) 


eN e Ja= faci Vert), 


aA (11.7) 
The compatibility of Eq. (11.3) with Eq. (11.6) requires 


ap—an tæ, =a =Bp—By+6,—B.+2n7, 


(n=0, 3 o) (11.8) 
and the self-consistency of Eq. (11.7) requires 
Ye—yn tY» —Ye=2mr (m=0,1,2,---), (11.9) 


while individual y’s are +7/2. 
Whether òr not’ exp(#.) separately for various a’s 
have a physical meaning, the relative phases 


exp[7(8.—8a") | 


haye certainly a physically detectable effect.28 From 
Eq. (11.5) follows 

Da— Oat = (ta— Mat) (11.10) 
or 

ei Ga—8a") = 4-1, (11.11) 

This means that for a mixture of the type in Eq. (11.1), 
the f’s can be complex but the ratio fa/fa for all com- 
binations of a and a’ must be real.” Obviously the same 
conclusion can be drawn from the consideration of 
reversion, i.e., from Eq. (11.6). 

Since the tgansformation of a quantity of the type 
(Wi*Ov2) contains an undetermined factor, such as 
exp[782—781], we can no longer maintain the old 
prescription that a tensor (pseudotensor) of nucleon 
field should not be coupled with a pseudotensor 
(tensor) of lepton field. Therefore we can assume,” 
instead of Eq. (11.1), 


£= y la (WO) (W.-XOW,) 
SPS, Na* (We*Oabw) (W,*O5%p.), (11.12) 


where O52 means E; times or 7E; times O. Table V gives 
the exact correspondence. 

By the same argument as before, we obtain from 
charge-conjugation, reversion, and mirage, the following 


TABLE V. Correspondence of O° to Eg. 


O, iEs 1 iEn iEsu Esp» tE yy 
of 1 iS iBH iE" iE?” Beer 


SS O 


28 L, C. Biedeharn and M. E. Rose, Phys. Rev. 83, 459 (1951). 
SCINS Yang iA J. Tiomno, Phys. Rev. 79, 495 (1950). 
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Taere VI. The values of pc(Oa)pc(Osa), etc., as dependent on 
the type of Oa. 


(PS) v) (PY) (7) 

pc(Oa)pc(Osa) t + = = -F + 
pr(Oa)pr(Osa) Za Za + =f = = 
pu (Oa)ou (Osa) = = i = Ome = 
Class I T c I 
results: 

ha = ilr artara) pel JpclOsa)Has 

hA = ei Cr Prt don (Opr (Osa) lta, a fs 13) 


ha= ei teorite) p ag (Oajpse (Oza) Ei; J 


where pc, pr, and ps, are the ones given in Table IIT. 
The products such as pe(Oz)pc(Ose) are listed in 
Table VI, in which the first line indicates the operator 
Ou 
We notice that psr(Oz)psr(Osa) is always —1. There- 
fore from the third relation of Eq. (11.13), we have . 


YP— YN Ye — Y= (2n-+ 1). (11.14) 


Regarding pc(Oa)pc(Osa) and pr(On)p2(Os.), we observe 
that the types of combinations can be divided into two 
classes: 


(IT) (S)—(PS), (PS)—(S), (T)- (PT), } 


ee 


(I) (V)PV), (PV)-(V). 
of course, (T)— (PT) and (PT)—(T) are actually the 
same interaction. For class (I), pc(O.)pc(Osa) is +1, 
and pr(O.)pr(Osa) is — 1. For class (IT), pe lOa)pelOsa) 
is — 1 and pr(O.)pr(Ose) is +1. 


Writing 
A=ap—ayt+a,—a,, B=Bp—Bx+B,—Be (11.16) 
we have for class (I), nm 
ha= | Hale (11.17) 
with 
ba=—}A-+nar= —}B+(mat4)r. (11.18) 
And for class IL 
a= —3A+(tat})e=—}B+max. (11.19) 


From Eq. (11.18) it follows that if æ and «both | 
belong to I or.II 
ela/eitar— +4 (g,a'el or a,a’ell) (11.29) 


and if one of a and a’ belongs to Í and the other belongs 
to I, hele 


ey 
ele /eitar— 47 (ad, TI or adl, à&'T). (11.20 
a 


In other words, withiñ the same class, -Ï or II, the 


© 
a = ~ 


? ~ 


(PTY 
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relative phase factors of /’s are real, and the relative 
phase factors of the #’s belonging to the different 
-~ classes, I and If, are imaginary. ` 
It should also be noted that a mixture of a type 
belonging to Eq. (11.1) and a type belonging to Eq. 
(11.12) is not allowed. For Eqs. (11.9) and (11.14) are 
in direct contradiction. The same conclusion can also 
be drawn from the comparison of Eq. (11.8) with Eq. 
(11.18) or (11.19). 


12. SELECTION RULES AND SUPERSELECTION RULES 


A self-miraged state W is a state such that the expec- 
tation values of each physical quantity Q(x) is pa 
times that of @(—x). In particular, a quantity Q, which 
does not depend on x and whose par is negative, must 
have a vanishing expectation value. The total linear 
momentum is an example of such a quantity. 

By definition of the mirage operator M, we should 
have for a seli-miraged state 

MY=HeV:. IMeV=H=e*V. 
jIF=1, (12.1) 


we have exp(2ia)=1. The seli-miraged states are clas- 
sified into states of even parity and odd parity such that 


t M Veren ~ Veven, MWY o3a= — Voca- (12.2) 
= This definition is not yet unique, for we can still mul- 


tiply M by (—1) without changing Eq. (12.1) but inter- 
changing Veven and Yoda. Therefore, we usually impose 
an auxiliary definition, 


MY =Y, (12.3) 


which is just a matter of convention. 

In order that this classification in Eq. (12.2) is com- 
patible with the other specification of Y, we have to 
use those physical quantities which commute with M 
or whose expectation values in Y are zero. For those 
quantities Q which do not depend on x, the relation in 
Eq. (3.12) simply means that par=1 or —1 according 
as Q commutes or anticommutes with M. Therefore, 
W must have vanishing expectation values for these 

à Q’s with px=—1. This is a condition that a self- 
miraged state satisfies. 

This restriction to “self-miraged” states would seem 
to delimit the Hilbert space to its part or subspace. 
Tt is not so because any arbitrary state can be decom- 

|= posed into two kinds of self-miraged states defined in 
=" Eq. (12.2). For any Y, we have 


We Vevent Fodd; Veven= (4+ w)/ De 
hae eae Woaa= (V—MW)/2, (12.4) 


here the resultant Y is not self-miraged. 

[ n linear momentum representation, the self-miraged 

tes must have the total linear momentum equal to 

e have already noticed that the angular mo- 
a 


-n 


E 
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mentum representation is particularly suitable for the 
parity consideration. It should also be recalled that 
except for the case of photons and neutral mesons, there 
is actually no physical means of determining the pr*ity 


c 
n has been explained in sometietail 
with regard to the charged spinor fields in the'angular 
momentum representation.. A similar consideration 
applies also to the charged pion fields. 

The selection rule regarding space-parity means that 
the transition between an even-parity state and an 
odd-parity state is prohibited if the transition operator 
U (i, — i) commutes with the mirage operator M. This is 
quite obvious, because 


(Tr UV eren) = (Y UMT een) 
= (M Vosa, UVeven) = — (Yoda, UFeren)- (12.5) 
We can do exactly the same’ thing with the charge 
conjugation operator C: 
3 
| 
ie 6) 


é) on the left and right sides 
ge-conjugate states must have 
vanishing expec 1 values of all “electromagnetic” 
quantities (pce=—1). By a superposition of a state 
with even charge-parity end a state with odd charge- 
parity, we can build a state in which “electromagnetic” 
quantities have non vanishing expectation values. 

To have a concise view of the entire problem and 
also to prepare a step toward the consideration of super- 
selection rules, let us consider a unitary or Hermitian 
operator IV which has eigenvalues IV’; and corresponding 
eigenfunctions W;. Here 


WY =Wa:, 


(12.7) 


where we use the same value of i insofar as W; is the 
same. In other words, Y; determines in general a sub- 
space in the Hilbert space. Now, take any operator X 
that commutes with JV: 


XW=WX, (12.8) 
then we have 
(Y,,XWY,) = (Y; WXY; = (WY;,XY,), 
from which follows 
WAY; XV) =(1/W);, XV) (W: unitary 
(WY; ) (1/ PO; EX D) | (499) 


or Wi, XY; =W; ($; XW;). (W z Hermitian) 


Putting first X=1, we get the well-known theorems 
that the eigenvalues of a unitary operator are of ab- 
solute value unity and those of a Hermitian operator 


le 
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ə are real, and that in either case Y; and W; are orthogonal 
if WiAW;: 


(WV )=0, ij. (12.10) 
_ Furvher putting X=U, we get 
Nase 2 "o (Y,UF)=0, ix; (12.11) 
on condition that 
UW=WU. (12.12) 


> This leads to the ‘ ‘conservation law” of the eigenvalue. 
The selection rules with regard to space- parity and 
charge-parity can be obtained by putting respectively 
W=M and W=C. 

The concept of a superselection rule is more re- 
stringent than that of a selection rule. If there is a 
unitary or Hermitian operator W that commutes not 
only with U but also all physical quantities Q, then we 
speak of a superselection rule. Under this condition, we 


shall have 2 


1 QW=WỌ. (12.13) 
Putting X=QỌ we get from Eq. (12.9), 
= (,0¥)=0, i}. (12.14) 


If W commutes with all Q, then it will commute with 
the Hamiltonian and also with the transition matrix. 
Therefore, if Eq. (12.13) is the case, it is not necessary 
to require Eq. (12.12) separately. Equation (12.14) 
means that the expectation value of Q in a state 


=> aw; (12.15) 


is given by 

° (,0¥)=Dila?| (FF). (12.16) 
This shows that tħe phase of a; is deprived of any 
physical meaning, i.e., not only the phase of the total 
W given in Eq. (12.15) but the relative phases of indi- 
vidual terms belonging to different subspaces lose their 
meaning. lit other words, the knowledge of a state ex- 
pressed by Eq. (12.15) is nothing better than a mixed 
density G such that 


g=Lla?|OL%], 


where @[W;] is the projection operator onto the direc- 
tion of V;. We have 


nth CY: i=, Oly, j= 0, 


where ® is any sfate-function orthogonal to W;. Thus 
we need not consider any “superposition” of two state- 
functions arising from two separate subspaces. See Eq. 


(12.7). 
Furthermore, it should be noted that a state ¥ and 


WY are physically indistinguishable if W is unitary, i.e., 
(WY,OWY) = (4, WOH) = (4,0%). (12.19) 


(12.17) 


(12.18) 
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The simplest way to discover superselection rules is 
to search for gauge transformations under which all 
the known physical quantities remain invariant. We . 
have seen in the body of this paper that the sign of the — 
electromagnetic field and the u@-field seem to have 
physical meaning. All the rest of the field strengths are 
open to unobservable phase-change. The physical 
quantities which refer separately to a spinor field y and 
a charged boson field v have the form 


y“~p and ñ, (12.20) 


insofar as their dependence on wy and v is concerned. 
The electromagnetic interaction has a term like Yy, A 
or Yp“WpA, where A stands for electromagnetic poten- 
tial and the suffices e and P refer to (positive) electron 
and proton. But, as A is not changed by a gauge trans- 
formation with a constant phase, these quantities can 
be included in Eq. (11.20). The ‘interaction Hamil- 
tonians between nucleon fields, lepton fields, and boson 
fields are of the type 


YN “Wp Ü, 


See Secs. 10 and 11. 
One of the simplest gauge transformations that 
leave Eqs. (11.20) and (11.21) invariant is 


Vw prp Wr. (12.21) 


y>, Pe yA (12.22) 
for all the spinors. It is very tempting to try the same 
thing with respect to v, i.e., to assume vv. However, 
in this case, we have to take into consideration a term 
like y*%ppd. If te-*0, then Yy“Pp must go over to 
el yp. Furthermore, if Yy“bp—-e8byWp, then from 
a quantity like Yy”%rý. W, we conclude that Y y, 
should pass to e~*y.“y,. All this chain of transforma- 
tions can be written as follows 


ve, pe HG, 


Wye Biyy, peert p, (12.23) 


Yett BiR, peme PIR o 


where 8, y, and ô are entirely arbitrary except that they 
are real. If we put 8=0, and y=4, we come back to Eq. 
(11.22). If we put y=6=@/2, then we obtain the 
familiar gauge transformation in which all the positive 
fields are multiplied by e”, the negative fields by e", 
and the neutral fields unchanged. If we put y=é=0, 
we get a transformation de can be interpreted asa, : 
rotation by angle 8 about the third axis of the iso 
spin. This last transformation is obtained by consi 
ing Y in ¥*fWu as a spinor in the three-dimensi 
space in which 7 plays t the role Of’spin rhatrices and hy 
applying Eq. (2.8) to ú and Eq. (2.57) toy. - 

In (12.23) the wave fields are not yet se 
into “positive” waves and “negative” ‘waves, o1 
“particle” waves and,“antiparticle” waves. T 


a = 


{ 


iN 
i 
af 
ny 


Bago na 


AO te en PE TT 
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confusion in doing this separation, let us make some 
agreement about symbols: We use (+) and (—), to 
designate positive and negative charge or to designate 


~ “particle” and “antiparticle.” Thus, (+) will include 


proton, positive electron, neutron, neutrino, and posi- 
tive pion, while (—) will include negative proton, 
negative electron, antineutron, antineutrino, and 
negative pion. We shall use (0) to indicate neutral 
fermions. Furthermore, (4) and (}) will differentiate 
the heavy particle (nucleon) family from the light par- 
ticle (lepton) family. With this convention about the 
symbols, we can rewrite (12.23) as follows: 


veto, 


Paeti, Yosh seHO-A™poyh, | (12.24) 


Pa eH HPD l pop etidye, 


Now we reinterpret these relations in the quantized 
field theory by expanding the vs and the w’s in the 
absorption operators and regarding the transformations 
in (12.24) as a unitary transformation, e.g., 


Wv,W=e¥0,, etc. (12.25) 


Such a W is very easily found!® to be 

W=TI expl—i8(M,—M_)—i(y +46) (V!—N_) 
—1t(y—38) (Nox*— No") —1 (6+ 38) (N+ — N’) 
—1(5—36) (Wo4’—No_') ], 


where the V’s and the M’s stand for occupation 
number operators respectively for fermion eigenstates 
and boson eigenstates. The multiplication I should 
extend over all the possible eigenstates. 

To make the implication of Eq. (12.26) more trans- 
parent, let us break it down to three independent cases: 
(1) y=d=0, (2) B=6=0, and (3) B=y=0. In the first 
case we have 


(12.26) 


W= Il (Gib i2) AO AMEN N Nott Nom 


(12.27) 


where the N’s stand for both (4) and (J). The eigen- 
values of W in Eq. (12.27) are (e-#/?)", r=0, +1, 
Æ2, ---, and the selection rule engendered by this W 
means the conservation of 7, i.e., conservation of 


2Ms—2M_+N,—N_-—:No,+No_)=constant. 
A Uri j Bee) (12.28) 


Tn the second case we have 
-W= expl ir (N4 — NHN o —No)] (12.29) 
from-which A the conservation law of 
EWN ++Not Ne) =constant. (12.30) 


aS 


—_ 
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In the third case, we have 


W=]J] expl—15(V,'—N_!+-No,'—No)J, (12.31) 


A 
and 


D (N-N-N o! Np) = constant. (12.32) 


It is to be noted that Eqs. (12.30) and (12,337 can 
exclude the undesired transitions in nucleon-pion inter- 
action and nucleon-lepton interaction,” such as 

N+Port, N+Portet, N+Pop-+et. 
By adding the three conservation laws in Eqs. (12.28), 
(12.30), and (12.32), we get 
M,—M_+-N,—N_=constant, (12.33) 
which is nothing but the law of conservation of electric 
charge. This law is obtained by putting 6=2y=26 in 
Eq. (12.26). 3 
By adding Eqs. (12.30) and (12.32) we get 


>D TO N e NO E (12.34) 


(Lh) 


In particular, according as the quantity (12.34) is even 
or odd, we shall have also 


N= D(N,AN_+Noy+No_)=even or odd. (12.35) 


(Lh) 


This means that the parity of the total number of 
spinor particles conserves. This is the original super- 
selection rule first introduced by W.W.W.*! The gener- 
ating unitary operator W can be obtained by putting 
B=0, y= or in Eq. (12.26). Then Eq. (12.26) 
becomes A as introduced in Eq. (2.80). 

The W considered in Eq. (7.10) is obtained from Eq. 
(12.26) by putting B=0 and equating y(=6) to 2(a—8) 
of Eq. (7.10). The transformation considered in cou- 
nection with Eq. (5.12) is essentially equivalent to the 
first line of (12.23). If we allow the value of y in Eq. 
(5.16) or in Eq. (7.4) to be arbitrary, then the square 
of mirage operator becomes in general a W.!5 

Since W as seen in Eq. (12.26) commutes with all 
the known physical quantities and the transition 
matrix U, we may consider the selection rules en- 
gendered by it are actually also superselection rules. 

Furthermore W also commutes with the mirage 
operator M and satisfies V=W7=R—WR. This means 
that a state function W and its miraged state Wy, and 
its reversed state Wr all belong to the same subspace 
defined by W. The only exception is the charge-con- 
jugation operator C, which has obviously the effect on 
W of interchanging the positive particle numbers with 
the negative particle numbers (and “particle” numbers 
with “antiparticle’ numbers). Therefore, whether we 


2 C. N. Yang and J. Tiomno, Phys. Rev. 79, 495 (1950). 
3l Wick, Wightman, and Wigner, Phys. Rev. 88, 101 (1952). 
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can call the law engendered by W a superselection rule 
or just a selection rule depends on whether or not we 
should exclude C from the definition of “physically 
observable quantities.” We can of course make W 
comautable with the C by putting either B=0, y=0,7r 
and 5\Q,r, or B=n, y=-bn/2, and §=-£r/2, which 
include eight possible combinations including W=1. 

In this corinection, it should be noticed that a self- 
charge-conjugate state and an eigenstate with M.—M_ 
+N,—N_=0, Eq. (12.33), are two entirely different 
concepts; although both seem to mean “electrically 
neutral.” For example, a “superposition” of a state 
with one positive particle and a state with one negative 
particle can be a self-charge-conjugate state but can- 
not be an eigenstate of the generating operator of 
qn (12133); 

The arbitrariness of the phase factor, which we have 
exploited in the foregoing, is not the only arbitrariness 
in the matzix representations of the emission and ab- 
sorption operators. The absorption operator of the jth 
spinor eigepstate, for instance, has the expression: 


o Í g=} (1—2N,)-f;- TL lk (12.36) 
h i<j k>i 
with 
à i O 1 
a-2v3=( ) ; f=( J; 
0 —1/; 0 0/; 
1 0 
1=( ) ; (12.37) 
0 1⁄/; 
For brevity we shall write, 
gj;=A; fj, Aj;=[[(1—2N)). (12.38) 


a i<j 


If we extend z in Eq. (12.38) to all the eigenstates, A; 
becomes A in Eq. (2.80) 
Now, it is obvious that this representation has a 


` basic arbitrariness with regard to the ordering or 


labeling of the eigenfunctions. In particular, it is of 
some interêst to investigate the total inversion of the 
ordering. This amounts to putting (1—2.V,) where we 
had unity matrices 1, in Eq. (12.36) and putting unity 
matrices l; where we had (1—2N,) in Eq. (12.36). 
This can be done by simply multiplying Eq. (12.36) by 
the A in Eq. (2.80) from the left: 


Yg: Y'= Ag:= —g:;A, Yg: Y !=ğ;A= — Ağ: (12.39) 


By a simple calculation, we can discover the explicit 
expression of Y: 


y= -J10 -N+AN). (1240) 


e 


However, this unitary transformation, as such, does 
not commute with all the physical quantities. If we get 
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a unitary transformation W such that 


WYW =A, WYW- =A (12.41) 
this W will certainly commute with all the physical 
quantities. This is the type of transformation considered 
in Eq. (4.14). Equation (12.41) will mean, in terms of 
“particles” and “antiparticles,” that, ` 


WeisW=Agi, WånW =A, 


Wg Wo=gA, WgjiW= Ags oran 
This W can be given the explicit form i 
W=W=]I (1 Na+ Ap iN 4) 

Ms X(1—N_j;—A_;N_;). (12.43) 


a 


Again this W is the generator of a superselection rule, 
but the physical conclusion that can be drawn from 
Eq. (12.43) brings nothing new. If we have a state with 
N, “particles” and V_ “antiparticles,” the eigenvalue 
of W, in Eq. (12.43), for such a state is 


(—1) 04-4) 4 Wf (12.44) 


The upper sign applies when .V;—V_is 0 or 1, modulo 4, 
while the lower sign applies when V.—NV_ is 2 or 3, 
modulo 4. In any case, if (V,—V_) is a constant of 
motion, as we have stated in relation to Eq. (12.34), 
then Eq. (12.44) is automatically conserved. 

To avoid a possible misunderstanding, it should be 
emphasized that the superselection rule engendered by 
Eq. (12.26) is just an explicit formulation of the more 
or less known invariance laws underlying the present 
field theory and that it cannot claim any unconditional 
validity beyond the current field theory. However, 
when one tries to formulate a new theory, it will be a 
wise policy to be clearly aware of the existing invariance 
laws in the present theory, and to determine which 
laws would be discarded and which orfes should be 
withheld or generalized. For instance, the conservation 
law of (V.—N_) with regard to spinor particles is 
deeply rooted in the spinor field theory and will not 
be so easily abandoned. Indeed, if the physical quan- 
tities can be written bilinear in a spinor and an ad- 
joint spinor and if the decomposition 


ped K coe 
can be made, this type of conservation law is inevitable. .” 
Furthermore, the parity law of spinor particles 
(W.W.W.’s law) is more fundamental. This law is 
equivalent to the postulate that physical quantities are 
expressed in even powers in spinors. The charge con- 
servation law, on the other hand, cannot be discardéd. 
Therefore, we can believe that a superselectien rule of 


mo 
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the type in Eq. (12.26) or one similar to it will survive 
until the basic postulates of ‘the field theory are altered. 


13. ILLUSTRATIONS—DISINTEGRATION OF 
POSITRONIUM AND NEUTRAL PION 


The purpose of this section is to show by examples 
how the general, principles developed in the foregoing 
sections can be applied to concrete problems. Therefore, 
it is not intended to give any exhaustive study of the 
~ decay modes of the positronium and the neutral pion, 
but only to derive some of the simplest facts about 
these decay processes. Except for a few among them, 
the results mentioned in this section have previously 
been discovered by other authors by more or less 
similar methods.” 

Let us first determine the charge parity mc, space 
parity ms and -parity m- of positronium state V,,:,: 


CW pstr=TOV pstr} MWY pstr= TSY pstr; 


MW pstr= TF pstr, (13.1) 
where C, M, and MM, are given by Eqs. (8.21), (8.26), 
and (8.34) using the angular momentum representation. 
The lowest positronium state V,,,, may be a singlet 
state or a triplet state, i.e., we shall have an expansion 
of the type 


Wostr= >, 09+ (w,x,m,p)G_(w',«’,90',p')Wrac (13.2) 


corresponding to the resultant total angular momentum 
zero or one. 

However, for our purpose of determining the 7’s, we 
do not need to handle the general expression in Eq. 
(13.2). From Eqs. (8.21), (8.26), and (8.34), we see 
that C, M, and M- only change the charge and the 
magnetic quantum number and leave unchanged the 
other quantum numbers. Therefore, in an expression 
like Eq. (13.2), we have only to pick up, as repre- 
sentative terms, a small group of terms such that the 
effect of Ç, M, and M, is limited within this group. 
Since C and M must commute with the resultant 
angular momentum and its z-component of the com- 
pound system, the value of mc and ms determined with 
respect to a small group of representative terms must 
be the same as for the entire expression. The operation 
4M, is commutable with the total angular momentum 
but not with its z-component, therefore zz will have a 
meaning only for a state for which the z-component of 

athe total angular momentuin is zero. 

The simplest representative terms in Eq. (13.2) 
- would be those which correspond to a positron and a 
n Sra both in the same S-state: o= w, k=K=—1, 


g A e C. N. Yang, Phys. Rev. 77, 242 (1950); 
en D D. G. Ravenhall, Phys. Rev. 88, 279 (1952); 
y R. Yost, Pi Rev. 87; '871 (1952). 
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Taste VII. The charge parity mc and the space parity ms and 
the x-parity mz of the lowest positronium states. 


Waring Wtrip™) Wtrip ®) 


Vtrip-)) 
TC + — = a 
TS oe a 9 A 
Tz n + g 


pe 


the following representative terms: 
Voing= = (1/V2)[9-, il, 3, 1)9_, = il, = 1) 
> 1)9_-(, =i; D DIL 


—ğlw, =l (13.3) 


The triplet state will have the representative terms: 


Vip =9;(, —1, 4, 1g-(@, —1, 4, 1) Wve, (13.4) 
Vein = (1/V2)[9+(@, —1, 3, Dg_-(@, —1, =, 1) 

+9+(o, —1, —3, 1)g_-(, -1, 4, 1) vee, (13.5) 

To =§ (w, —1, —3, Dg_(@, —1, -3, 1) Vac (13.6) 


The superscript on Wirip indicates the z-component of 
the resultant angular momentum. 

By applying Eqs. (8.21), (8.26), and (8. 34) to Eqs. 
(13.3), (13.4), (13.5), and (13.6), we can determine the 
ms in the sense of Eq. (13.1). The results are listed in 
Table VII. However, Wtrip and Firip are not 
eigenstates of M+; they are interchanged by application 
of Mz. 

Remembering that the charge parity of a photon 
state is just the parity of the number of photons present, 
as in Eq. (6.32), we conclude that Wirip cannot decay 
into an even number of photons and Wsing cannot decay 
into an odd number of photons. 

Next we shall study two-photon states more closely 
using the representation used in Sec. 6C. If we fix the 
coordinate system to the center of mass of the decayiny 
positronium, the two photons will have equal and op- 
posite momenta, say k; and —k.. We can make four 
independent states: 


W,= (1/v2)[9(&.,1)9(—k, —1) 
AG (Re —1)9(— he, +1) vee, 
Y= (1/V2)[9(k.,1)9(— ke, — 1) 
~G (k —1)9(— ka +1) JY vas 
Ws=G (he,1)9(—Re, 1)Wvea0; 
W4=G (ka, —1)9(—k., —1)Wvoc 


(13.7) 


The states WY; and Y, do not interest us very much, 
because the 2-component of the angular momentum is 
+2, which is certainly too large for comparison with 
the lowest positronium states or with the neutral pion 
at rest. 


> 


i 
$ 
i 
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TABLE VIII. The charge parity xc, space parity xg and z-parity 


« mz of two-photon states. Here “pol” stands for planes of polariza- 


tion of two photons. 


Tagi IX. Connection between the “Kind” 
and the intrinsic parity. 


7 Scalar Vector 
Yı Ve P3 EZ Kind reg. 2nd Ist, 3rd reg., 2nd Ist, 3rd 
— — £ er 
Te + + + + Intrinsic parity ae = = A 
its =F = + T = < PRESE he 2e 
Tz ° ae — x X TT Toe in 
pol i ails x x 


The charge parity re, space parity rs, and x-parity 
Tz are determined with the help of Eqs. (6.36) and 
(6.38). The results are listed in Table VIII. 

Since rs of positronium is — 1, the only possible two- 
photon state into which it can decay is WY». Since re of 
any two-photon state is +1, it can only originate from 
YFsing. Thus, the two-photon decay of positronium must 
be 


e Wsing Wo. (13.8) 
Before the fruitful application of charge-parity was 
introduced, they excluded ¥;,;, WV, by comparing rz. 
“It is interesting to characterize Y; and Ws, in Eq. 
(13.7), by the correlation of planes of linear polarization 
of the two involved photons. In Eq. (6.41), an operator 
II_(#.) was introduced, which measures the probability 
of a photon having momentum k, and linear polarization 
in the x-direction. Then the operator, 

Mn= M, (-.)-(—&.) +I, (k) Iy (— k), (13.9) 
will measure the probability of two photons having 
momenta k, and —k. and having parallel linear 
polarization. In the same way, for the probability of 
two photons being observed as having perpendicular 
planes of lineàr polarization, we get an operator 

M= N, (k), (— k:) +1, (kU -(—k.). (13.10) 
By applying Eqs. (13.9) and (13.10) to Eq. (13.7), we 
obtain, with the help of Eq. (6.41), 


MY =Y;, 
H¥:=0, 


HY: =0, 


lasan 
MY: =Y. 
Although, Y; and Y; are not eigenfunctions of II, and 
TI,, we can make eigenfunctions by taking Y:+Y¥, and 
W;—W,. This analysis shows that if we observe the cor- 
relation of linear polarization of two decay-photons of 
the positzoniim, they will turn out to be perpendicular 
to each other. x 

It may be worth noting also that, in the angular 
momentum representation, a two-photon state can 
have odd parity only if the values of / of the two 
photons are of different parity. See Eq.. (6.24). 

Next fet us briefly review the decay of the neutral 


%3 J. A. Wheeler, Ann. N. Y. Acad. Sci. 48, 219 (1946). 
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pion into photons. We have scen in Sec. 10 that the 
charge parity and the “kind” of the neutral pion are 
the same as their source, and they are given in Table IV. 
The “intrinsic” parity of the neutral pion is the space- 
parity of the state in which a single particle of. this 
neutral field is present and at rest (k=0). This sign 
corresponds to the symbol (+) in Eq. (5.23) or (5.32), 
and the double sign (+) in Eq. (5.16), and is deter- 
mined by pw in Eq. (10.10). The connection between 
the “kind” and the intrinsic parity is given in Table IX. 
This is nothing but a rearrangement of a portion of 
Table IN, in Part I. In the case of a vector, the space- 
component determines the parity. 

According to Sec. 10, there are actually six cases of 
2) -field : 


(1) scalar having (S)-interaction, 

(2) scalar having (V)-interaction, 

(3) vector having (V)-and (7)-interaction, 

(4) pseudoscalar having (P.S)-and (PV)-interaction, 
(5) pseudovector having (PV)-interaction, 

(6) pseudovector having (PT)-interaction. 


The mixing of (1) and (2), and the mixing of (5) and 
(6) are forbidden, because of charge-symmetry and 
reversibility. Table X gives the intrinsic parity and 


TABLE X. The charge parity re and the intrinsic space-parity 
zs of a neutral pion classified according to its source. 


(1) Q) 3) a) 5) (6) 
S(S) SV) V(V,T) PS 
(PS.PV) PV(PV) PVIPT) 
- n Se 
Kind reg. 2 2 1 54 3 
TS SF E = z= 48 zig 


charge parity of these six kinds of neutral pions, as can 
be derived from Tables IV and IX. 

If experiment shows that the neutral pion decays 
into two photons, it means that r-=-+-1, and the pos- 
sible type of pion is limited to (1), (4), (5).* If further 
See ees a a 


= 2 
4 More generally, we can see from Table X that a neutral pion 
of type (1), (4) or (S) cannot decay into an odd number of photons, 
and a neutral pion of type (2), (3) or (6) cannot decay into an even 
number of photons. This law is known as Fukuda-Miyamoto’s 
rule. H. Fukuda and Y. Miyamoto, Prog. Theor. Phys. 4, 347 
(1949). K. Nishijima, Prog. Theoret. Phys. 6, 614 (1951). Since tie 
charge parity of a photon is odd, the py of the number of 
photons cannot change if photons are the only kind of pagio 
present in the initial and final states, no matter what the inter- 
mediate process may be. This js Furry’s rule. W. H. Furry, Phys. 
Rev. 51, 135 (1937). n s Poe a 3 
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° 
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experiment’ shows that the planes of polarization 
of two photons are always perpendicular, it means 


___ ms=—1, and the only possibility left is (4). 


The author would like to thank all those who dis- 
cussed with him one or the other of the subject matters 
dealt with in this work, thus indirectly helping him to 
prepare this survey article. In particular, he is grateful 
to Professor E. P. Wigner for many enlightening dis- 


_ cussions. Apologies are extended to the authors whose 


contributions to the field of problems treated in 
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VOLUME 27, NUMBER 1 


this paper have not been quoted at the appropriate 
places.® 


3 Among many others: E. P. Wigner and L. Eisenbud, Phys 
Rev. 72, 29 (1947); W. Pauli, lecture notes, Summer School of 
Theoretical Physics, Les Houches, 1952; F. Coester, Phys/ey 
84, 1259 (1951); F. Coester, Phys. Rev. 89, 619 (1953); LAMfichel, 
Nuovo Cimento, 10, 320 (1953); G. Liiders, Konge Danske 
Vidensk. Selsk. Mat. fys. Medd. 28, 1 (1954); Umezawa, Kame- 
fuchi, and Tanaka, Soryushi-Ron-Kenkyu (in Japanese) 6, 543 
1954; the same authors, Prog. Theoret. Phys. (to be printed): 
Y. Katayama, Soryushi-Ron-Kenkyu (in Japanese) 6, 543 (1954): 
E. R. Caianiello, Physica 18, 102 (1952); and other papers of 
the same author; S. Oneda, Prog. Theoret. Phys. 9, 327 (1953). 


JANUARY, 1955 


Erratum: The Energy Levels and the Structure of Light Nuclei 


a D. R. INGLIS 
Argonne National Laboratory, Lemont, Illinois 6 
[Revs. Modern Phys. 25, 390 (1953) ] 


Figure 9 is in error in having the low ?P plotted four units too 
low and is to be replaced by the accompanying figure. This removes 
what might have been considered a reason for doubting recent ex- 
perimental suggestions! that there may be excited states of Be? 
near 1.8 and 3.1 Mey, indicated by dotted lines in the inserts of the 
figure, in addition to the well-established 2.43-Mev state recently 
observed? to have J > 3 as now appears consistent with this rough 


2 © (Ls) 2.5 455 


exploratory treatment of the theory. The error was discovered 
and very kindly pointed out by Edith Halbert and Sudkir Pandya 
of the University of Rochester, who are calculating the interme- 
diate-coupling transition of the lowest states of several (J,T).- 


1 Kunz, Moak, and Good, Phys. Rev 95, 640 (1954) ; F. Ajzenberg and T. 
Lauritsen, Revs. Modern Phys. 27, 91 (1955) which quotes’ K.j W. Allen 
on B!°(t,@) Be. 

2 F. L. Ribe and J. D. Seagrave, Phys. Rev. 94, 934 (1954). 


ark 10 ë (jp 


Fic. 9. Intermediate-coupling transition for the configuration p: applyin; 
© to Py’. The isobaric spin is T =} except where indicated as T = Mi g 
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Energy Levels of Light Nuclei. V* 


F. AJZENBERG, Department of Physics, Boston University, Boston, Massachusetts 


AND 


T. LAURITSEN, Kellogg Radiation Laboratory, California Institute of Technology, Pasadena, California 


INTRODUCTION 


HE: present compilation summarizes currently 

available experimental information on the loca- 
tion and properties of the energy levels of the light 
nuclei from Heë to Ne”. In form and arrangement, it 
follows closely the plan of four previous compilations'* 
on the same subject. Each nucleus is represented by a 
diagram, on which the known energy levels are plotted, 
together with the nuclear reactions in which they are 
involved. Fhe diagrams are supported by a text in 
which an attempt has been made to synthesize the 
pertinent iaformation on each reaction as abstracted 
from published papers. Insofar as possible, the present 
@dition has been made self-sufficient, in that each dia- 
gram entry of a level position or quantum-number 
assignmert finds some defense in the cited literature. 
Because of space limitation, however, much important 
information contained in earlier versions of the sum- 
mary has been omitted here, and for a more detailed 
discussion of certain reactions and for the complete 
(post-1937) bibliography, the reader may wish to refer 
to these earlier summaries, particularly the 1952 
version. 


Arrangement of the Material 


As before the diagrams contain, in addition to a 
representation of the level structures, an indication in 
each case of the experiments through which particular 
levels have been found. Such experiments generally fall 
in one of two categories: those in which the given nuclide 
occurs as the intermediate stage in a reaction, and those 
in which it represents the final stage. Where a compound 
nucleus is formed, the excitation function may yield 
rather detailed information about the energy levels of 
the intermediate system. To illustrate the general 
nature of this information, particularly as regards the 
breadths, relative intensities of excitation, and pre- 
ferred modes of decay of the various levels, the thin- 
target excitation functions are shown alongside the level 
diagram itself, with approximate relative yields plotted 


* This work has been supported in part by the U. S. Air Force, 
through the Office of Scientific Research, and by the joint program 
of the Office of Naval Research and the U. S. Atomic Energy 


Commission. 


1W. F. Hornyak and T. Lauritsen, Revs. Modern Phys. 20, 


191 (1948). "4 
2 Ý Hee rat N. R. C. Preliminary Report No. 5 (1949). 
3 Hornyak, Lauritsen, Morrison, and Fowler, Revs. Modern 
Phys. 22, 219 (1950). 
cP eae and T. Lauritsen, Revs. Modern Phys. 24, 321 


(1952). 
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horizontally against bombarding energy in a vertical 
direction. To permit direct association with the levels, 
the vertical scale is adjusted to center-of-mass energies, 
with values of bombarding energy indicated in labora- 
tory coordinates. The cross section scales are arbitrary, 
different for various products, and in some instances 
rather strongly distorted to exhibit weak resonances. 
All definitely established resonances are indicated by 
leaders, and where space permits, with the (laboratory) 
energy corresponding to the peak cross section. Where 
no resonances appear in an excitation function, or 
where there is reason to believe that the reaction pro- 
ceeds without formation of a compound nucleus, a 
brace within the level diagram indicates what products 
are observed. 

Reactions in which the nuclide of interest occurs as 
the residual nucleus are indicated by the notation 
X-+a—b, where X, a, and b are the target nucleus, 
bombarding particle, and ejected particle respectively. 
Horizontal lines with this notation are located on the 
drawings at a height above the origin equal to the 
reaction Q value, as calculated from the mass table. 
The highest bombarding energy at which observations 
have been made is shown by a vertical arrow, roughly to 
scale, and observed groups of particles are indicated by 
slanting arrows, terminating on the levels with which 
they are presumed to be associated. Subsequent gamma 
transitions are indicated by vertical arrows. It should 
be noted that not necessarily all of the groups shown 
have been reported at the bombarding energy which 
appears on the diagram; bombarding energies used in 
particular experiments are given in the text. A few 
unreported reactions have been included in the dia- 
grams to provide a convenient reference for their 
Q values. 

On each level in the diagram appears a number which 
gives the excitation energy. Where a single level is 
reported in several experiments, a weighted mean value 
has been used for this number. It is, of course, not 
always certain that the different experiments refer to 
the same level, but we have assumed that they_do so_ 
wherever the probable errors overlap and where there 


appears to be no obvious inconsistency in such an - | 


interpretation. Levels, particle groups, or y transitions 

whose existence is uncertain are represented by dashed 

lines; uncertain quantum numbers are-enclosed in 

parentheses. In some cases, where a level is particudarly 

broad, this fact has been indicated by cross hatching. 

Among the lighter nuclei, where the level spacing is 
2 


a ma > 
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estimated half-width. 

The discussion of reactions in the text is divided in 
the same way as on the diagrams. Features of the reac- 
tions such as cross sections, excitation functions, and 
alternate modes of decay, which relate primarily to the 
compound nucleus, will be found discussed under that 
nucleus; particle-group spectra, y transitions, and 


78 
; large, the breadth of the cross hatching indicates the 
a 
| Q values are entered under the residual nucleus. Angular 


t + 

i distribution data are entered in one or the other 
category, depending upon which seems the more ap- 

i propriate to the case in hand. 

i Since the emphasis in this summary is on experi- 

| mental results, no attempt has been made to treat 

| theoretical papers in any systematic way or to main- 


tain a complete bibliography of such papers. An ex- 
tensive theoretical discussion, specifically directed to 
the light nuclei, is presented in a recent article by D. R. 
Inglis.® 


Masses and Q Values 


The mass values used in the present compilation are 
listed at the end of the article. Almost all of these have 
been taken from the tables of Li ef al. and C. W. Li,’ 
which are based on measured Q values. For nuclides 
not included in these tables, we have estimated the 
masses from the available experimental data; the de- 
tails of these estimates will be found in the text. For 
each reaction, the Q value calculated from the masses, 
designated @m, is listed with the reaction heading, as is 
the binding energy of the bombarding particle in the 
compound nucleus, Æ». Both quantities are expressed 
in Mey. Although the mass values generally are assigned 
probable errors of fifteen to thirty kev, the Q values are 
usually more accurate. Experimentally determined Q 
values are cited in the text, whether they are among 
those used in fixing the masses or not. A summary of 
Q values has been recently published by D. M. Van 
Patter and W. Whaling.* 


Conventions 


Within the limitations imposed by the requirement 
of brevity, we have endeavored to avoid the use of 
unfamiliar conventions and abbreviations. We set 
forth here some of the symbols which we have used, 
together with their definitions: 


E Energy in Mev, laboratory coordinates unless 
a © otherwise specified. Subscripts p, d, t, etc., 
refer to protons, deuterons, tritons, etc. 
Angle of observation, in laboratory coordinates 
unless otherwise specified. 


_ 52D. R Inglis, Revs. Modern Phys. 25, 390 (1953). 
£ Li, Whaling, Fowler, and Lauritsen, Phys. Rev. 83, 512 


-i, Phys. Rev. 88, 1038 (1952). 
Man Patter and W. Whaling, Revs. Modern Phys. 26, 


n 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


LAURITSEN 


Eres General: the bombarding energy at which 
resonance occurs; specific: the bombarding 
energy at which the resonant part of the phase 
shift of the partial wave in question reaches 90°. 

T Observed width of a resonance, in kev; Mbora- 
tory coordinates unless otherwise specitied, 

E, “Characteristic” energy of a level as determined 
by certain conditions on the logarithmic deriva- 
tive of the wave function at the nuclear surface9 

wl’, Statistical weight factor times partial width of 
particle x. 

n? “Partial” width, after removal of barrier pene- 
tration factors, evaluated at Fy. 

J Total angular momentum. 

T Parity. 


T,T, Isotopic (or isobaric) spin quantum numbers: 


T,=1(N—Z). 
ACKNOWLEDGMENT 


Whatever claim the present compilation,may make 
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He> 
(Fig. 1) 
Mass of He 


The most direct observations on the mass of He’ 
appear to be: 


r 


1. Het(n n)Het: Ereas=0.95 (center of mass); mass 
defect= 12.92+0.05. 

2. Li (ta)He5: Heï=Het+n+0.97+0.05; mass de- 
fect= 12.94+0.05. i 

3. Lit (d He*) He: Qo>= 0.91 0.09; mass'defect= 12.85 
+0.09. 

4. Li” (d «)He®: Qo=14.23+0.07 ; mass defect= 12.85 
0.07. 


We adopt: He? mass defect= 12.92+0.08 Mev. 


I. (a) H?(d n)Het 
(b) H2(¢ n)Het 


The yield exhibits a pronounced maximum near 
E.z=107 kev, with a cross section of 4.95-40.14 b 
(Ar 54)j [see also (Aj 52c) ]. Precise cross sections in the 
range Ez=10 to 1700 kev are tabulated by (Ar 52d), 


Om=17.577 Es=16.63 


9E. P. Wigner and L. Eisenbud, Phys. Rev. 72, 29 (1947); T. 
Teichmann and E. P. Wigner, Phys. Rey. 87, 123 (1952). 

t References are indicated by the first two letters of the first 
named author and are listed at the end of the article. 
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(Co 52f), (Ar 54), and (Ja 53d). The course of the cross- 
section, curve is adequately described by the single- 
level resonance formula, assuming s-wave deuterons 
forming an intermediate state with J=3/2*, with 
d-wave emergent neutrons. Typical parameters are: 


“‘ya@= 10 kev-cm (0.42 of the sum-rule limit), 7,2=2.8 


X10" kev-em (Co 52f, Ar 52d, and Vo 54). An analysis 
in terms of a complex scattering length has been made 
by (F151). At low energies the cross section ap- 
proaches the simple Gamow exponential form: Eo 
=A exp(—BE™) with B=44.40 kevi (Ar54), 43.8 
kev! (Ja 53d) (2me?/ho=44.40 E-3), 

The angular distribution of products is isotropic at 
resonance and below [see (Aj 52c) ]. Angular distribu- 
tions have also been studied at E,=1.5 Mev (Ar 54a), 
Ea=2.21 Mev (St52c), and 10.5 Mev (Br 51b) 
[see (Aj 52c)]. The distribution at high energy is 
dominated by the stripping process (Bu 51i). See also 
(Ja 53e) antl (Jo 54b). 


IL. H°(d py 
Not observed: (Mc 51a). 


II. H3(dd)H? E,=16.63 


Differential cross sections for Ez=0.96 to 3.22 Mev 
(@=44° to 132°, c.m.) are tabulated by (St 52e), and 
for Ha=10.2 Mev (6=29° to 140°, c.m.) by (Al 52b). 
See He?(d d)He’. 


IV. H3(¢n)He> Qn=10.37 


Two neutron groups, attributed to the ground state 
and an excited state of He® at ~2.6 Mev, are reported 
by (Le 51a). 


V. H3(He? pfHe® O,,=11.13 

Qo= 11.18-0.07, Mev [ (Al 53a) ; photoplate] 

Qo=11.13+-0.07 Mev [(Mo 53d); scint. spec- 
K trometer ] 

The spectrum shows a well-defined proton peak 
corresponding to the ground state, superposed on a 


considerable background attributed to the three-body 
breakup (Al 53a, and Mo 53d). There is no evidence in 


Taste I(5). Total neutron cross sections: Het (n »)Het. 


En (Mev) a(b) 
1.154 1.4 
2.49b 3.1640.06 
a  2.99> 2.79+0.06 
4.95¢ 2.1830.06 
710° ° 1.780=-0.05 
12.35¢ 1.1770.03 
14.14¢ 1.0053-0.03 
17.974 0.848+0.03 
19.004 0.816+0.02 
20.074 0.770+0.02 


a (Ba ) 

b rted in (Se 53). vE 

e x i Feo reper T £ E. Perry (private communication). 
d (Da 
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Fic. 1. Energy levels of He®. In these diagrams, energy values 
are plotted vertically in Mev, based on the ground state as zero. 
Uncertain levels or transitions are indicated by dashed lines; 
levels which are known to be particularly broad are cross-hatched. 
Values of total angular momentum (J), parity, and isotopic spin 
(T) which appear to be reasonably well established are indicated 
on the levels; less certain assignments are enclosed in parentheses. 
For reactions in which He® is the compound nucleus, thin-target 
excitation functions are shown schematically (where known), 
with the yield plotted horizontally and the bombarding energy 
vertically. Bombarding energies are indicated in laboratory co- 
ordinates and plotted to scale in center-of-mass coordinates, 
Excited states of the residual nuclei involved in these reactions 
have generally not been shown; where transitions to such excited 
states are known to occur, a brace has been used to suggest 
reference to another diagram. 

For reactions in which the present nucleus occurs as a residual 
product, excitation functions have not been shown; a vertical 
arrow with a number indicating some bombarding energy, usually 
the highest, at which the reaction has been studied, is used instead. 


either spectrum for the existence of an excited state of 
He*. If the transition in question occurs at all, it is 
either much less probable than that to the ground state, 
or else the level is much wider: see (Go 54c). 


VI. Het(n n)He! E,=—0.95 


The thermal cross section is 0.78 b,(Hi 51). The total 
cross section has been measured from 40 kev to 5 Mev 
by (Ba 51a) and from 2.5 Mev to 20 Mev by the Los 
Alamos group. It exhibits “a maximum of 7.4sb ata 
£,=1.15 Mev and thereafter decreases monotonically 
to 0.77 b at E,=20 Mev. Total cross sections at - - 
several energies are listed in Table I(5). 

Angular distributions have been studied in the range 
E,=0.4 to 2.7 Mev by (Ad 52), E,=3 to 4 Mev by 
(Hu 52c), at E,=2.6, 4.5, 5.5, 6.5, and 14.3 Mev by 
(Se 53) and at E,=14.1 Mev by (Sm 54). The course of 
the phase shifts is similar to that of’ Het(p p)Het 
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(see Li’) and is described by the same level parameters, 
with a shift of about 1 Mev in the characteristic ener- 
gies. There is evidence for an „appreciable negative 
D-wave phase shift; a part of this may be associated 
with the Dj level at 16.8 Mev as well as with another 
not yet located Dy level [(Se 53), (Do 52b), and 
(Al 54c)]. The *(negative) s-wave phase shift varies 
roughly as (E,)!, and can be accounted for by potential 
scattering in a well of radius 2.4X10~% cm. A fit with 
a larger radius requires the assumption of a high, 
broad level corresponding to a 2s neutron (La 54a). 
See also (Si53), (Br54a), (Hu 54a), (Ta 54a), and 
(Hi 54c). 


VII. (a) Het(d p)He® = Qn=—3.17 
(b) He'(d p)He!+2 QOm=—2.225 


Q values for reaction (a) of —2.9 and —3.2 Mev are 
reported [see (Aj 52c) ]. Q= —3.100.05 (Fr 54h). 


VIL Li8(nd)He® On=—2.43 
Qo=—2.5740.10[ (Fr 54b) ; photoplate ] 


The deuteron spectrum, observed at En=14 Mev, 
shows a well-defined group with a width of 0.8 Mev 
(c.m.), corresponding to the ground-state transition. 
Tn addition there is observed a continuum of deuterons, 
attributed to transitions to a broad excited state, 
extending to at least 4 Mev in He’. There is no evidence 
for a sharp group in this region. Angular distributions 
of both the ground-state and excited-state deuterons 
indicate a pickup process, with /,=1 (Fr 54b). 


IX. Lif He*)He® Q,=0.83 
Qo=0.91+0.09 (Le 55; mag. spectrometer). 


This reaction has been observed at E4=14.5 Mev. 
The ground-state group has a c.m. width of 0.69+0.2 
Mev [ (Le 54) and (Le 55) ]. 


X. Li8(ta)He® Qn=15.15 


From the Q value of this reaction, determined at 
E,=230 kev, it is found that He’=He'+n+0.97 
+£0.05 Mev (Craig, Cross, and Jarvis, private com- 
munication). 


XI. Li@)He® Qn=—3.41 


The angular distribution of tritons exhibits a forward 
maximum at E,=14 Mev (Fr 54b). 


XII. Li(da)He® Qn=14.16 


Qo= 14.26+0.09 (Le 55; mag. spectrometer). 
-Qo=14.2+0.1 (Cu 53a; photoplate). 


The observation that the angular correlation of 
ground-state «œ particles and those resulting from the 
He? breakup is not isotropic is consistent with a P, 
assignment. (Fr 51b). 

At £2=0.98 Mev, using, nuclear emulsions with 


partial magnetic separation, (Cn 53a) report a-particle 
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groups corresponding to the ground state, with a width 
of 0.3-40.1 Mev, and to an excited state at 2.50.2 Mey 
with a width of 1.50.3 Mev. [It does not appear ex- 
cluded that the second group may come from levels at 
7-10 Mev in Be®, produced by Li’ (d m)Be®. ] At Eg=14.5 
Mey, the ground-state group is observed to~have a 
c.m. width of 0.66+0.2 Mev (Le 55). 


Lié 
(Fig. 2) 
Mass of Li’ 


The three most direct observations on the mass of 
Li® appear to be: 


1. He3(d y)Li® Qo= 16.360.2: mass defect 
=13.17-40.2 Mev 


2. He? (He? p)Li® Qo=10.86--0.15: mass defect 


=13.18+0.15 
3. Li8(d Lit Qo=0.9+0.1: mass defect 
=12.85-+0.1 G 


The weighted mean of these three values is in goci 
agreement with an earlier adopted value of 12.99+0.3 
(Aj 52c) ; we adopt: Li® mass defect= 12.99-+0.15 Mev. 


I. He'(dy)LiF Qm= 16.54 
Qo=16.36+0.2 (Hi 54, Bl 54: scint. counter) 


Capture gamma radiation of energy E,=16.6+0.2 
Mev is observed at Ea=0.45 Mev. The experimental 
radiation width at resonance is calculated to be 11:2 ev 
consistent with the T, expected for an 1 transition. 
The excitation curve measured from Ea=0.2 to 2.85 
Mev shows a broad maximum at Ez=0.45+0.04 Mev. 
Above this maximum, nonresonant capture is indicated 
by a slow rise of the cross section. The angular distribu- 
tion is isotropic within 10 percent from 0° to 90° at 
Ea=0.58 Mev. At the resonance the cross section is 
estimated to be 0.05-£0.01 mb [ (Hi 54) and (BI 54) ]. 


Il. He?(d p)He! Qm=18.341 E,=16.54 


The cross section exhibits a broad maximum at 
E.z=400 key (Bo52a), Ea=430+30 kev (Ya 53), 
E(He’) = 640 kev (Ea=430) (Ku 53). The peak value 
is 0.69+0.03 b (Bo 52a), 0.72 b (Ku 53), 0.9040.10 
b (Ya 53), 0.94-£0.08 b (Fr 54a). Cross sections from 
188 kev to 3.5 Mev are tabulated in these references. 
From 36 to 93 kev the cross section fits the Gamow 
exponential form E,=A exp(—BE™~) with B=91.0 
kev? (Ar 54), 87.7 kev? (Ja 53d). Below about 600 kev, 
the cross-section curve is wel fitted by the one-level 
formula, assuming s-wave deuterons forming 4 J=3* 
state, emitting d-wave protons. Suitable parameters are 
ye=5.0X10-” kev-cm (0.21 of the sum-rule limit: 
Vo 54), ¥2=9.7X10-” kev-cm (Bo 52a). (BI.54) give 
¥e°=12.8X10-” kev-cm, based on a peak cross section 
of 0.9 b. 
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-5.50 
Li°+y-n 


The angular distribution of protons is isotropic at and 
below resonance [(Ja 53d), (Bo52a), (Ya 53), and 
(Ku 53) ]. Distributions at higher energies have been 
studied by (Bo 52a) and (Ya 53). At Ez=10.2 Mev, 
the stripping process dominates (Bu 51i). 


Ill. He (d d)He £,=16.54 

The elastic cross section is appreciably less than 
Rutherford scattering at 65.4° (c.m.) for Ea=380 to 
570 kev, The data are consistent with the assignment 
of 3+ for the 16.8-Mev state in Li® (Fr 54a). The varia- 
tion with angle and energy of the cross section (meas- 
ured with an accuracy of 4 to 8 percent) in the range 
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Li°+p-d 


Ea=1.0 to 3.25 Mev is quite similar to that of H?(d d)H® 
particularly at large scattering angles and at the higher 
energies (Br 54b). At Z,=10.2 Mev, the differential 
cross sections are identical for angles >45°. The a=gular, 
distribution at this energy indicates that incoming 
orbital angular momenta as high as 2 and possibly-3 : - 
may be involved (Al 52b). 
ə 

IV. He(He p)Li® Qm=11.05 wt 

Qo= 10.86=-0.15 Mev (Al 53a; photoplate). oes 

The spectrum at E(He*) =0.24 Mev shows an un- 
resolved proton group superpesed ora continuous back- 


e “o > 
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ground attributed to the three-body breakup. The width 
of the ground state appears to be larger than 1 Mev 
(Al 53a). No evidence is found at Z(He*) =0.36 Mev for 
well-defined proton groups of lower energy than the 
ground-state group (Go S4c). See also Be*. 


V. He!(p p)He'* E,=—1.80 


1. E,<3.6 Mey. Angular distributions have been 
measured at several energies: a broad maximum in the 
backward scattering appears at E,~2.2 Mev (Fr 49c). 

2. Ep=4.8, 5.1 Mev: see (Br 51h). 

3. E,=5.78 Mev. Cross sections with a precision of 
about 2 percent have been measured at 26 angles by 
(Kr 54). The phase shifts found are S=47.9°, P= 
P,=112.9°, D;=—1.3°, and D;=—0.49°, assuming 
8 an inverted doublet. 

i 4. E,=9.48 Mey. Cross sections with a precision of 
about 3 percent have been measured at 43 angles by 
(Pu 52a). The phase shifts are given in (Do 52b). At 
E,=9.73 Mev, (Co 54g) report general agreement 
with the results of (Pu 52a) at forward angles but find 
cross sections only about 75 percent as great at back 


angles. 
oy 5. Ey=17.5 Mev. Cross sections have been measured 
J at 33 angles by (Br 54). The phase shifts found are S= 


—100.7°, P;=36.4°, Py=81.2°, Dy=10.9°, D;=157.7° 
or (—22.3°), F=1.2° (K. Brockman, private com- 
munication). 

6. Ep=19.5 Mev: see (Co 531) 

7. E,=28 Mev. Cross sections at 15 angles are given 
by (Wi 54d): an upper limit of 2 mb is set for the cross 
section of the reaction He'(p py) He’. 

8. E,=31.6 Mev. Cross sections at 6 angles are re- 
ported by (Co 531). 

A phase-shift analysis, covering experimental data 
for E,<3.6 Mev, has been carried out by (Cr 49a) and 
extended to E,=5.78 and 9.48 Mev by (Do 52b). The 
ie P, phase shift rises steeply, passing through 90°at 
1 Re about E,=2.5 Mey (lab), flattening off at 110° at 5 Mev 
ip and decreasing slowly thereafter (see Kr54). The 

S, (negative) and P, phase shifts show a monotonic 
increase with energy, and there are indications of small, 
negative D phase shifts at the higher energies. The 
behavior of the P; phase shift can be well accounted 
for by the one-level Breit-Wigner formula, with 
Exoo~2.1 Mev (c.m.), y?=17.6X10-" Mey-cm (Ad 52) 
(0.75 of the sum-rule limit: Vo 54), y°=25X10-¥ 
Mey-cm (Do 52b). 
The situation of the P; “resonance” is not so clear: 
(Ad 52) suggests on the basis of the lower energy data 
=- that the two states are of about equal reduced width 
and are split by ~3 Mev (Eres, lab). (Do 52b), on the 
other hand, find a much greater width, y’~100X 10-8 
Mev-cm, for the P; level, and a considerably greater 
splitting (AZ,=7.5 Mev, c.m.). An analysis in terms 
e-body potential with spin orbit coupling ap- 


vor a Gaussian well’over square or exponential 
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Early results on «-particle scattering in hydrogen 
(Ts 40) which indicated several resonances below the 
ground state of Li are not confirmed by (Ru 54): 
E.=3.1 to 5.3 Mev. 


VI. (a) Het(p dHe QOn=—18.341 E,= “1.80 
(b) He!(p pu)He? Qn=—20.566 ~“ - 
Angular distributions are reported at E,=27.9 Mey 
(Wi54d) and 31.6 Mev (Be 52a) for reaction (a), 
For reaction (b), see (Wi 54d). 
VII. He'(p 26)H? Qn=—19.802 H,=—1.80 
See (Wi 54d). 


VIL. Li®(y ) Lib m= — 5.50 
Qo= — 5.6 Mev (Ti 53c; photoplate). 


The photoneutron threshold is 5.35+0.2 Mev 
(Sh 51d). The existence of an excited state of Li? at 
~2.5 Mev is suggested by the data of (Ti £1). 


IX. (a) Li(d )Li® Om=0.75 
(b) LiS(d p)He!-+H? Ọm=2.56 


Qo=0.9-0.1 Mev [(Fr 53b) ; mag. spectrometer ]. 

At Ea=1 Mev, a broad triton group corresponding to 
the ground-state transition is observed. The width of 
the ground state is 1.5 Mev. A continuous proton distri- 
bution, attributed to the Li® breakup, is also observed 
with a sharp upper limit at 2.80 Mev. The tritons are 
emitted preferentially in the forward direction (Fr 53b). 


X. Like a)Li® Qm= 15.06 


o=14.850.15 (E. Almqvist, private, communica- 
tion; prop. counter). 
See also (Ku 53b). 


He! 
(Fig. 3) 
I. HeS(GLIfF Qn=3.55 
E,g(max) =3.500.05 Mev (Wu 52). 


The half-life is 0.799--0.0034 sec (K1 54), 0.86-£0.03 
sec (Sh 52), 0.83-40.03 sec (Ba 53), 0.840.03 sec 
(Ve 52a): log ft=2.94 (Ki52). See also (Aj 52c); 
(Al 53b), and (Ru 53). 


Il. (a) H3(éa)2n 
(b) He (t a)n? AANA 
(c) H8(¢m)He® Q,,=10.37 


The spectrum of alpha particles emitted in reaction 
(a) has been observed at £,=220 kev. A possible group 
from reaction (b) has not been confirmed. For reaction 
(c), see He®. The cross section for neutron production 
rises monotonically from 0.1 to 2.2 Mev, reaching & 
value of about 0.18 b at 2.2 Mev. See also (Aj 52c). 


QOm=11.320 E,=12.25 
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SLi 
-2353 : 11-29-54 -355 Li+ n-p 
Lit +1-He Site 
452 4 
OOI Li” d- He? 
Li'ey-p * 
5 Fic. 3. Energy levels of He®: for notation, see Fig. 1. 


IIL. Li®(n p)He® 
See Li’. 


On=—2.77 


IV. Li’ (n d)Heé m= —7.19 
See Li! and (Fr 54b). 


V. Lii@He)He® Q,=—4.52 


He? groups to the ground state and the 1.71-Mev 
state have been observed at Ey=14.5 Mev. The He? 
angular distributions, analyzed by pickup theory, 
indicate an L=1 transfer for both groups and therefore 
even parity for the ground state and the 1.71-Mev 
state. At the maxima of the angular distributions 
(6~18°, c.m.), the cross sections are, respectively, 
8.0 mb/sterad and 2.0 mb/sterad (Le 54 and Le 55). 


VI. Lit(@a)He® Qn=9.79 


Two alpha groups are reported by (De 52), corre- 
sponding to the ground state and an excited state at 
1.710.01 Mev (see Aj52c). Further studies at 
E,=215 and 700 to 900 kev indicate a third group, 
corresponding to an excited state at 3.35-40.09 Mev 
(K. W. Allen, private communication). Angular dis- 
tributions of the @ particles to the ground state and the 
1.71-Mev level at E,= 240 kev are consistent with J=0 
and 2, respectively, for these states (Al 54d: see Be"). 


VII. Li” (y p)He® Qn=—10.01 

Analysis of tracks in Li-loaded emulsions are re- 
ported to indicate the excitation of a y-emitting level 
at 1.60.2 Mev (Ti 54a). See also (Ti 53b). 
VIII. Be?(xa)He® Q,n=—0.64 

See Be”, 
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Li’ 
(Fig. 4) 
I. (a) H?(He? d)Het "Om= 14.309 Ey= 15.786 
(b) H*(He? p)He® Om= 11.13 


(c) H?(He* p)He'+n On= 12.084 

The relative intensities (43-42, 642, 51-42) of 
reactions (a), (b), and (c), do not vary for E (He?) = 225 
to 600 kev. The deuterons are isotropic (c.m.) at £,= 360 
kev. The total cross section, reported for E (He?) = 100 
to 800 kev, varies from 0.5 mb to 0.18 b, without 
showing resonance behavior, the main variation being 
accounted for by the Coulomb barrier effect (Mo 53d). 
See also (Al 53a). 
Il. Het(d y)Li® Qn=1.477 


A 


An upper limit for capture radiation at #,= 1.055 
Mev (2.19-Mev state) is 0.1 mb (Si 54). 


II. (a) He'(d d)He! 
(b) H?(a a) H? 


The scattering cross section has been studied at 90°, 
120°, and 156° (c.m.) for Ez=1000 to 1200 kev. A 
resonance is found at E,.s= 1069 kev (lab.), Tya,=35 
kev, corresponding to the 2.189-Mev level of Li’, 
Analysis of the scattering indicates J=3+ and a reduced 
width of the order of the sum-rule limit; the most satis- 
factory fit to the data assumes a reduced width for the 
ground state of the same order [see Table I(6) ] (La 53). 
(Ga 54) confirm the resonance and the assignment. 
Further measurements at nine angles for Ez=0.3 to 
4.6 Mev show no evidence of a resonance corresponding 
to the 3.57-Mev level; this observation is consistent 
with the assignment T=1, J=O* to this state. A rise 
in the 173.6° (c.m.) cross section above 3 Mev, from 
0.1 b/sterad at 3 Mev to 0.5b/sterad at 4.6 Mev, coupled 
with a decrease by a factor of 2 in the 90° (c.m.) cross 
section, and a study of the angular distributions, is 
interpreted as indicating T=0 levels in Lif at 4.520.08 
Mev (J=2*) and ~5.4 Mey (J=1+, '=1 Mev). 
(Ga 54a and A. Galonsky, private communication.) 
The location of the J=3*, 2+, and 1+ levels is in good 
agreement with values predicted by (In 53). 

Differential cross sections have previously been 
reported for E4=0.88 to 3.51, 6.5, 7.94, and 10.3 Mev 
[see (Aj 52c)]. 


E,=1.477 


TABLE I(6). States in-Li’ from He‘(d d)Het. * 


sS 2 
Je T Lis T easi (kev) TZ (3A/2pa)a 
3, +,0 2.188=-0.002>.* 30 
2.189+.0.0015-4 @ = <5, 0.5 
2, +, 0 4.52+0.084 ~900 0.6 2 
1, +,0 4.9—5.84 > 020.6 


è Values from (Vo 54) and A. Galonsky (private communication), “ 

> The error assigned does not iaclude the 0.026-Mev uncertainty in the 
(He'-+-H?) —Lis mass difference. * p 

e 5 
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Fic. 4. Energy levels 
of Li’: for notation, 
see Fig. b 


IV. (a) Het(4 p)He® 
(b) Het(d p)He*+-2 
(c) Hed n)Li® 
Reactions (a) and (b) have been observed at 6.5, 
7.94, and 10.3 Mev. The total cross section has been 
measured at 10.3 Mev. The angular distribution of 
protons is peaked forward [see (Aj 52c) ]. Reaction (c) 
has not been reported. 


V. HELI Om=3.55 
See Hes. 


On=—3.17 Ey=1.477 
On=—2.225 
On=—4.03 


VI Lit(y m)Li® On=—5.50 


~The cross section is 0.5-+0.2 mb for E,=14.8 to 17.6 
Mev (Ti 51) 
VIL Li6(y @)He! On=—1.477 
The cross section is <Syb in the range E,=2.6 to 17 
Mev (Je 53b, Gl 52a, Ti 52, and Ti 54b). The reaction 


is forbidden by isotopic spin selection rules insofar as 
electric dipole absorption is concerned (Ge 53a). See 
also (Ga 53a). 


VII. Lif He O,,=—15.786 
This reaction has not been observed with 17.6-Mev 
gammas. The cross section is < 6+4ub (Ti 54b). 
IX. Lis(e e’)Li® 
See Li’: Li (e e’)Li7. 
X. (a) Li®(p p’)Lit* 
(b) Lif d’)Lis* 


Proton and deuteron groups are observed correspond- 
ing to the state at 2.19 Mev. At Ea=7.4 Mev (9=90°); 
no deuteron group corresponding to the presumed T=1 
state at 3.57 Mev is observed [(Br 53h) and C. P. 
Browne, private communication]. See also (Aj 52c). 
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XI. Li’ (p d)Lit Q„=—5.010 


Deuteron groups have been observed corresponding 
to the ground state, and to levels at 2.2-+0.2 and 3.7 
+0.2 Mev (Fr 52). At E,=18 Mev, angular distribu- 


. tions of the deuterons to the ground state and the 2.19- 


Mev level, analyzed by pickup theory, indicate J,=1 
and hence even parity, J S3, for both states (Re 54a). 
XII. Li(d/Li® O,,=—9.988 


cm 


The angular distributions of the tritons, analyzed by 
pickup theory, indicates l„=1, and hence even parity 
for the ground state (Ho 53c, Le 54) and for the 2.19- 
Mev state (Le 54). At E4=14.5 Mev, the cross sections 
at the maxima of the angular distributions are 32.4 
mb/sterad and 16.0 mb/sterad, respectively, for the 
ground state and the 2.19-Mev level [Levine, Bender, 
and McGruer, private communication and (Le 55) J. 


XIII. Li (He a)Li® ỌOm=13.321 


Alpha-particle groups observed at Æ(He)=700 to 
900 kev are listed in Table I1(6). It is suggested that the 
‘harrow states at 5.31, 6.63, and 8.37 Mev may have 
T=1. A broad continuum with a maximum near 
E,=6 Mev may indicate a T=0 state in this region 
(K. W. Allen and E. Almqvist, private communication). 


TABLE II(6). States in Li® from Li?(He? «)Lis.2 


Lis T (kev) Remarks 
0 
2.189 
3.56=£0.06 
4.3 40.2 weak, doubtful 
5.310.07 S100 (T=1) 
~6 A ~2000 not certain; (T=0) 
6.6340.08 S100 (T=1) 
7.40+0.10 ~1000 (T=0) 
8.370.08 . 100 not certain; (T=1) 


a K, W. Allen and E. Almqvist (private communication). 


XIV. Be(y Lit Qm=— 17.670 

Several recoil tracks, observed in Be-loaded emulsions 
irradiated with 31-Mev bremsstrahlung, are attributed 
to this reaction (P. Stoll and P. Erdös, private com- 
munication). 


XV. Be(pa)Li® Qn=2.132 
Qo=2.1260.003 [(Co 53); mag. spectrometer ]. 


In addition to the ground-state group, a group of 
a particles with Q= —0.064£0.005 Mev, corresponding 
to a level at 2.187 Mev is observed (Br 51f). Gamma- 
alpha coincidences with a short-range group correspond- 
ing to the 3.57-Mev level are observed at the 2.56-Mev 
resonance (Da 52). The fact that the ground-state and 
2.19-Meév state a particles do not exhibit resonance at 
E,=2.56 Mev is consistent with the assumption that 
the first two levels have T=0: see B? (Ma 54g). 
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The gamma-ray energy is reported as 3.57240,012 
Mev (not corrected for possible Doppler shift). The 
internal pair spectrum is consistent with a magnetic 
dipole assignment to the radiation, and hence with 
J=0, 1, 2, even parity, for the 3.57-Mev state (Ma 54b). 


XVI. BY (y a)Li® O,= —4.453 


See B”. 
Be’ 


(Not illustrated) 


Mass of Be! 


From the mass of He’ and the 3.57-Mev excited state 
of Li’, one can estimate the mass defect of Be® as 
20.8+1 Mev. 


I. (a) HE(He 2p)He! On=12.848 E,~10.8 
(b) He (He? pLi ~~ Qn=11.05 


The total cross section shows a monotonic increase 
for E(He*) = 100 to 800 kev. At E(He*) = 200 kev, it is 
at least 2.5 ub. Below E(He*)=350 kev, the cross 
section fits the simple Gamow exponential form. At 
higher energies, partial waves of l> 2 appear to be 
required (Go 54c). See also (Al 53a). 


II. A radioactive material of 0.4-sec half-life, possibly 
to be identified with Li‘ or Be’, is observed under bom- 
bardment of Li and Be with 50-Mey protons (Ty 54). 


He’ 
(Not illustrated) 


A rough extrapolation from higher A suggests that 
the first T=} state in Li’ is located at an energy of the 
order of 15 Mev above the ground state. This means 
He’—Li’~14.5 Mev; mass defect ~31.5 Mev. He? 
would then be unstable (by ~4 Mev) to decay into 
He’+n and (by ~3 Mev) into He+3n. See also 
(Pe 53c). 

No evidence is found for the Li? (n p)He? reaction 
(cross section<5 mb) for —1.0>Q>—7.0 (Fr 54b). 
From the mass defect suggested above, the Q for this 
reaction would be ~—14 Mev. Two other reactions, 
with very negative Q values, which lead to He? are 
Li’ (¢ He*)He? and Be®(n He*)He’. Neither reaction has 
been reported. 


Li? 
(Fig. 5) 
I. Lif(a m)Lis E,=7.245 el = 


The total neutron cross section has been measured for 
E,=0.035 to 4.2 Mev by (Jo 54f). A resonance is 
observed at E,=255+-10 kev with a maximum cross 
section of 10.3 b and-an observed width of 100 kev. 
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Li?+ x-x’ 


Fic. 5. Energy levels of Li”: for notation, see Fig. 1. 


The resonance is attributed to p-neutron formation of 
a state in Li’ at 7.46 Mev with J=5/2- and re- 
duced width 7,?=0.42 of the sum-rule limit [see also 
Li°(7 a)H*]. The cross section rises gradually from 1.3 
to 2.1 b for Z,=1.5 to 4.2 Mev (Jo 54f). 

The average total cross section shows a monotonic 
decrease from 1.9 to 1.7 b for E,=6 to 9.7 Mev 
(Ne 54c). At E,=14 Mev, the total cross section is 
1.39+£0.05 b (Co 52h). See also (Hu 53c). 

The large coherent thermal neutron scattering length 
suggests the existence of bound s-levels, possibly asso- 
ciated with the structure at 6.6 Mev (La 54a). 


I. Lif(z p)He® Qm=— 2.77 E,=7.245 

At E,=14 Mev, the cross section is 6.7--0.8 mb 
(Ba 53), 6+2 mb (Fr 54b). The angular distribution of 
protons exhibits a minimum near 0= 70° (Fr 54b). 


MI. Lif d)He® Qm=— 2.43 E,=7.245 


At En=14 Mev, the cross section for long-range 
deuterons (to the ground state of He®) is 89-+-10 mb. 
The cross section for short-range deuterons (continuum, 
Q=—4.3 to —6.7 Mev) is 779 mb. The angular dis- 


_tribut‘on indicates a pickup process (Fr 54b). 


IV. Li8(2a)H? Qn=4.780 E,=7.245 


Lit (n )Het 


-The isotopic thermal capture cross section is 930 b; 
upto £,~5¢ ev, the cross section varies as 1/v (Hu 52e). 


_ A strong resonance is observed at Z,=0.25 Mev with a 


distribution establish that the 7.46-Mev state 


Peek 


— a 


AND T. 


LAURITSEN 


is formed by p-wave neutrons and has J=5/2- [see 
(Aj 52c), (Jo 54f), and (So 53)]. Partial widths at 
resonance are I',=114, Ta=60 kev (c.m.); E,=7,79 
Mev (Jo 54f). See also (Co 53j) and (Sz 53). 

The cross section has been measured for E,<0.6 Mev 
by Blair and Holland [see (Hu 52e) ] and for Fin=0.88 
to 14.2 Mev by (Ri 52a) and (Ri 53). The cross section 
decreases monotonically above the resonance except 
for a broad hump at En~2 Mev. The cross section is 
32060 mb at Z,=1.5 Mev, 270440 mb at 2.0 Mey 
(We 54), 18826 at 2.49 Mev, and 264 mb at 14.2 
Mev (Ri52a). Angular distributions at E,=0.2, 0.27, 
0.4, and 0.6 Mev, reported by (Da 53b), are consistent 
with dominant s- and p-waves in this region. Angular 
distributions at E,=1.1, 1.5, 2.0, and 2.5 Mev are re- 
ported by (We 54) and at 14.2 Mev by (Fr 54b). See 
also (Ne 54a), (Si 53), and (We 53). 


VI. Lif(d p)Li’ Q,,=5.020 


Qo=5.019+0.007 [ (St 51) ; mag. spectrometer’. 
Qo=5.028+-0.003 [ (Co 53); mag. spectrometer]. 


The energy of the first excited state is given as 
4836 kev (Bu 48c), 475+3 kev (Co 53) from meas- 
urements of the proton groups, and 477.42 kev from 
the y-ray energy (Th 52, corrected for Doppler shift 
and broadening). At Ea~1.0 Mev, a level has been 
observed at 4.61+0.02 Mev (Ge 52). 

Angular distributions, at Ea=8 Mev (Ho 53c) and 
Ea=14.5 Mev [(Le 54) (Le 55), and Levine, Bender, 
and McGruer, private communication], of the protons 
to the ground state and to the 0.48-Meyv level, analyzed 
by stripping theory, show odd parity, J <5/2 for both 
states. The neutron capture probability is approxi- 
mately the same for both states, indicating that their 
reduced widths are about the same (Ho 53c). At 
Ea=14.5 Mev, the cross sections at the maxima of the 
angular distributions (~18°, c.m.) are 12.8 mb/sterad 
and 9.2 mb/sterad, respectively, for the ground state and 
the 0.48-Mev level (Le 55). At 22° (c.m.), the cross 
section for the formation of the 4.6-Mev state is 6 
mb/sterad. A state at 5.5 Mev has not been observed: 
at 22°, ¢<0.5 mb/sterad (Le 55). Angular distributions 
have also been measured at lower energies [see (Aj 52c) J. 
The angular correlation between protons to the 477-kev 
state and the y rays from its subsequent decay has been 
shown to be isotropic in the range Hz=0.4 to 1.0 Mev 
[(Sa 53a), (Th 53), and discussion in (Aj 52c)]. This 
observation strongly indicates J=4 for Li7*. A pre- 
liminary value for the half-life is 1.0X10-® sec (S. 
Devons, private communication). 


VI. Lista) Li? Q,,=0.988 
Qo=0.982+-0.007 [ (Pe 52); mag. analysis]. 


At E,=240 kev, the reaction has been observed to 
the ground state and to the 477-kev level (Pe 52). 
See also (Cu 53c). 
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Be VIL. Li? (y 2) Lis Qm= — 7.245 


The total cross section for neutron production (in- 
cluding the y, np and y, 2n processes) has a broad 
maximum (¢=4 mb) at E,~18 Mev. For the (y n) 

-, Process alone, the peak cross section is estimated as 

~2.7 tab at E,~14 Mey (Go 54b). Discontinuities 

are reported in the integral yield curve at E,=9.6, 10.8, 

s 12.4, 14.0, and 17.5 Mey (Go 54f). The sum of the 

integrated cross sections-of the resonances is 3 Mev-mb 

(Go 54{) while the total integrated cross section to 

E,=17.5 Mev is 20 Mev-mb (Mo 53c). It is suggested 

that the difference is due to a number of weak unre- 

solved levels or to a continuum of photon absorp- 
tion (Go 54f). See also (Pe 53c). 


VII. Li’(y p)He® Qm=— 10.013 


Ti According to (Ti 53b) and (Ti 54a), the cross section 
y in the range E,=12 to 20 Mev exhibits a single broad 
maximum at 15 Mev with a width of ~4 Mev and a 
peak crogs section=2.2-40.25 mb (see also Go 54b). 
Re-evaluated results of (Tu 53) [see (Ti54a)] are 
~not in essential disagreement with the above results. 
(Ru 54a) find two asymmetric maxima with peak cross 
sections of 8 mb in the range E,=10 to 30 Mev. See 

also (Tu 54). 


IX. Li(y)He! On=—2.465 


The excitation function, obtained from analysis of 
tracks in Li-loaded emulsions, appears to exhibit 
peaks at EZ,=4.7, 5.5, 6.8 [(St 53a), (St 54h), and 
(Er 54)], 9.3, 16.7(?), 21.5(?), and 23.5(?) Mev 

; (Ti 53b). Angular distribution studies suggest J= 5/2- 
EY or 3/2- for the 4.7-Mev level and J < 5/2 for the 5.5- and 
3 6.8-Mev leyels [ (Er 54) and (St 54h) ]. Absolute cross 
sections are 26.548 ub at £,=6.1 Mev (Na 52b), 
155 ub at E,=6 to 7 Mev (Ti 53), and 0.47-0.08 mb 

at 17.6 Mev (Ti 53). 


X. Li (e e’)Li™* 
Oi At E.=190 Mev, there is observed a broad electron 
a group believed to be due to a combination of elastic 
scattering from Li® and Li’ and inelastic scattering from 
the 0.48-Mev state, and two resolved inelastic groups 
from the 4.6 and 6.6-Mev states of Li? (G. Hutchinson 
and R. Hofstadter, private communication). 


XI. Lil (n n) Li" =>H+Het On=—2.465 

Analysis of 167 (He‘+-H®) tracks observed in Li- 
loaded photoplates irradiated with 14-Mev neutrons 
yield evidence for Li’ states at 4.6, 7.5, and 9.25(?) Mev 


(Al 54b). See also (Fr 54b). 


f XII. Lit (p p’)Li™ ‘ “A 

jasti oton groups corresponding to the states 
i at hae one m 6.56 Mev have been observed at bom- 
i barding energies up to 18.3 Mev eee AE 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


E,,= 96 Mev, proton groups corresponding to the ground 
state and the 0.48-Mev state (unresolved) and to the 
4.6-Mev level have been detected (K. Strauch and 
W. F. Titus, private communication). 


XI. Li’ (d a’) Lit* 


Inelastic deuteron groups have been observed to the 
states at 0.48 and 4.6 Mev [see (Aj 52c)]. At Ba= 14.5 
Mev, the angular distribution of the deuterons corre- 
sponding to the 0.48-Mev state is peaked at ~35°, 
c.m., dc/d2=7.7 mb/sterad, while the differential cross 
section of the deuterons from the 4.6-Mev state in- 
creases with angle in the interval 17° to 90°, c.m. At 
50°, c.m., do/d2=8.8 mb/sterad (Le 55). See also 
(Le 54). 


XIV. Li (e a’)Li™* 

Emission of 480-kev y radiation is observed in the 
range Ea=1.2 to 3.5 Mev [(Li54), (Te54), and 
(He 54b) J: see B". From the observed Doppler shift, 
it is concluded that the lifetime of the 0.48-Mev ex- 
cited state is <3 10- sec (Li 54). Using Poa particles, 
(Zh 54) finds E,=478+2 kev, and observes that the 
Doppler shift in Li? metal is very nearly the maximum 
possible. It is concluded that the upper limit for the 
lifetime of the first excited state is 1.3X10-% sec 
[see B°(n a)Li7]. At Ea=1.9 Mev, the angular dis- 
tribution of the radiation is isotropic within 10 percent 
(Li 54). 

At E,=31 Mev, a group with O=—4.8+0.2 Mey is 
observed, in addition to a continuum corresponding to 
Li’ («œ 2a) H? (Go 51a). (It is possible that the 6.6-Mev 
state is responsible for part of this continuum.) 


XV. Be7(e)LI7E 0,=0.863 

The decay proceeds to the ground- and 0.48-Mev 
states. The fraction to the excited state is given as 
0.1070.02 by (Wi 49b) and 0.118+0.02 by (Tu 49a) 
and (Tu 49c). The latter value has been corrected to 
0.123+0.006 by (Di 51). 

The y-ray energy is given (in kev) as 474-44 (Za 48), 
48545 (Ku 48), 476.7--0.8 (Ho49a, and Ra 49a), 
478.5+0.5 (Te49b), and 477.340.4 (P. Marmier, 
high resolution magnetic spectrometer; private com- 
munication). 

The half-life is 52.933-0.22 days (Se 49), 53.61--0.17 
days (Kr 53c). Log ft=3.33 for the ground-state transi- 
tion and 3.48 for the excited state. Both transitions are 
super-allowed (Ma 54h). 


XVI. Be’(n p)Li? On=1.645 
See Be’. 


XVII. Bey d)Li? On=—16.682 


Several tracks observed in Be-loaded photopiates 
exposed to 31 Mev bremsstrahlung are attributed to 


$ The symbol (¢) indicates orbital electron capture. 
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this reaction (P. Stoll and P. Erdös, private com- 
munication). 


XVIII. (a) Be(da)Li? Qm= 7.152 
(b) Bed 2a)H > Q,,=4.687 

The weighted mean of published Q-value determina- 
tions is 7.153-40.002 Mev (Va 54): this includes a recent 
value of 7.153+-0.004 Mev (Co 53; mag. spectrometer). 

The energy of the first excited state is given as 48243 
kev (Bu 48c), 478-44 kev (Co 53) from measurements 
of the a-particle groups. At Ea=0.47 to 1.0 Mev, 
three a-particle groups are observed, corresponding to 
the ground state, the 0.48-Mev state, and one at 4.62 
+£0.02 Mev (Ge53). A continuous distribution of 
a particles and iritons is also observed, which corre- 
sponds to the breakup of the 4.6-Mev Li? state into 
Heit H? and to the reaction Be®(d /)Be8* [(Ge 53) and 
(Cu 52) J. The fraction of alphas leading to the 0.48-Mev 
state varies from ~60 percent at low bombarding 
energies to ~40 percent at 1 Mev (Ge 53). 

At Ez=0.3 to 0.7 Mev, the combined ay and a; groups, 
corresponding to the ground- and 0.48-Mev states, are 
approximately isotropic (Re 51a). The (ary) angular 
correlation has been observed for E;=0.40 Mev (Ue 53) 
and Ey=0.84 Mev (Co 54e). There is no significant 
departure from isotropy, indicating J=4 for the 0.48- 
Mev state. 

At E£a=5.3 Mev, the second level is reported at 
4.59+0.1 Mev (As 52). At Ez=14 Mey, levels are 
reported at 4.76+0.15 and 7.540.17(?) Mev (Go 51a). 
See also (Ta 53b). 


XIX. (a) BY(na)Li? Q,,=2.792 
(b) BY°(7 2a)H? Q,,=0.328 

(a) Two groups of a particles are observed corre- 
sponding to the ground state and the 0.48-Mev state. 
With thermal neutrons, the fraction of transitions 
leading to the ground state is about 6 percent: see B1, 
The ionization and range of a particles from this reac- 
tion have been extensively investigated. See (Aj 52c) 
and (Ha 52i). 

The y-ray energy is 478.5+1.5 kev: study of the 
Doppler broadening in various materials yields a life- 
time of 0.75+0.25X10-* sec (E149). The angular 
correlation of y rays and a particles is isotropic, con- 
sistent ith the assignment 7=3 to the excited state 
(Ro 50). 

- (b) This reaction has been observed with E,=12 to 
20 Mev (Fr 53d). See also (Br 52d), (Co 53j), (Da 53c), 
(Je-54), (Sz 53), and (De 54). 


S 
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XX. BU (ya)Li Qn= 
See B”. - 


— 8.667 


ia 


7 CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


AJZENBERG AND T.. LAURITSEN 


Be’ 
(Fig. 6) . S 
I. Be7(e)Li? Qm=0.863 


The decay is complex; see Li’. 
TI. Het(a n)Be” Qn =—18.983 Í 

A search for this reaction at E,=39 Mev, 0.5 Mev 
(c.m.) above threshold, led to anegative result; the cross 
section is <0.7 mb. This result is consistent with the 
assumption that Be? has negative parity (Wa 52c). 
II. Li(p y)Be’ (Q,,=5.600 


c 


The gamma-ray decay is complex, involving the . 
0.43-Mev state and, possibly, other states as well. f 
Resonances for production of 0.42-Mev radiation occur `i 
at Ep=1.03 Mev (weak) and 1.7 Mev (strong: T~50 
kev). At £,=0.4 Mev, the cross section is ~1 ub 
[ (Er 54a) and private communication]. An upper limit 
for the capture cross section at Ly=1.82 Mev is 3 ub 
(J. M. Freeman, private communication). See also 
(Ba 54h) and (Ba 54i). = 
IV. (a) Li'(pa)He? O,,=4.016 E,=5.600 

(b) Lis(p p)Lié 

The weighted mean of Q values for reaction (a) re- 
ported in (Va 54) is 4.0230.002 Mev. 

The cross section for reaction (a) follows the Gamow 
function from E,=30 to 250 kev (Sa 53b). The yield of 
He? particles exhibits a broad, low maximum at 
E,=0.6 to 0.9 Mev and a pronounced resonance at 


8.83. ' 
Li’+d a 


=1645 | 
Li’+p-n J 


a 
Fic. 6. Energy levels of Be? : for notation, see Fig. 1. The super- 
allowed character of the decay of Be? to the J=} state of Li? 
indicates J <#- for the ground state. 
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E,= 1,820.08 Mev, where the differential cross 
section at 164° is ~8 mb/ster. In the range E,~=1.3 to 
3.1 Mev, the elastically scattered protons exhibit a 
single maximum, o=120 mb/sterad at 9=164°, at 


. £y=1.75+0.1 Mev with a width '=0.5 Mev (Ba 516). 


) 


The iow energy resonance is attributed to s-wave 
protons, forming a state in Be? with J=% or 4, even. 
This state corresponds both in location and in character 
to the (6.6-Mev) level*required to account for the 
strong 1/2 s-wave neutron absorption of Li®, The 1.8- 
Mev resonance is formed either by s or p waves. The 
parameters y? and Ey agree with those for the corre- 
sponding level in Li’ at 7.47 Mev only if ¿= 1 is assumed, 
and therefore J <5/2, odd (Ba 51e). 

It appears that the He? particles are peaked in the 
forward direction. For E,<0.9 Mev, the asymmetry 
increases with bombarding energy. This observation is 
consistent. with the interference expected from two 
states of opposite parity (Ba 51e). At E,=0.2 Mev, 
the alpha_particles are isotropic (Ne 37a). 


V. (a) Li8(d n)Be? On=3.375 
(b) Li8(d n)He+He! On=1.792 


At £4a=3.5 Mev, the angular distributions of the 
neutron groups, analyzed by stripping theory, indicate 
odd parity and J<5/2 for the ground and 0.43-Mev 
states. Reaction (b) is also observed (Aj 52). 

The y-ray energy is 428.9-+2 kev (corrected for 
Doppler shift and broadening). The difference between 
the Li’* and Be7* energies is 48.5-41.0 kev (Th 52). 
The y rays are emitted isotropically with respect to the 
neutrons at Eg=0.60 Mev. Since the neutrons are not 
isotropic with respect to the deuteron beam, it is con- 
cluded that, the 430-kev state of Be? has J=} [(Th 51f) 
and (Th 53) ]. The y rays are isotropic with respect to 
the deuteron beam at Ey=0.7 Mev (C152). A prelim- 
inary value of the half-life of the 0.43-Mev state is 
1.4X10-® sec (S. Devons, private communication). 


VI. Lit(p n)Be? Qn=—1.645 

A recent threshold measurement, based on the value 
of 411.770 kev for the Hg” y ray [crystal spectrometer 
measurement: (Mu 52b) ] is Æp= 1.8814+0.0011 Mev; 
when based on the value 1.3325 for one of the Ni® 
y rays [measured in terms of the proton magnetic 
moment: (Li53d)], the threshold is Æ,= 1.8797 
+-0.0011 Mev. The former value is in better agreement 
with earlier accepted values (Jo 54d). See also (Aj 52c). 

At E,>2.4 Mev, two neutron groups are observed, 
corresponding to the ground and the first excited state 
of Be’. The weighted mean value of five independent 
observations for the energy of the excited state is 43145 
key [see Table II(7) in (Aj 52c)]. A neutron thresh- 
old determination gives 430410 kev (Cook, Marion, 
and Bonner, private communication). At E,= 18.3 Mev, 
Be? levels are observed at 4.60.2 and 7.10.2 Mev 


(Th 52d). 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


LIGHT NUCLET Bet Bit 89 
VIL Be BI Om=—12.071 


See (Co 54b) and B”. 


VIII. BY Cp a) Be? 

The weighted mean of four ground-state O determina- 
tions is 1.148-£0.002 Mev (Va 54) ; this includes a recent 
value by (Cr 52c) of 1.147-0.0025 Mev. The weighted 
mean of five determinations of the energy of the first 
excited state gives 430.34:2 kev [see (Aj 52c) and 
(Cr 52c) |. A recent determination of the energy of the 
y ray from the first excited state gives 432+3 kev 
[ (Da 54); see also C!J. 

Work at E,=17.9 Mev indicates a-particle groups 
corresponding to levels at 0.4, 4.71+0.1, 6.29+0.1, 
and 7.08-+0.1 Mev. The alpha groups corresponding to 
the two highest levels are strongly peaked in the back- 
ward direction (J. B. Reynolds, private communication). 


Om =4.147 


Li’ 
(Fig. 7) 
I. Lit(@-)Be? On= 15.987 


The half-life is given as 0.825+0.02 sec by (Ra 51), 
0.89+0.01 sec by (Bu 53a), 0.875-£0.02 sec by (Br 53d), 
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Fic. 7. Energy levels of Li: for notation, see Fig. 1. 
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0.850.016 sec by (Sh 52), 0.84-£0.04 sec by (Wi 54f), 
and 0.841=-.0.004 sec by (K1 54). The decay is complex; 
see Bes. See also (Aj 52c). 


IT. List p)Li8 O,=0.800 
Qo=0.784+-0.015 Mev (Pe 52; mag. analysis). 


This reaction has been observed at E,=0.24 Mev 
(Pe 52) and 0.35 Mev (Mo 52). 


TIL. Li (n y)Liè Qm=2.037 


The thermal capture cross section is 33-5 mb 
(Hu 47a). See also (Th 51g) and (Wi 534). 


IV. Li (nn)Li? E,=2.037 


The coherent scattering amplitude (thermal neutrons) 
is negative: ceoh=^.804+0.05 bn, ototai=1.5+0.4 b 
(bound atoms, epithermal neutrons) [(Sh 51b) and 
(Hu 52e) ]. The cross section has a sharp peak, e~11.5 
b, at En=256 kev (Hu 54a). It rises monotonically 
from 1.2 b at E,=0.6 Mev to 2.0 b at E,=3.4 Mev 
(Bo Sic) and to 2.5 b at E,=4.2 Mev [see (Jo 54f)]. 
The average total cross section of elemental Li decreases 
monotonically from 2.1 to 1.6 b from E,=6 to 9.7 Mev 
(Ne 54c). The cross section of elemental Li has been 
measured at eight energies between E,=14.1 and 18.0 
Mev (Co 54f). At 14.1 Mev, the total cross section of 
Li’ is 1.453-0.03 b (Co 52h). 

Parameters for the resonance are: 


Eres= 260, Gmax—Opor= 7 b, T=45 kev (Ad 50a) 
Eres= 256, max—Fpor= 10 b, P=40 kev (St 51£) 


le rena ee pe lhe tes! ee 


The resonance is attributed to a state with J=3+, 
formed by p-wave neutrons (St 51f). The reduced width 
Yn is 0.15X3h?/2ua? (Vo 54). The angular distribution 
at and above resonance shows some backward scattering 
(H. B. Willard, private communication; see also Wi 54 
and La 54a). 


‘or 


V. Li (n n!)Li™* 


Pe at Os gat ya Si a ę Pa 


. The excitation function for 0.48-Mev y rays shows an 
abrupt rise from threshold and a broad ([~1 Mev) 
_ maximum at Z,=1.35 Mev. Near threshold, the be- 
havior of the curve indicates s-wave formation and 
s-wave emission. The resonance can be fitted on the 
sumption of incident s and d waves, J=1-, or incident 
$ waves, J=1*, superposed on a strong s-wave back- 
jund: ‘the latter assignment gives the better fit 


n d)He® Qn=—7.19 E,=2.037 

ev, the cross section is 9.8+-1.1 mb 
(Fr 54b). 
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VIII. (a) Li? (n t)He® On=—3.41 Hy=2.037 
(b) Li?(x)He!+2 Q,=—2.465 3 


The cross section for reaction (a) is 55+8 mb at 
En„=14 Mev. The angular distribution of tritons ex- 
hibits a forward peak (Fr 54b). The cross section for~ 
production of tritium [reactions (a) and (b)] is 7218 
mb for pile neutrons with an estimated effective nergy 
of 3 Mev, and 30+20 mb for Po— Be neutrons (Ma 54f). 
See also (Ba 54). ; 


IX. Li? p)Li§ Q,,=—0.188 


At Ea=8 Mev, the angular distribution of the long- 
range protons, analyzed by stripping theory, indicates 
even parity for the ground state of Li8(Ho 53c). At 
Ea=14.5 Mev, three proton groups are observed, 
corresponding to the ground state and to levels at 
0.974+0.015 Mev and 2.28 Mev. The angular distribu- 
tions of the protons, analyzed by the stripping theory, 
indicate 7,=1 and therefore even parity, J <3, for the 
ground state and the 0.97-Mev level. The cross sections 
at the maxima of the distributions are 26.1 mb/sterad— 
and 10.2 mb/sterad, respectively, for the formation of 
these states (0~10°, c.m.) [(Le 54) and (Le 55) ]. 


X. Be(y p)Lis Qm=— 16.871 
See Be’. 


XI. Be®(d He®)Li® Qn=—11.377 


This reaction has been observed at Ez=20 Mev by 
(Wi 54f). 


XI. BU a)Li® On=—6.630 
See B”, 
Be’ 
(Fig. 8) 
I. Be’>2He! (Q,,=0.096 


A Q of 94.5+1.4 kev is quoted by (Jo 53a). The 
weighted mean of this and previous measurements is 
941.3 kev (see Va 54). The half-life is <5X 10-4 sec 
[(Mi 53c); see C#(y a)Be®], <210- sec (Ho 52d). 
(Tr 54e; see B'(p a)Be’) finds Q=90+5 kev and a 
half-life <4 10 sec. See also (Al 53) and (Ed 52b). 


Il. Het(@a)He! E,=— 0.096 


Measurements of absolute differential cross sections 
have been made at 18 angles, with an accuracy of 5 to 
10 percent, for 10 energies between E.=12.88 and 
21.62 Mev. Phase shift analysis indicates states of Be® 
at 7.55+0.08 Mev (T=1.2+0.4 Mev, J=0+) and 
10.80.4 Mev (T=1.2+0.4 Mev, J =4+). The d-wave 
phase shift shows only a slow increase in the range 6 
to 12 Mev (c.m.) (St 53). Absolute differential cross 
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22.261 Lif+d 
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Fic. 8. Energy levels of Be*: for notation, see Fig. 1. 


i e also been measured with an accuracy of exhibits a strong angular dependence. The differential 
es eens at 33 angles, for Ea=22.4 and 22.8 Mev, cross section is 28-- 14 mb/sterad at 83° (c.m.) [(Gr 51) 
at 11 angles fon E,=21.6 Mev [(Ke 53), (Br 53a), and and (Gr 52) ]. See ¿lso (Ed 52b), Ola, 51a), and 
(St 53) ].. The scattering of 30-Mev a particles on He (Te 53). ~ 
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II. Lit y)Be> 0,,=22.261 
Not observed: see (Sa 53b) and (Si 54). 


IV. (a) Lis n)Be? Qn=3.374 
(b) Lid 2)He'+He® Q,,=1.791 
Resonances are observed at Eg=0.41 Mev (wide) 
and 2.12 Mev (sharp) (Ba 52). In the range E4a= 60 to 
450 kev, the cross section for Be? production follows 
the penetration function accurately up to Ea~200 kev 
and drops below it thereafter. The branching ratio to 
the 0.43-Mev state of Be’ remains substantially con- 
stant in this range (Hi 54b). 


Ey= 22.261 


V. (a) Li8(d p)Li? Om=5.020 E,=22.261 
(b) Lif@ p)He'+H? O,,=2.556 

Cross sections have been measured for Ez=30 to 
250 kev by (Sa 53b), for Ea<1.8 Mev by (Wh 50d) and 
for Ea=1.0 to 3.0 Mev by (Ni 54). In the low-energy 
range, the cross section follows the simple Gamow 
function, reaching a value of ~10 mb at Ea=200 kev 
(Sa 53b). The excitation function for the ground-state 
group shows a broad maximum near Ez=1.0 Mev and 
falls slowly to 3 Mev. The yield of short-range protons 
continues to rise slowly above Ea= 1.0 Mev [ (Wh 50d) 
and (Ni 54) ]. The maximum is interpreted by (Wh 50d) 
as indicating a level at Ez=0.4 Mev, T~0.5 Mev. 
The angular distributions at Ez=1.0, 2.0, and 3.0 Mev 
indicate stripping effects, with 7,=1 (Ni54). See also 
(Th 52) and (Sa 53a). 


VI. Lif(¢ d)Lié Ey= 22.261 
See Lis. 
VII. Lit(d ALi O,=0.75 E,=22.261 


In the range E4=0.4 to 4.2 Mev, the cross section 
increases from 50 to 300 mb (Ma 54j). See Li®. 


VIII. Lit(d He?)He® Q,,=0.83 
See He’. 


E,=22.261 


IX. Li§(¢ «)Het QOm=22.357 E,=22.261 


Qo= 22.375+0.014 (Ph 53; mag. spectrometer) 

Qo= 22.3962-0.042 (Co 53; mag. spectrometer) 
Cross sections have been measured for Ez=30 to 
250 kev by (Sa 53b), for H2=60 to 450 kev by (Hi 54b), 
and for Ea=0.2 to 1.6 Mev by (Wh 50d). Below 200 
kev, the curve follows the simple Gamow. function; 


“¢&8 mb at 200 kev [(Sa53b) and (Hi54b)]. At 


Ez=0.6 Mev, the cross s2ction exhibits a broad maxi- 

mum; correction for variation of the penetration factor 

yields a resonance at E4=347 kev, with '~520 kev 

[.(Wh'50d) ; see, however, (Hi 54b) ]. Angular distribu- 

tion data suggest a J=2* statu in this neighborhood. 
(Aj 52e). ` ; 
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X. Li8(¢2)He'+He! Qn,= 16.100 
See (Cu 52a). 
XI. Lis(He? p)BeS On = 16.768 


Qo= 16.60 (Ku 53b; scin. spectrometer) 


At E(He*)=720 kev, proton groups are-observed to 
the ground state of Be® and to an excited state at 2.87 
+£0.25 Mev. There is no evidence of any other states in 
Be! below £,~11 Mev (Ku 53b). This result- is con- 
firmed by (E. Almqvist, private communication) at 
E(He’) = 900 kev. 


XII. Li?(p y)Be’ Qn=17.242 


The cross section has been studied for E,»=30 to 250 
kev by (Sa 53b) ; it follows the Gamow function for this 
energy region. Resonances are observed at E,=441.1 
=£0.5 kev (Bo 48c, Fo 49b, and Kr 54a), 1.03 Mev 
(Kr 54a and Pr 54b) and 2.0 Mev (Pr 54b), 2.1 Mev 
(C. E. Mandeville, private communication). Resonance 
parameters are given in Table I(8). No resonances 
greater than 3 percent of the 440-kev resonance are 
observed from £,=1.8 to 5.2 Mev (target thickness 
18 kev) above the slowly rising background (Ba 52a). 

The course of nonresonant radiation can be accounted 
for by the assumption of a direct capture mechanism, 
not involving compound nucleus formation. At the 
higher energies, >1 Mev, it is assumed that a process 
inverse to the ‘‘giant-resonance,” familiar from electric- 
dipole induced (y n) and (y p) processes, is involved. 
A reasonable fit to the experimental curves is obtained 
with a radius of 3.2X10-* cm and a giant resonance 
located at 22 Mev with [~2.3 Mev. Except for inter- 
ference from the narrow resonances, the background 
radiation is expected to be isotropic and to proceed 
mainly to the 2.9-Mev Be! state (Wi 54a). 

The y radiation comprises two main components: 
E,=17.2+0.2 Mev (sharp) and £,=14.4+0.3 Mev 
(broad) with relative intensities which vary with bom- 
barding energy. Both components exhibit resonance 
at 0.44 Mev (Wa 48). At the 441-kev resonance, the 
14.8- and 17.6-Mev y rays are individually isotropic, 
within ~6 percent (De 50a, St 51c, and Na 52). This 
observation is not inconsistent with p-wave formation 
of a J=1* state [(De 50a) and (Ch 53d); see also 


TABLE I(8). Resonances in Li’ (p y)Be®. 


Be 


ol 
Ep (kev) (Mev) T (kev) Ge y2/ (3hi2/2pa) 
441.1+-0.5 17.63 12° 9.4> 
12.2+-0.5¢ 0.0644 

1030°-f 18.14 20 

2000! 19.0 
a (Fo 49b) and (Bo 48c). 
b (Fo 49b). 
e (Hu 52f). 
d (Vo 54). 
e (Kr 54a) 


a). n 
t (Pr 54b) and C. E. Mandeville (private communication). 
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Lil (p p)Li7]. The angular correlation of internally- 
produced electron-positron pairs at the 441-key reso- 
nance shows that the radiation (mainly the 17.6-Mev 
component) 1S magnetic dipole or electric quadrupole 
confirming the assignment of even parity to the 17.63- 


- Mev state (De 54b). 


Study of y—a coincidences at £,=0.44 Mev reveals 
three*a groups in addition to that resulting from the 
14.8-Mev transition (Bu 50h), corresponding to weakly 
excited levels at 4.2, 5.4 and 7.6 Mev. The associated 
y-ray Intensities are 1.80.3, 1.70.3, and 1.0+0.3 
percent, respectively, of the total radiation. In the plane 
perpendicular to the proton beam, the y—a angular 
correlations have the form (1-++0.85 cos’) for the 4.2- 
and 5.4-Mev levels and are isotropic for the 7.5-Mev 
level. For the first two, the results indicate a mixture of 
M1 and £2 radiation, J=2+, while the isotropy in the 
third case suggests J=0+ [(In 53a), (In 54), and E. 
Titterton; (private communication) ]. See also (Na 52) 
(Go 52a) (Ne 54d), (Ti 53a), and (Ti 54). 

The argular distribution off the 441-kev resonance 
is anisotropic and different for the 14.8- and 17.6-Mev 
y rays (suggesting different J for the ground and the 
2.9-Mev states), and contains a strong (cos@) term, 
indicating interference in the neighborhood of the 
resonance between the resonant (p) waves and other, 
nonresonant, waves of opposite parity [(De 49d), 
(St 51c), and (Na 52) ]. Angular distributions of the 
total y radiation have also been measured at and near 
the 1.03-Mev resonance. The results show anisotropy, 
and a study of the coefficients appears to indicate 
interference by nonresonant waves with at least two 
components of opposite parity (Kr54a). See also 
(Aj 52c), (Fo 54a), and (GI 54a). 


XIII. Li (p )Be7 Qn=—1.645 Hy=17.242 


The neutron yield indicates a resonance near thresh- 
old (Ne 54; see also Ba 51e and Bo 51b), and at E= 
2.25 Mev, r~0.2 Mev, omax—Smin= 0.25 b [(Ta 48) 
and (Ad 54) ], and Ep=4.89 Mev, T~0.4 Mev [ (Ba 52a) 
and (Bl Sia) ]. 

The angular distribution in the range Z,=2.0 to 2.5 
Mev exhibits a strong (cos#) dependence, with a coeff- 
cient which changes rapidly in the neighborhood of the 
2.25-Mev resonance, suggesting interference of entering 
waves of opposite parity (Ta 48). In an analysis by 
(Br 48a) it was assumed that the resonance is formed by 
s-wave protons with waves contributing a broad 
background. An additional odd-parity level just below 
the reaction threshold was required to account for the 
behavior of the cross section near threshold. (Ad 54) 
finds that a good fit to the resonant part of the cross 
section can be obtained by assuming p-wave formation 
of a J=3t level using equal reduced widths YPEY 
=(.8 103 Mev cm (2.5 percent of the sum rule limit) 
as derived from the observed width of the presumed 
analog 2.28-Mev level of Li®. A satisfactory account 
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of the nonresonant cross section and of the angular 
distribution requires two further states, formed by s 
waves, with J=1~ and 2~ and T=0 (opposite to that 
of the 3+ state) with undetermined location. A pre- 
liminary measurement of the polarization of neutrons 
produced at £,= 2.23 Mev is in agreement with calcu- 
lations based on these parameters (Ad 54). (Wi 54c) 
find a polarization of 0.50-40.04 at §=42° for neutrons 
in the energy range of 0.3 to 0.55 Mev; above and 
below these energies, the polarization decreases slowly. 
See also (Ad 54a) and (Er 54a). 


XIV. Li (p p)Li? E= 17.242 

The elastic scattering exhibits anomalies at Ep= 0.44, 
1.03, 1.9, 2.1, and 2.5 Mev [(Br 51a), (Ba 5ie), 
(Wa 53), and P. Malmberg, private communication }. 
Absolute differential cross sections are reported for 
7 angles in the energy range #,=0.36 to 1.4 Mev 
(Wa 53) and at 6 angles for E,=1.35 to 3.04 Mev 
(P. Malmberg, private communication). At £,=441 
kev, the observed scattering varies from 1.18 times 
Rutherford at 6=50° c.m. to 2.16 times Rutherford at 
6=160° (Wa 53). Analysis in terms of s-wave and 
p-wave phase shifts establishes that the resonance has 
J=1* and is formed by p-wave protons with channel 
spins 1 and 2 in a ratio of 1 to 5. This ratio may corre- 
spond either to a pure j— j state, with a p; proton 
(2,4): or a pure L—S state of the character °5; or 
3P\; in either case isotropy of the capture y radiation 
is guaranteed [ (Li 53a), (Ch 53d), and D. A. Liberman, 
private communication ]. 

At the #,=1.03-Mev resonance, the observed elastic 
scattering cross section varies from 1.08 times Ruther- 
ford at @=50° (c.m.) to 6.54 times Rutherford at 
0=160° (Wa 53). The analysis is in agreement with a 
1* resonance, again with a channel spin ratio of about 
1:5 (D. A. Liberman, private communication). A cusp 
in the elastic scattering cross section curve (six angles) 
at the Li7 (p )Be’ threshold appears as a distinct peak 
for 8=70° to 130° (four angles, c.m.) and as a cusp at 
150° and 167° (c.m.) (P. Malmberg; private com- 
munication). The structure near 2.1 Mev may be con- 
nected with the Li’ (p 7) Be*® resonance at 2.0 Mev; the 
magnitude of the cross section (at 166°) is compatible 
with a J=3* assignment, and it is suggested that this, 
rather than the 2.25-Mev resonance observed in 
Li’ (p ) Be’ is the T=1 state analcgous to the 2.28-Mev 
state of Li§ [see, however (Ad 54) ]. 


© 


XV. Lit(p p)Li™* Ey=17.242 = 
Li (p p’y) Li? 

A pronounced resonance appears in the yield of 
inelastically scattered protons [ (Br 5ta) and (Mo 544) ] 
and 477-kev y rays (Kr 54a) at E,=1.030-£0.005 Mev, 
T=168 kev. The absolute differential cross section for 
the inelastic scattering of protons has been measared 
at 10 angles near the resonance. It is nearly isotropic 
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at E,~1.03 Mev, whereas at higher energies it is peaked 
in the backward direction. The total cross section for 
inelastic scattering is 41.62-3 mb at Ep=1.05 Mev, 
35.4+2.8 mb at E =1.14 Mev, and 32.04:2.7 mb at 
Ep=1.24 Mev. The spherical symmetry at resonance 
and the asymmetry above it are consistent with either 
an s- or a p-wave resonance interfering with a non- 
resonant’ wave of opposite parity (Mo 54a). From 
E,=1.5 to 3.5 Mev, the differential cross section at 
@=164° rises smoothly except for a change in slope 
between 2.2 and 2.3 Mev (Ba Sie). 


XVI. Li’ (p d)Li® 
l See Li’, 


QOm=—5.020 E,=17.242 


XVII. Lil(pa)He! Qn=17.337 Ey=17.242 
f The weighted mean of experimental Q values is given 
i as 17.346+0.010 Mev by (Va 54). This value includes 
f two recent determinations: 17.344+-0.013 Mev [ (Fa 53) ; 
i} mag. spectrometer] and 17.352+-0.009 Mev [(Co 53); 
i mag. spectrometer]. 

The cross section has been measured for E,=30 to 
250 kev by (Sa 53b) and from Ep=0.5 to 3.75 Mev by 
(He 48e). The yield has a brea maximum at £,=3 
Mev [(He 48e) and (He 48g)]. Analysis of anes 
n distributions indicates a level ~1 Mev broad, possibly 
at E,=3 Mev, with J=2t, and a several Mev broad 
level of J=0+, underlying the region (In 48 and He 48e). 
= See also (Hi 51a), (Ta 51a), (Hi 52a), (Ca 53d), and 
= (Al 54d). 


C XVI Li(@mBe* Q,,=15.017 


A study of the neutron groups at several angles by 

means of a neutron spectrometer indicates a single 

= broad level at E,=3.0 Mev below E,~8 Mev super- 
= posed on a continuum attributed to the 2Het+7 
t breakup. Neutron groups corresponding to an excita- 
ie tion of 4 or 5 Mev would have been resolved if they had 
ten percent of the intensity of the ground-state group. 
A group leaving Be® with an excitation of 7.5 Mev would 
have been observed if it were twenty percent as intense 
_ as the ground-state group [ (Tr 54b) and C. H. Johnson, 
private communication]. Work with photoplates has 
indicated levels at 2.3, 2.9, 4.1, 4.9, and 7.6 Mev [see, 
for instance, (Tr 53a) ] as well as states at 10 Mev 
(Ri 41), 11.1, and 14.7 Mev (Wh 50c). A 4.90.3 Mev 
y ray which had previously been assigned to Be® is now 
believed to be from the first excited state of C! [see 


~ 


Æ I1(8). Slow neutron thresholds in Li?(¢7)Be® (Bo 54c). 


Be (Mev) T (kev) 


Q (Mev) 


16.72 190 

—2.63 17.65 <20 
—2.8 17.8 100 

i „18. 19 170 
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C®(n n)C2*]. At Ea=0.68 Mev, upper limits for the 
production of 5-, 6-, and 7-Mev y radiation are <2 mb, 
<1 mb, and <0.5 mb, respectively (Si 54). 

Thresholds for slow neutron production observed by 
(Bo 54c) are given in Table IT(8). It is suggested that 
the prominent peak at Ha= 2.18 Mev corresponds to the 
first T=1 level of Be® [see C? (y a)Be®] and that the 
sharp rise at Za=3.37 and broad maximum at E4= 4.07 
Mev correspond to the 0.44- and 1.03-Mev resonances 
observed in Li’ (p y) Be’. The fact that the slow neutron 
yield decreases only rather slowly after the rise at 
Ea=3.37 Mev is attributed to a possible threshold at 
Ea=3.6 Mev (Bo 54c). 

A search for pairs in the range E= 5.0 to 8.5 Mev 
at Ea=0 to 330 kev led to a negative result; an upper 
limit for the cross section is 0.02 ub (Bent, Sippel, and 
Bonner, private communication). See also [(Tr 52a), 
(Ca 53e), (Th 53), (Re 54b), and (Tr 54d) ]. 


XIX. Li (He d)Be® Q,,=11.748 
See (Ku 54) and (Mo 54d). 


XX. Be? (x p)Li’ Qn=1.645 £,=18.887 
Be? (n a) Het On= 18.983 


At thermal neutron energies, proton emission is more 
probable than alpha emission. This result is consistent 
with the odd parity of Be’. The (n p) cross section is 
50000 b (Ha 53f). (Ad 54) estimates a theoretical 
upper limit for this cross section of 18 500 b, assuming 
a single resonance. 


XXI. Li8(@-)Be® Q,,=15.987 


The a-particle distribution indicates decay through a 
state at 3.1 Mev, r=0.8 Mev (Bo 48), 2.98+0.06 Mev, 
T=1.26 Mev (W. Whaling and C. W. Li, private com- 
munication), and 2.9 Mev, T=1.2 Mev (Gi54). In 
addition to decay through the 2.9-Mev state, (Fr 54d) 
indicate a possibility of decay through states.of Be® 
at 4.3 and 7.4 Mev. The $ spectrum indicates about 90 
percent of the transitions going to the 2.9-Mev state 
and less than 2 percent to the ground state (Ho 50). 
The shape of thea and £ spectra suggest the involvement 
of states of higher excitation in Be. 

Thea—B angular correlation is isotropic within 3 per- 
cent for all fractions, Eg=0.1 to 0.9 Eg(max); in the 
correlation expression 1+4 cos, A=0.01+0.03 
(Ha 54d). For Hs=9.8 and 7.5 Mev, (Bu 53a) reports 
A=0.04+0.2 and 0.12+0.09 respectively. An isotropic 
distribution is consistent with a spin of 2+ for Lis and 2* 
for the 2.9-Mev Be® state; a spin of zero for the 2.9-Mev 
state would also guarantee isotropy. Gamma-beta 


coincidences to the extent of 2+1 percent are reported 


by (Ve 51b) and (Ve 52a). (Bu 53a) reports an upper 
limit of 0.8+-0.3 percent on the number of disintegra- 
tions leading to 4.9-Mev y radiation. Logft for the 


transition to the 2.9-Mey state is 5.60 (Fe 51b). See | 


also (Ga 51a), (Cl 53), and (Wi 53i). 


See 


E a ee 
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© XXII B(6*)Be® 0,,=17.8 


It appears that B? decays mainly to the 2.9-Mev level 
of Be g log fl=5.54 (Ki 52); the a-particle spectrum is 
similar to that of Li® [(Al 50g) and (Gi 54)]. 


XXII. Bey n)Be? 0,,=—1.666 


The weighted mean of three determinations is 
Q=—1.665+0.002 Mev(Va 54); this includes a recent 
value of —1.662+0.003 Mev by (No 54; neutron 
threshold). At ZE,=6 Mev, most of the neutrons leave 
Be? in the 2.9-Mev state [(Ca 54e); see also Be’. 


XXIV. Be’(pd)Be®? Q,,=0.559 


The weighted mean of eight Q-value determinations 
is 0.559+0.001 Mev (Va 54). This includes a recent 
value of 0.560+0.003 Mev by (Co53; mag. spec- 
trometer). 

For Ep=5 to 8 Mev (Ha 51a), 18 Mev (Re 54a), 
and 22 Mev (Co 53e), the ground-state deuterons are 

m Strongly peaked forward. The angular distribution at 
E£,=18 Mev has been analyzed by pickup theory by 
(Re 54a) who find /=1 in agreement with odd parity 
for Be and even parity J<4 for Be’ [see also B® and 
(Da 54a) ]. At E,=8 Mev, peaks observed in the spec- 
trum of charged particles have been tentatively at- 
tributed to deuterons leading to levels in Be® at 2.9, 
4.0, and 5.1 Mev (Ar 52e). See also (St 53d). 

At E,=31.5 Mev, deuteron groups corresponding to 
the ground state, the 2.9-Mev level, and a group of levels 
near 17 Mev have been identified [(Be 54c) and J. 
Benveniste, private communication ]. 


XXV. (a) Be(d i) Be? On=4.591 
(b) Be?(d t)He'+Het Qn=4.687 


The weighted mean of five Q-value determinations for 
reaction (a) is 4.598-+0.012 Mev (Va 54). This includes 
a recent value of 4.60-+0.03 Mev obtained by [ (Ca 52b) ; 
photoplate]. 

At Ea~1 Mev a pronounced triton group correspond- 
ing to the ground state is observed in addition to a 
continuum of a particles, corresponding to formation 
of the 2.9-Mev state of Be® and its subsequent breakup; 
see also Be?(da)Li’. The excited state is formed about 
twice as frequently as the ground state. The three-body 
reaction (b) appears not to occur [(Cu 52), (Cu 53), 
and (Ge 53); see also Li’]. i ; 

The angular Sees O m 
reported by (De52b): Ea=0.15 to 0.62 Mev, by 
(Re 51a): Ba to 0.7 Mev, by (Ju 53): Ea=1.16 
Mev, by (Cu 53): Za=1.3 Mev, by (Fu 52): Ez=3.6 
Mev, by (E151): Ea=7.7 Mev, and by (Ho 53c): 
Eqz=8 Mev. The distributions exhibit a pronounced 
forward bunching, suggesting that a pickup process 
occurs, with an / transfer of 1 [see e.g. (Ho 53c)]. 


See also B! and (Ba 53c). 
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XXVI. Be?(He? a)Be? Q,,= 18.900 


At E(He’?)=900 kev, alpha-particle groups are ob- 
served corresponding to the ground state (weak) and 
to the 2,9-Mev level (Strong, broad) (E. Almqvist, 
private communication). See also (Mo 54b). 


XXVII. (a) BY (y d)Be**>He'+Het On=—5.930 
(b) B”(y np) He!+ Het On= —8.155 
Reaction (a) is believed to occur and to involve 
excited states of Be’. The ground-state transition is not 
observed at E,=15 to 18 Mev. Reaction (b) hes been 
observed (Er 53). See also (Ge 53a), (Mu 52a), (Aj 52c), 
B" (y 1) Be, and BY. 


XXVIII. B (7 1)Be* On,=0.232 
See (Ja 54), (Pe 51b), and (Ri 54a). 


XXIX. B"(p He’)Be* 0,=—0.532 
Qo= —0.536£0.003[ (Cr 52c) ; electrostatic analyzer |. 


XXX. (a) B"(da)Be? Om= 17.809 
(b) B°(da)He!+Het Qn= 17.905 


The weighted mean of four experimental Q values is 
17.86+0.04 Mev (Va 54); this includes two values of 
17.91+0.06 by (Tr 53; ion chamber), 17.87-0.06 Mev 
by (Cu53b; photoplate). A recent value is 17.829 
+0.010 Mev by (El 54; mag. spectrometer). All ob- 
servers agree that transitions occur to the ground state 
and to a state at 2.7340.2 Mev [ (Wh 51b); P=0.95 
+0.20 Mev], 2.88+0.08 Mev (Tr 53) and 2.8740.08 
Mev [(Cu 53b), (Cu 54); T=0.940.2 Mev]. 

(Tr 53) finds that the shape of the æ spectrum 
(omitting the ground-state peak) can be fitted by a 
Breit-Wigner single level formula: J = 2* gives a slightly 
better fit than J=0: the reduced width 77=13.4K10-® 
Mev-cm [of the order of 2 times the sum-rule limit], 
E,=5.29 Mev, with J=2, a=4.48X 10- cm. An excess 
of a particles in the experimental spectrum in the region 
corresponding to E,~7 Mev may be due to BY(d 3a) 
or to a broad resonance. There is no evidence in this 
work for further peaks (E4=0.6 to 1.07 Mev; Tr 53). 

At Ez=1 Mev, @=0°, 30°, 60°, 90°, and 120°, groups 
are reported corresponding to states at 4.1, 5.1, 6.2(?), 
7.5, 9.2, and 11.5 Mev in addition to the ground state 
and 2.9-Mev state groups [ (Cu 53b), (Cu 54), and J. J. 
Jung, private communication]. (Ho 54e), on the other 
hand, find no evidence for any excited states below 10 
Mev except the broad 2.9-Mev state. Observations 
were made with a magnetic spectrometer at several i 
angles, with E4=2 to 3.2 Mev. The three-particle | 
reaction (b) has not been observed. At Ea=1 Mev itis — 
less than 5 percent of the intensity of reaction (a) — 
(Cu 53b). . Spal 

At Eg=0.6 Mev, a search has been made for a-y 
correlations from a 4.9-Mev state in Be*. The 


limit of the alpha group intensity to such a state 
* i 


ga 
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percent of the intensity to the ground state and the 
2.9-Mev level (Th 53). See also (Ju 53a). 


XXXI. (a) B" (y {Bes Om=— 11.227 
(b) B" (y )He!4+-He! Qm=— 11.131 
These reactions have been observed in boron-loaded 
photographic emùlsions. Reaction (a) is stated to in- 
volve states in Be at 0, 2.2, 2.9, 3.4, 4.0, and 4.9 Mev. 
Reaction (b) has been observed (Er 53). See also B", 
(Mu 52a) and (Aj 520). 


XXXII. B'(pa)Be’ Qn=8.575 


A recent Q-value determination is 8.589=+0.005 
L(Co 53); mag. spectrometer]. The weighted mean is 
8.585=£0.006 (Va 54), 8.579+0.009 Mev (Jo 53a). A 
value of 94.5-1.4 kev is found for the breakup energy 
of Be’ produced in éhis reaction (Jo 53a). 

Alpha groups corresponding to states at 0, 2.2, 2.9, 
3.4, 4.0, and 4.9 Mev are reported by (Gl 53). Work 
with a high-resolution magnetic spectrometer sets an 
upper limit of 10 percent for any such groups other than 
the ground state and the broad 2.9-Mev level [ (Ma 53g) ; 
E,=1.98 and 2.61 Mev; see also (Be 53) ]. The energy 
of the excited state, is given as 2.94-L0.06 Mev, T=1.1 
Mev [(Li51) and W. Whaling, private communica- 
tion ]; r=0.84 Mev (Be 53). For E,=2 to 3.2 Mev, no 
levels are observed between the ground state and 
E,=10 Mev except for the 2.9-Mev state (Ho 54e). 

The ground-state œ particles are strongly anisotropic 
at the E,=0.16 Mev (C#*: J=2*, T=1) resonance. 
It is thus unlikely that the ground state of Be! has 
J=2 (Th 52c). The angular correlation of œ particles 
leading to the 2.9-Mev state with those resulting from 
the break-up of Be’*, observed at Ep=163 kev, is 
consistent with the assignment J=2+ but not with 
J=0* (Ge 54a). The fact that the œ particles leading 
to the 2.9-Mev state are 50 times as intense as the 
ground-state group may indicate a difference in the 
isotopic spin purity of these two states (Be 53). Study 
of the directional correlation of successively emitted 
æ particles indicates J=0 for the ground state of Be® 
(Tr 54d; see Be®—>2He'). See also (Hu 53, Gl 54, 
Gl 54a, and Gl 54b). 

At £,=7.9 Mev, nuclear pairs are reported with 
E,~7 Mev [ (Ph 51b) ; see however Li’(d n) Be®]. 


XXXII. (a) C2(yo)Bee Qn=—7.374 
(b) C2(n n’)3Het Qn=—7.278 


* The cross section for reaction (a) exhibits pronounced 

< peaks at E,~18 and ~28 Mev (possibly with fine 
“~Siructure; see C”). For energies less than 20 Mev, the 
process proceeds predominantly, more than 75 percent, 


= yia the 2.9-Mev level of Be® [(Te 50c), (Mi 53c), and 


. 


(Wi55)]. No significant deviation from isotropy is 
found in the angular distributions by (Mi 53c). (Te 51b) 
and (Ed 52) find, on the other hand, that the shape of 
the a spectrum indicates some correlation and suggest 


p3 
a ee y 


AND T. 


LAURITSEN 


that the transition involves #2 and M1 absorption, 
J=2* for the Be’ state. The small participation’ of the 
ground state suggests J=0t. A J=1* state of C! 
formed by M1 absorption, cannot disintegrate via a 
J=0* state of Be’. (Wi 55), analyzing some 2500 stars, 
find that the angular correlations indicate #1 and E2 
absorption (see C!) with little or no contribution from 
M1 except for E,<15.6 Mev. It is suggested that the 
dominance of the 2.9-Mev state may be explained by a 
relatively small admixture of 7=1 in this state. There 
are indications of participation of Be’ states at 4.00.1, 
6, 10, and 15 Mev, particularly for Z,=25 to 26 Mev 
(Wi 55). 

Analysis ofs200 stars produced by y rays of energy 
> 26 Mey, indicates that about 2 percent involve transi- 
tions through the ground state of Be®, 10 percent states 
in the range 3 to 16 Mey and 88 percent states near 
17 Mev. Two levels of Be® are reported in this region, 
at 16.8+0.2 and 17.60.2 Mev, both with T<0.3 Mev: 
a third level, at 16.4 Mev, may diso exist. Angular cor- 
relation experiments indicate electric dipole absorption 
and J= 2+ for the 16.8-Mev level and J=2* or possibly 
J=0+ for the 17.6-Mev level. An upper limit of 4 is set 
for the probability of y emission for both levels. The 
observation that Hi absorption dominates, and the 
high fraction of transitions to the 17-Mev states sug- 
gests that they have T=1 [(Wi53e), (Wi55), and 
J. J. Wilkins, private communication; see also 
(Ge 53a) ]. [No evidence for a state near 17.6 Mev, 
other than the J=1+ state, is seen in Li’(p #)Li’; see 
(Wa 53).] The lifetime of Be’, obtained from examina- 
tion of C2(y a)Be® stars, is <5X 107 sec (Mi 53c). 

For reaction (b), see C”. See also (Da 53d), (Li 53b), 
and (Gl 54b). A 


XXXIV. O!8(y a)C?*—Be®+He? Qm=— 14.522 


At E,~22 Mev, the reaction appears to proceed 
mainly via the 9.6-Mev state of C” to the ground state 
of Be’. For E,>24 Mey, transitions through a 16-Mev, 
T=1 state of C¥, to the 2.9-Mev state of Be! appear to 
dominate. See (Ge 53a), (Hs 53), (Mi53c), (Li53b), 
(Li 52c), (Go 52a), and (Da 53d). 

B8 
(Not illustrated) 
I. B&8(6t)Be® Q,=17.8 

The half-life is 0.46+0.03 sec [see (Ho 53a) ], 
0.61+-0.11 sec (Sh 52). The decay is complex; see Be’. 
Il. B° AB Qm=— 18.3 

The threshold is Æ, <21.2 Mev (Al 50g). 


The reaction Li*(He* n)B8 (Q,,=—1.8 Mev) is not 
reported. 


=- CC-O. In Public Domain. Gurukul Kangri Collection, Haridwar 
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4 Li’ IV. C2 (+ 3p) Li’ 
See (Re 53). 
The following reactions are not reported: LiT ( p) Li’ 

(On=—2.9), B&(n pyL? (Omn=—13.3), BEU HAL? 

(On= —14.1), and Bi (n He) Li? (0,= — 23.6). 


Ona ~47.3 
(Not illustrated) 
I. Li(G-)Be*Bet+n 0O,=12.4 


Li? decays to excited states of Be which decay by 
neutron etission. The half-life is 0.168-+0.004 sec 


& * (Ga 5tc), 0.170-£0.005 sec (Ho 52b). See also (Sh 52) BE 


5 


and (Fr 53a). 


II. Be(d 2p)Li? 


(Fig. 9) 


nm 


TOS 


The threshold is 1941 Mev (Ga 51c). 


I. (a) Li DLI 
(b) Lif(t p)Li® 
(c) Li®(t m) Be? 
(d) LIS a)He® 
(e) Lit (t 2)Het-+ 


+ Het 


On= 0.988 E,=17.670 
On= 0.800 
Om= 16.004 
Om= 15.15 
On= 16.100 


II. B'(y 2p)L? O,=—31.4 
See (Sh 52). 
J 
az ALL 
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AA alfin 
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At £,= 240 kev, 6=90°, the ratio of the long-range to 
the short-range deuteron group is 54:14. Protons from 


Ga 


f, Mf f. 7 


Fic. 9. Energy levels of Be: 
for notation, see Fig. 1. There 
Whe is no evidence for the state at 
x 19.2 Mev. 
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reaction (b) nave intensity 2 on the same scale (Pe 52). 
A continuum of æ particles from reactions (b), (c), (d), 
or (e) is also observed (Pe 51). See also (Mo 52). 


I. Lif(a p)Be  On=—2.133 


Not observed: see BY, 


IM. Liv(d p)Li® Q,=—0.188 E,=16.682 


' Absolute cross sections have been measured for 
Ea=0.7 to 3.3 Mev by (Ba 54a). Resonances are ob- 
served at 0.80 and 1.04 Mev with peak cross sections of 
0.15 and 0.12 b, respectively. Above 1.1 Mev the cross 
section rises monotonically, becoming essentially con- 
Stant at 0.16 b above 1.8 Mev. The abrupt change of 
slope at 1.4 Mev, attributed by (Ba 52) to a resonance, 
is not confirmed, and it is suggested that the behavior 
of the cross section in this region reflects the increasing 
importance of the stripping process. Absolute cross 
sections reported by (Ba 54a) are about a factor of two 
lower than those of (Ba 52). 


: IV. (a) Li?(d n)Bes QOm=15.016 E,=16.682 
, (b) Li?(da)He® Om= 14.16 
(c) Lil 2)He'+He! O,,=15.112 


The cross section for reaction (a) has been measured 
for Ea=70 to 110 kev by (Ra 54b). It follows the 
Gamow function as does the cross section for reaction 
(c) in the range Ez=30 to 250 kev (Sa 53b). Resonances 
for neutrons occur at Ey=0.68, 0.98, and 2.1 Mev; 
the cross sections are 39, 43, and 58 mb/sterad, and the 
widths are 0.25, 0.060, and 0.4 Mev, respectively 
(Ba 52). See also: Be? and Heë and (Tr 53a). 


aiei a 


V. (a) Li’ (d d)LiT 
(b) Li” (d i) Lis m= — 0.988 
(c) Li (d He*)He® Qn=— 4.52 


The cross section for reaction (b) is ~100 mb at 
E4a=2.5 Mev (Ma 54a). For reactions (a) and (c), see 
Li’ and He’. 


Ey= 16.682 


rept. a Se eee ye a yp TN 


VI. Li (He? 4)Be® Q,,=11.189 


At E(He*)=720 kev, proton groups are observed 
corresponding to states in Be? at 1.8, 2.4, 3.2, and 4.9 
Mev; the last of these would appear to be 0.5-1 Mev 
broad (Mo 54d) and (Ku 54). These states have also 

_ been observed at E(He*)=9N0 kev by K. W. Allen and 

Ë. Almgvist (private communications) who find the 
states at 2.4 and 3.2 Mev to be sharp (T <100 kev). 
urther levels were observed up to Ez~9 Mev. 
indications of several low-lying states in Be’ are 


jntermediate coupling [(In 53) (the *P level in 
. 9 of In 53 has been drawn 4 units too low, D. R. 
ate communication). ] , 


a Sy + 


= CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 
b J ra = e . 


VII. Li9(6-)Be*Be§+n Qm=12.4 
See Li’. 


VIII. (a) Be®(y 7) Be® On=— 1.666 
(b) Be®(y a)He® Ly) 
(c) Be(y n)He'+He* Qn=—1.570 - 


The cross section for neutron production exhibits 
a peak of 1.0 mb at E,~1.7 Mev, and a minimum near 
E,=2.2 Mev rising to ~0.7 mb at 2.76 Mev (Gu 49a). 
At E,=2.185 Mev, c=0.39-+0.06 mb (Ha 53b). The 
curve passes through a broad maximum near 10 Mev, 
o~1.6 mb, falling to ~0.8 mb at 17 Mev and rising to 
a peak of ~3.0 mb at 22 Mev (Na 53a); see also 
(Jo 53f). At Z,=6.3 and 8.1 Mev, the cross sections 
are 1.14+0.1 and 1.38L0.16 mb, respectively [ (Wi 54b) 
and R. D. Edge and D. H. Wilkinson, private com- 
munication) ]. The course of the cross section and 
angular distribution in the range Z,=1.7 to 3 Mev are 
accounted for by a single-particle model, assuming 
electric dipole transitions to an S, level -~0.1 Mev 
below threshold and a broad D; level at an energy >2.5 
Mev (Gu 49a). 

At E,=6.1 Mev, the cross section for neutron pro- 
duction is given by (Ca 54e) as 1.2540.12 mb. The 
cross section for production of œ particles of energy 
greater than 0.4 Mev is 1.28-0.25 mb, indicating that 
less than 20 percent of (y n) processes at this energy 
involve the ground state of Be’. From the coincident 
a spectrum, it is concluded that the main processes are 
Be (y 2) Be’* (2.9-Mev state) and Be? (y «)He® ground 
state, produced in ratio 1.2 to 1 (Ca 54e). 

The cross section for reaction (c) is <1 mb at 
E,=1.63 Mev (Al 52a). 


IX. Be(y p)Li® QOn=—16.871 


The integrated cross section to 26 Mev is 13 Mev-mb. 
The yield has a broad maximum, T=4.7 Mev, at 
E,=22.2 Mev where c=2.72 mb (Ha 53). 


X. (a) Be(yd)Li? 0,=—16.682 
(b) Be(y)LI® 0,,=—17.670 
(c) Be(ya)He’ O,=— 2.52 


The integrated cross section for reactions (a) and 
(b) is <0.1 Mev-b for E, to 31 Mev (P. Stoll and P. 
Erd6s, private communication). For reaction (C), see 
Be? (y 1) Be®. 


XI. Be (e e’)Beo* 
At E.=190 Mev, inelastic electron groups are ob- 


served corresponding to levels of Be? at 2.54 and 6.96 
Mev (Mc 54a). 


XII. Be(n 2’)Beo* x 
See (Gr 51a) and (Sc 54c). 


bod 
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XIII. Be(p p’)Be™* 

The first inelastic proton group locates a level at 
2.43340.005 Mev [(Br51f); see also (Aj 52c)]. No 
other group is observed for E,<6 Mev (Ar 52e) (weak 
groups corresponding to levels at ~1.6 and ~3 Mev do 
mot appear to be excluded by this spectrum), At 
E,=7.1 (Da 52e) and 7.5 Mev (St 53d), the inelastically 
scattered protons to the 2.4-Mev state are peaked in 
the forward direction. The width of the 2.4-Mev state 
is <3 kev (Br 51f). 

At £,=31.5 Mev, proton groups are reported corre- 
sponding to the ground state and states at 2.50.2, 
6,840.3, and 11.6-+0.4 Mev by (Br 52c), and to states 
at 2.45, 5.0, 6.8, 7.9, 11.3, 19.9(?), and 21.7(?) Mev by 
(Be 54c) and J. Benveniste (private communication). 
There is no indication in this work of levels near 1.8 or 
3.0 Mev. 


XIV. Be(d ad’) Be* 
At £y=14.5 Mev, deuteron groups have been ob- 


served to the ground state and to the 2.4-Mev state 
(J. N. McGruer, private communication). 


XV. Be (a a’) Be* 


At E,=21:7 Mev, a group corresponding to a level 
at 2.63140.15 Mev is observed (Mc 51b). 


XVI. (a) BY(y p) Be On=— 6.585 
(b) B°(y p)Bes+n Qn=—8.250 
Reaction (a) has not been reported. For reaction (b) 
see Be® and B". 


XVII. BY(nd)Be O,=—4.359 


Deuteron %roups corresponding to the ground state 
and to an excited state at 2.49--0.12 Mev have been 
observed at Z,=14 Mev. No other deuteron groups 
were detected below E:~5.5 Mev. In particular an 
upper limit of ~15 percent is given for a possible group 
leading to a state at ~1.5 Mev. The angular distribu- 
tion of the deuterons, analyzed by pickup theory, indi- 
cates an J=1 transfer and odd parity, 3 <J <9/2 for 
both states (Ri 54a). See also B". 


XVIII. BY(ta)Be® Qn=13.217 


Alpha groups are observed to the ground state and 
to levels at 1.73-£0.08, 2.39-£0.08, 3.10+0.09 (T <0.1 
Mey), and 4.74+0.08 Mev (T~1.4 Mev) (K. W. Allen, 
private communication). 


XIX. Buda)Be? Qn=8.016 
Qo=8.018-+£0.007 (Va 51; mag. spectrometer) 
Qo=8.029+0.005 (El 54; mag. spectrometer). 
d 
Alpha groups have been observed to the groun 
Bec an excited state at 2,422£0.005 Mev 
(Va 51: T<7 kev), 2.431+0.006 Mev (El54). At 
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Ea=1.51 Mev, no other groups were obsérved which 
would correspond to states in Be? below 5 Mev; the 
upper limit to their intensity is 10 percent of the _ 
intensity of the group corresponding to the 2.4-Mey . 
state (Va 51). The 2.4-Mev state decays mainly by 
neutron emission (Di 52). 


XX. C#(y a) Bes 
See (Mi 53c). 


On= — 10.656 


B? 
(Fig. 10) 
I. Li (He p)Beè On=16.768 Es=16.583 
See Ber 
Il. Be(pn)B? On,=—1.851 
Qo= —1.8520.002 (Ri 50e; neutron threshold) 


The width of the ground state is <2 kev (St 51e). 
At £,=6.59 Mev, a neutron group is observed corre- 
sponding to a level at 2.37-40.04 Mev. A continuous 
distribution of neutrons, attributed to the (p pn) reac- 
tion, is also observed (Aj 53). Neutron threshold meas- 
urements indicate a narrow state at 2.37-40.010 Mev 


Ei ti \ 
-1088 | Be Be'+ p a) 
Be?’ + Het |-1.85) ' 
Be’+ p-n =2.179 
P nosa B+ d-t Se 
-3.983 ©- -6.211 | am 
6 u ie 
Li’+a-n B+ p-d_ 
-7.559 i as 
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and, possibly, a broad state ((~1 Mev) at ~1.4-Mev 
excitation [(Ma 54c) and Cook, Marion, and Bonner, 
private communication ]. 


Til. BY(y 2) B® Qm=— 8.436 
Qo= —8.55+0.25 [ (Sh 51d); neutron threshold] 


IV. B°(pd)B° n= —6.211 


At E,=18 Mev, deuteron groups are observed to the 
ground state of B? and to an excited state at 2.40.15 
Mey. The angular distributions of the deuterons to 
both states, analyzed by pickup theory, show an /=1 
transfer indicating odd parity, <J <9/2, for the 
ground- and the 2.4-Mev states of B*[ (Re 54a) and 
J. B. Reynolds, private communication ]. 


V. BY (He? a)B? @n= 12.130 

At E (He’)=1 Mev, alpha-particle groups have been 
observed to the ground state of B® and to an excited 
state at 2.58+-0.13 Mev (K. W. Allen and E. Almqvist, 
private communication). 


VI. C®(p a)B? Qn=—7.559 
At £,=18 Mev, alpha-particle groups have been 
observed to the ground state of B® and to an excited 
state at 2.40+0.08 Mev (J. B. Reynolds, private 
communication). 
(ou 
(Not illustrated) 


Comparing the mass of C° with that of its mirror 
nucleus Li, and making appropriate Coulomb and 
n— p mass difference corrections, we find a mass defect 
of 32.342 Mev. C? is then stable to decay into B8+p 
(by ~0.5 Mev). C? has not been observed [see (Sh 52) ]. 
Two reactions leading to C? are Be? (He? 2) C? (0n= —7), 
C®(y 3n)C? (Qm= —54). 


Bel? 
(Fig. 11) 
I. Be®(6B” 0,,=0.556 


The weighted mean end-point energy is 0.556-L0.003 
Mev (LiSia). The mean half-life is 2.7+-0.4X10° y 
(Hu 49a) : log fi=13.65 (Fe 51b). The spectrum is of the 
D2 type (Wu 50). 


Il. (a) Li?(ta)Heé On=919  By=17.236 
(b) Li7(¢ 2n)Bes On= 8.759 
(o) Li U 2) Be? m= 10.425 


© (d) Li (tn)He5+Het Qm=7.91 


E aa The: neutron yield (elemental Li) at 0° exhibits two 


broad resonances at #,=0.84 and 1.70 Mev. The angu- 
Jar distributions are not isotropic (Cr 51b). 

At Er=240 kev, reaction (a) accounts for 20 percent 
of the disintegrations yielding œ particles at @=90° 
(De 52). The ground-state a particles are distributed 
as 1—(0:66--0.06) cos’@ (c.m.) while those correspond- 
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z 
Binip E5565- v eon -0.538 
B° Be B% t-He® 
-2.566 
Li’+a-p 1130:54 
-3.845 
C+ n-a 


Fic. 11. Energy levels of Be”: for notation, see Fig. 1. 


ing to the 1.7-Mev state of He® are isotropic within 8 
percent. The formation of the 1.7-Mev state is 8 times 
more probable than the formation of the ground state 
(Al 54d). These results indicate p-wave formation of 
the 0.84-Mev resonance and therefore J=2+ for the 
17.82-Mev state [(Ch 53d) and (Al 54¢)]. See also 
Hes and (Aj 52c). 


MI. Li (œ p)Be” Q,,=—2.566 
See (Ec 37). 


IV. Be’(n y)Be® Qn=6.811 
Qo=6.8160.006 [ (Ki 53b) ; pair spectrometer ] 


The thermal capture cross section is 9.00.5 mb 
(Hu 52e). In addition to the ground-state transition, 
£,=06.81 Mev, a 3.41+-0.06-Mev y ray is observed, 
attributed to a cascade through the 3.4-Mev state. 
The intensity of the cascade transition is about 0.25 
photon/capture (Ba 53d). See also (Wi 538). 


V. Be(n 2) Bee E,=6.811 


The total cross section is constant at 6.04+0.03 b 
from 1 ev to 4 kev (Ho 52f) : the spin dependent scatter- 
ing is <0.03 b (Pa 52). 

In the region Z,=0 to 18 Mev, three resonances are 
reported at 0.62, 0.81, and 2.73 Mev. The parameters of 
these resonances are exhibited in Table 1(10). The angu- 
lar distribution is of the form 1+-0.28 Pa (cos) at the 


2 


4a 


VI. BE(z 2’) Be* 
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0.62-Mev resonance, while the nonresonant scattering 
at 0.5 Mev is isotropic [(Wi54) and (Wi 54e)]. The 
variation of the angular distribution with energy near 
the 0.62-Mev resonance is not well fitted by the as- 
sumption of a d-wave resonance interfering with s-wave 
potential scattering; a somewhat better fit is obtained 
with 7=1, assuming channel spin one for the potential 
scattering (Wi 54e). 

Angular distributions have also been measured at 
£,=2.30, 2.60, 2.90, 3.32, and 3.66 Mev. At the higher 
energies, the neutrons are peaked in the forward direc- 
tion (Me 53). The total cross section has been meas- 
ured from 3 to 12 Mev, with a resolution in E, of the 
order of 10 percent, by (Ne 53a) who find an approxi- 
mately monotonic decrease from 2 b at 4 Mev to 1.6 b 
at 12 Mev. (Co 54f) have studied the region E,= 14.1 
to 18.0 Mev and find a monotonic decrease in c; from 
1.49+0.02 b at 14.1 Mev to 1.38-40.03 b at 18.0 Mev. 
(Co 52h) find the total cross section at En =14 Mev 
to be 1.530.03 b. See also (Go 52d), (Hi 54d), and 
(Ne 54c). ° 


E,=6.811 
‘See (Gr 51a). 


VII. Be(n 24)Beè Qn=— 1.666 E,=6.811 


At £,=3.7 Mev, the reaction appears to involve 
a two-step process through the 2.43-Mev state of 
Be [ (Be (n 2’) Be’*—Be®+- | (Fo 54b). The cross sec- 
tion is 0.30.1 b for Ra—Be neutrons (R. D. Edge 
and D. H. Wilkinson, private communication). See also 
(Ho 50b and Ag 52). 


VIII. Be(wa)He® Qmn=—0.64 £,=6.811 


A resonanĉe (~1 Mev wide) for production of He 
occurs at E,=2.6 Mev; the cross section is ~50 mb 
(Al 47b). At 14 Mev, the cross section is 101 mb 
(Ba 53). 


6 


IX. Be(d p)Be® 0,,=4.586 
The weighted average of ground-state Q values is 
4.588-+-0.006 Mev (Va 54). A recent value is 4.586 


+0.009 Mev [ (Bo 54d); mag. spectrometer ]. 
Proton groups corresponding to levels in Be” at 3.37, 
5.96, 6.18, 6.26, 7.37, and 7.54 Mev have been observed 


Taste I(10). Resonances in Be?(n n)Be. 


Bele® 


(ict) Mev) (kev) On) 72/GR#/2ua) J, = Ref. 
7 3 37 0.017 3* Ad 49 
CAEN 23 4.35 3+ Wite 
7 54 ~1.3 >0 o Sie 
DBE ay SOR 5.258 (Q) Wiske 
2.732 9.27 ~100 ~1.8 Bo 51c 


(2.85) (9.4) 


a 3 kground, $ 
a oS DAT A7), (St Sid), (Ri 510), and (MeS3). onances, a sharp 
e The Targe cross section and asymmetry suggest Se S 
one at 2.73 and a much broader one at ~2.85 Mev ( ). 


d Includes background. 


Bets, B19 101 


Tagre I(10). Bei(d $)Be® proton groups." 


Bei Beier ot 
Rh $3 tar ¥ mb/ater 

0 “O 1 + ~54 
3.372 3.37 1 -+ ~5.7 
5.959 5,940.04 
6,178 
6.260 6.244004 . 
7.37 1 + ~g 


. analyzed by stripping theory 


ular distribution {10° to 20°, 
tion). 


private com 


[see Table I1(10)]. For Ea=5.4 to 7.4 Mev, the in- 
tensity of the group corresponding to the 6.18-Mev state 
is about 5 percent of the intensity gf the group to the 
6.26-Mev state (@= 90°). The width of the proton group 
to the 7.37-Mev state is consistent with a [ for the 
level of about 25 kev [see Be®(n 2) Be®]. The 7.54-Mev 
state appears to be narrower and approaches the in- 
strumental width of about 10 kev. The upper limit to 
the intensity of other proton groups at 9=90°, and for 
Ea=5.4 to 7.4 Mev, is 5 percent of the strength of the 
group to the 6.26-Mev state [ (Bo 54d) and J. J. Jung 
and C. K. Bockelman, private communication |. The 
angular distributions of the protons to the ground state 
and to two excited states have been studied at 14.5 Mev. 
The results are summarized in Table II(10) (J. N. 
McGruer, private communication). See also (Aj 52c). 

The angular correlation of protons and 3.37-Mev 
y rays, observed at £,=0.48 and 0.84 Mev establishes 
that J>2 for the excited state. Since the stripping 
results show J <3, even parity, and since the internal 
pair conversion coefficient is consistent only with £1, 
M1, or £2 radiation (Ma 53e), the 3.37-Mev state 
appears to have J=2* (Co 54e and St 54f). There is 
evidence that a stripping process occurs even below 
Ea=1 Mev [(Ca 52) and (Tr 52) }. The energy of the 
y line corresponding to the decay of the first excited 
state is 3.351+0.027 Mev, with a yield at £,=2.5 Mev 
of 20.4+2.0X10€ y/d (Ma 53e). At Ea=3.85 Mev, a 
6.02+0.06-Mev y ray (corrected for Doppler shift) is 
observed and is assigned to the 5.96-Mev state of Be”: 
yield=3.210-® y/d (Bent, Sippel, and Bonner, pri- 
vate communication). 


X. B"(n p)Be® Q,,=0.226 
See (Ja 54) and (Eg 48). > on m 
XI. C¥(na)Be"” Qn=—3.845 
See CH T 
Be a. 
(Fig. 12) an 
I. Lit@y)BY® ỌQm=4.453 > 


Resonances are observed for Ea= 500425 and 118345 
kev (T <10 kev), corresponding to levels at 4.750.02 


` P > 


` 
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and 5.162-+0.008 Mev. The former decays mainly via the 
0.7-Mey state, with of ,~0.15 ev (L= a/T 4,41); 
while the latter, with wI",~1 ev, decays to the ground 
state (5 percent), the 0.7-Mev state (25 percent), and 
the 2.15-Mev state (70 percent). No resonance corre- 
sponding to, the 5.11-Mev level is observed: wl’, <0.02 
ev. These results are taken to indicate J=1* and T=0, 
for the 4.77, J=2- and T=0 for the 5.11, and J=2+ 
and T=1 for the 5.16-Mey states of B® [ (Wi 53h) and 
(Jo 54e) ]. The analysis assumes forbiddenness of E1 
transitions between T=0 states in 7:=0 nuclei 
(Ge 53a), a 0.25 percent T=1 impurity in the ground 
state of Li’ [(Ra 53) and (Jo 53c) ], and/or a T=0 im- 
purity in the 5.16-Mev state of B® (D. H. Wilkinson, 
private communication). 


IL. Lif p)Be? E,=4.453 


See (Sh 37). 


Om= —2.132 


TIL. (a) L¥(He p)Be?” On=11.189  Ey=17.774 


(b) LI (He d)Be® Q,,=11.748 
= (c) L(Hëa)Lis Ọn=13.321 
See Lit, Be’, and Be’. 
IV. Li” (e n)BY On= — 2.792 


See (Ha 39a) and (Ba 533i). 


V. Be (p 7) BY On= 6.585 

The cross section has been measured for E,=30 to 
250 kev by (Sa 53b). At the lowest energies, the curve 
approaches the simple Gamow function; the general 


- course up to 200 kev reflects the effect of a higher 


resonance; o®2 ub at 200 kev (Sa 53b). Resonances for 
capture radiation are reported at #,=0.336, 0.492, 
0.67, 0.998, 1.087, and 2.565 Mev [see (Aj 52c) ]. Re- 
duced widths are tabulated by (Vo 54). There appears 
to be some disagreement concerning the possibility of a 
resonance below 150 kev and also whether the reported 
resonance at 492 kev is properly assigned to this reaction 
(see Ta 46 and Hu 52h). 

The 0.336-Mev resonance (I'=175 kev) is generally 
attributed to s-wave protons, because of its relatively 
great width [y?> 30 percent of sum-rule limit : (La 54a) ] 
and the fact that the y radiation is isotropic (Ja 48). 
At E,=315-kev y rays corresponding to the ground, 
0.72, 1.74, and 2.15-Mev states are observed, with 
relative intensities (+25 percent) of 0.15, 0.40, 1.00, 
and 0.45° The observed relative intensities are taken 
to imply J=1 for the 6.89-Mev state in B®. The cross 
section at resonance is 12-4 pb (Ca 54). The several 
ev gamma width to the 1.74-Mev level implies an 
allowed £1 transition and hence JE, Sw (see 
also Be®(p d)Be® and Be*(p a)Li®). If T=0 for the 
6.89-Mev level, the strong transitions to the 0.7- and 
2.1-Mev states are difficult to understand (D. H. 
Wilkinson and A. B. Clegg, private communication). 


LIGHT NU GEE Psp x 103 


The broad resonance at 0.99 Mev is also believed to 
be formed by s waves [+?=3 percent of sum-rule limit 
(Vo 54) |. The angular distribution of the y radiation 
is Y(6)=1+0.09 sin’), suggesting dominant s-wave 
formation with some d-wave contribution (De 49d and 
Pa 53d). (Ho 53b) locate the resonance at £,= 9934-2 
kev, P=88-43 kev. The decay proceeds mainly: to the 
ground state. The thick-target yield of 7.5-Mev radia- 
tion is 19X10 +/p; T,=23 ev (assuming J=2).. 
An appreciable yield of 0.4, 0.7, 1.0, and 1.4-Mev radia- 
tion is observed, suggesting ~10 percent of cascade 
transitions (Ho 53b). Study of the angular correlation 
of internal conversion pairs indicates about equal con- 
tributions of £1 and £3 or M2 transitions (De 54b). 
The great strength of the E1 transition may indicate 
that this is a T=1 level (Wi53a): see, however, 
Be(pa)Li and Be’(p d)Be*. Why the 7.48-Mev state 
does not couple with the 0.7-Mevy state and why the £3 
contribution is so large are not understood. 

The narrow 7.56-Mev level [£,=108542 kev; 
(Ho 53b) ] decays mainly to the 0.7-Mev state: yield 
=1.0X10~9 7/p, T,=6.0 ev (assuming J/=0). Again 
the presence of softer radiation suggests a small contri- 
bution of cascades, possibly to high, a-unstable levels. 
The absence of transitions to the 1.74-Mev (J=0*) 
state is consistent with the assumed J=0 character of 
the 7.56-Mev state (Ho 53b). The individual angular 
distributions of the y rays to the 0.72-Mev state and of 
the subsequent 0.72-Mev y rays are isotropic within 5 
percent of the 1.09-Mevy resonance. This is consistent 
with J=0 for the 7.56-Mev state (Pa 53d). 

The excitation curve for E,26 Mey shows a pro- 
nounced resonance at Z,=2.57+0.01 Mev, and another 
one, ~0.5 Mev wide, superimposed on a general rise, 
at E,=4.72+0.01 Mev. The excitation curve for 
E,2=2 Mev shows the first resonance and the general 
rise but not the 4.72-Mev resonance (Ha 52e). At the 
2.6-Mev resonance, the capture radiation appears to 
proceed predominantly to the 0.7-Mev state; the yield 
is ~3X10- y/p (Ma 53e). See also (WiS3c) and 
(La 54a). 


VI. Be(pn)B? Qm=— 1.852 E,=6.585 

Resonances are observed at E,=2.56 and 4.72 Mev, 
superposed on a general rise to E,~4.5 Mev ((Ri 51) 
and (Ha 52e) ]. See also (Aj 52c). 


VII. Be(p p) Be? E;=6.585 

The yield of elastically scattered protons, observed 
at @=138°, shows interference effects at the 0.33-, ~ 
0.995-, and 1.086-Mev resonances (Th 49). Analysis of 
these data suggests J=2- and J=0, respectively, 
for the states corresponding to the upper two resonances 
(Co 49). ait 

Absolute differential cross sections have been meas- 
ured at E,=30.6 Mev by (Wr 53a), and at E,=51.5 
Mev by (Br 52c) who also list cross sections for the 
formation of excited states in Be’. See also (Co 54a). 
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VII. Be(p )Be? Qm=—12.071 E,=6.585 


An excitation function is given by (Co 54b) for 


E,~14 to 23 Mev. The absolute cross section is 9.0 mb 
at 22 Mev. 


IX. (a) Be(pd)Be® Q,=0.560 E,=6.585 
(b) B&(pa)Li® Om=2.132 

The total cross section for reaction (a) exhibits peaks 
at 0.33, 0.47, 0.68, and 0.94 Mev. Reaction (b) shows 
all but the 0.47-Mev peak. The angular distributions 
indicate strong interference between states of opposite 
parity [see (Aj 52c)]. 

Reaction (b) exhibits a strong resonance at Ep = 2.56 
Mev [T=41+2 kev, (Ma 54g)]: short-range alpha 
particles and y rays are observed corresponding to the 
reaction leading to the 3.58-Mev level in Li® and its 
subsequent y decay to the ground state of Li®. Alpha- 
particle groups to the ground and the first excited state 
of Li® and ground-state deuterons from reaction (a) do 
not show resonance at E,=2.56 Mev (Ma 54g). 
Assuming T= 1 for the 3.58-Mev state in LiS, these data 
indicate that the 8.89-Mev state in B has T=1 or 
J=0t. 

The thick-targct yield of 3.6-Mev y rays at Ep=2.72 
Mev is given as 4.76X10-° y/p by (Da 52) and 4.63 
X10~ y/p by (Ma 54b); the cross section at resonance 
is 0.11 b (Da 52). Assuming equality of the reduced 
proton and neutron widths (Ma 54b) finds that the 
observed cross section requires J>2. A J of 3- would 
require an implausibly large reduced a-particle width, 
and it is concluded that the level has J=2+, T=1, 
Ya =19 or 33 percent of the sum-rule limit, y,2=y7,2 
=0.5 or 0.2 percent (Ma 54b). ; 


X. Be(dn)B" Q,,=4.360 


Neutron groups have been observed corresponding 
to states in B" at 0.72, 1.75, 2.15, 3.53(40.06), 4.78, 
5.14, 5.37(?), 5.58, 5.72(?), 5.93, 6.12, 6.38, 6.58, and 
6.77(?)(+.0.04) Mev [(Aj 51) and (Aj 52c)]. A neutron 
group corresponding to a state at 2.85--0.03 Mev is 
reported by (Dy 53) [see also (Re 54b) ]. Angular dis- 
tributions of the neutrons to the ground state and to the 
first few excited states have been studied at Ez=0.95 
(Pr 53) and 3.4 Mev (Aj 52b). The data show evidence 
both for stripping and compound nucleus formation 
with more evidence of the former in the higher-energy 
work. Analysis by stripping theory indicates that the 
ground state and the first four excited states have even 


© parity, J <3, and that one or both of the 5.11-5.16-Mev 


states have odd parity, J=1 or 2 (Aj 52b). Strong 


- as 


terms in cos@ observed in the low-energy work are taken 
to indicate that, in addition to the stripping process, 


- “two compound nucleus (B") levels of opposite parity, 


possibly ~ J=5/2- and 3/2, are involved (Pr 53). 

Gamma-gamma coincidences and angular correla- 
tions have been studied by (Sh 54a) who find essential 
ecay scheme proposed by (Aj 51) 
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Fic. 13. Gamma-ray transitions in B, The horizontal lines 
indicate the pertinent levels of B”, identified on the right by the 
level energies, and where known, the total angular momentum 
(J), parity, and isotopic spin (T). Solid vertical lines indicate 
observed transitions, while dashed lines indicate transitions whose 
assignment is not well established. Numbers on the vertical lines 
give the (estimated) percentage branching from each level; 
figures in parentheses are least certain. 
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and (Ri 52b) except that the 1.4-Mev y ray evidently 
occurs both as a transition between the 3.6- and 2.1- 
Mev levels and as a transition between the 2.1- and 
0.7-Mev levels (Sh 54a). The lifetime of the 0.72-Mev 
state is 742X10- sec (Th 53a). A decay scheme with 
(very approximate) relative intensities is given in 
Fig. 13. 

Observed thresholds for slow neutron production are 
given in Table III(10). Data taken with Ez=3 to 5.4 
Mev indicate no additional thresholds. The small yield 
of threshold neutrons from the 4.78-Mev state is con- 
sistent with p-wave neutrons and thus with even parity 
for that state. The small widths observed for the ex- 
cited states from 5 to 6.6 Mev, despite the instability 
of these states to a emission, imply that they have 
relatively high angular momenta (Bo 54c). 

At Ea=2.5 Mev, the yield of 3.595+-0.014 Mev 
y rays is 5.7+0.7X10-° y/d, and that of 5.98-+0.04 
Mev y rays is 1.6X10-® y/d. In addition, a y ray of 
4.44+0.03 Mev is also observed and is believed to be 
due to the decay of one of the 5.1-Mev levels to the 
0.72-Mev state. The 5.98-Mey y ray may be located in 
Be” (Doppler corrections to the E, have not been made) 
(Ma 53e). See also (Aj 52c), (Gr 53b), and (Sw 53a). 


TABLE IJI(10). Slow neutron thresholds in Be?(d »)B” (Bo 54c). 


Ea (Mev) Bick (Mev) T (kev) 
0.52 4.78 <10 
0.92 5.11 <10 
0.99 5.17 -~ <10 
1.92 5.93 <10 
2.08 6.06 <10 
2.20 6.16 <20 
2.53 6.43 
2.70 6.57 ~30 
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XI. Be™(g-)B" 


ENERGY 


Om = 0.556 
See Be, 


XII. (a) BY(y d)He'+ Het 
(b) BY(y a) Li® On= — 4.453 
(c) Bey np)He'+-Het 0O,=—8.155 
In the range Z,=10 to 30 Mev, reaction (a) proceeds 
almost entirely through excited states of Bef: transi- 
tions via (a+d)-emitting states of Li® apparently do 
not occur. The cross section exhibits a maximum at 
E,=10.8 Mev, possibly corresponding to the known 
level at that energy. The peak cross section is 1.5 mb 
[(Mu 52a), (Lo 53), and (Er 53) ]. Reaction (b) is said 
to proceed via the ground state and y-emitting states 
of Li® at 1.1 and 2.2 Mey. The cross section for (Li?+ 9) 
y rays is 0.2 mb. The corresponding cross section for 
reaction (c) is 0.3 mb. Neither reaction (b) nor (c) 
appears to exhibit the 10.8-Mev resonance (Er 53). 
See also (Aj 52c). 


Om= —5.930 


XIII. B’ (2 n’) B!* 


E A gamma ray of energy 0.7170.007 Mev is ob- 
served with E,=2.5 Mev [(Da 53c) and R. B. Day, 
private communication ]. See also (Sc 54c). 


XIV. B"(p p/)BY* 


Qi= —0.719+0.0016 Mev (Cr 52c; elect. analyzer). 
Qı= —0.718-£0.005 Mev (Da 54; scin. spectrometer). 


Proton groups observed by (Bo 53) and (Br 53i) are 
exhibited in Table IV(10). See also (Aj 52c) and C". 


TABLE IV(10). B® levels from B(p p’)B!* and 
Bd d’)B™* [ (Bo 53 and Br 53i)]. 


Relative intensities 


at 6 Mev at 7 Mev at 7.6 Mev 
Bix (pp) (dd) @p') (ad) p) (dd) 
0 . 100 100 100 100 100 100 
0.717 0.005 7 7 Ip 4 1 
1.739 +0.005 1 <04 1 <0.15 1 <0.15 
2.152 +0.005 6 2 5 4 3 4 
3.583 +0.005 4 5 4 7 T 


4.771”+0.005 
5.105*+0.007 
5.159°=-0.007 


a Observed in B!°(p p’): intensities not reported. 


XV. BY(dd’)B* 

Deuteron groups observed by (Bo 53) are exhibited 
in Table IV(10). The absence of deuteron groups corre- 
sponding to the 1.74-Mev state of B" is strong evidence 
[see (Ad 52a) ] that this state is the J=0+, T=1 analog 
of the ground states of Be” and C”. 


XVI. C(8+)B" On= 3.64 


ife i ; the maximum 
The half-life is 19.140.8 sec (Sh 49) 2 
Positron energy, obtained by absorption measurements, 


CC-0.1 
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is 2.10:0.1 Mev. Gamma rays of energy 723-415 and 
1033-430 kev, the latter resulting from transitions to 
the 1.74-Mev state, are observed. Relative transition 
probabilities to the 0.72, 1.74, and 2.15-Mev levels are 
given as 98.4/1.65-£0.2/<0.1 by (Sh 53a). The corre- 
sponding log ft values, using #3(max)=1.90 and 0.88 
Mev (obtained from B'"(pn)C™) are 3.03 and 3.43. 
In view of direct evidence that the 1:74-Mev level has 
J=0+ [see Be’(d n)B"], it is reasonably certain that 
the 8* transition to this state is of the 0-0 type. Transi- 
tions to the 2.1-Mev state (J=1*) should be allowed 
and it is not clear why they are not observed (Sh 53a). 
See also (Ko 53). 


XVII. BY (y n)B® 
See (Sh 51d). 


XVII. BU(pdjB® O,=—9.233 


At Ep= 18 Mev, a deuteron group to the ground state 
of B” has been observed. The angular distribution of 
the deuterons, analyzed by the pickup theory, indicates 
1,=1, and therefore different parities for the ground 
states of B? and of B” (Re 54a). 


On= — 11.459 


XIX. C?(da)B® 0,=—1.349 

At Ez=7 Mev, alpha-particle groups to the ground 
state and to the 0.72-Mev level are observed [ (As 51) 
and (Sp 53c) ]. 


XX. N4(ya@)B" On=—11.613 
See (Mi 53c). 


Low-Lying States of B!’ 


The following arguments support the spin and parity 
assignments for the first five excited states of B” [see 
(Ri 52b), (Aj 52b), and (Sh 53a) ]. 

(a) All have even parity: J=0, 1, 2, or 3 (neutron 
angular distributions). 

(b) 1.74-Mev state: J is certainly not >1 because the 
C" 6+ decay is allowed. C” can reasonably. be assumed 
to be J=0*; if charge independence is assumed, the 
1.74-Mev state is clearly established as J=0+ [see 
(Pr 53) ], T=1 by B°(dd’)B”. The fact that the ground- 
state y transition is not observed is consistent with a 
large spin difference. 

(c) 0.72-Mev state: C® decay is allowed; therefore, 
J is certainly not >1; J=0 is eliminated by observa- 
tion of the transition from the 1.74-Mev state, thus 
J=1. The lifetime of the state is consistent with £2 z 

(d) 2.15-Mev state: The fact that the y transition to 
the 1.74-Mev state is comparable in intensity to the 
ground-state transition argues strongly against J=2 
and 3; J=0 is excluded by the observation of the transi- ~ 
tion to the 1.74-Mev level; therefore J=1. uw 

(e) 3.58-Mev state: The absence of a transition to the 
1.74-Mev state argues against J=1. If J=0, the transi- 
tion to the ground state should be very weak; if J =3, 


` 
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the transition to the 0.7-Mev state should be relatively 
weak. Thus J= 2 appears to be more consistent with the 
observed decay scheme. j 

(f) 4.77-Mev state: The only observed y decay is to 
the 0.72-Mev state. This obServation favors J=0 or 1 
over J=2 or 3. The parity is even (Bo 54c). The fact 
that the state is seen in Li®(@ y)B™ eliminates 0+; 
therefore J=1+. ° 


C10 
(Fig. 14) 
Mass of C!° 


The mass difference C’—B" is given as 3.84-40.1 
Mev from 8 end-point measurements (Sh 53a) and as 
3.59+0.05 Mev (Cook, Marion, and Bonner, private 
communication), 3.6-+0.2 Mev (Aj54b) from the 
B"(p n)C® Q value. The weighted mean of these three 
results yields a mass defect for C! of 18.64--0.06 Mev. 


I. C°@)BY O,,=3.64 
The decay is complex. See B". 


Il. B°(pn)C® Qm=— 4.42 


Qo= —4.37+0.05 Mev (Cook, Marion, and Bonner, 
private communication; neutron threshold). 
Qo= —4.350.2 Mev [(Aj 54b); photoplate ]. 


= -364 ase -3.65 
2 B= B'°+ He? -t -442 
3 Sec B°+ p-n 


ao 
= Fic. 14. Energy levels of C”: fo- notation, see Fig. 1. 
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At £,=17.2 Mev, neutron groups are observed to the 
ground state, to an excited state at 3.34-40.2 Mey 
and to wide or unresolved level(s) at E,~5.1 Mey. The 
larger cross section for the formation of the 3.3-Mey 
state than for the formation of the ground state of C10 
is consistent with the smaller J change involved if the 
first excited state is the analog to the J=2 state in Beto 
at 3.37 Mev (Aj 54b). 


Belt 
(Not illustrated) 


A rough interpolation from other A suggests that the 
first T=} state in B" is located ~13 Mev above the 
ground state. This means Be!—BU~11.5 Mev; mass 
defect ~23.4 Mev. Be"! might then be stable by as much 
as 2 Mev with respect to Be'’+-n. See also (Ba 39e). 

No evidence is reported for this nucleus. Some of the 
reactions which would lead to it are: Be(t p)Bel 
(Qn=—1), B'(np)Be" (Q,=—11), B"(tHe!)Beu 
(Qm=—12), and C! (n He*)Be" (Q,,= —24). 


Bu 
(Fig. 15) 
I. Li(ay)B"  Qn=8.667 


Three resonances are reported below E,=2.5 Mev 
[ (Be 51) and (He 54b) ]. Their parameters are exhibited 
in Table I(11) as are their modes of decay (see also 
Fig. 16). Study of a—y and y—y angular correlations, 
taken together with the relative y-ray intensities, leads 
to the following assignments: 9.28-Mev level: J=5/2t, 
9.19-Mev: J=5/2-, 8.92-Mev: J=} or 5/2, 6.81-Mev: 
J=}, 4.46-Mev level: J=5/2- [(Jo 52d) and D. H. 
Wilkinson, private communication]. Seeʻalso (Jo 52c). 


II. Li (œ 2)BY 
See (Ho 50b). 


mm —2.792 Ey, = 8.667 


MI. Li (& p)Be® O,=—2.566 E,=8.667 
See (Ec 37). 


IV. Li’ (æ a’) Li™ E,=8.667 


The 0.48-Mev y-ray yield shows resonances at 
E,=1.91 (He 54b), 1.889+0.010 Mev (Li54: peak 
o=110+20 mb), at E«= 2.46 (He 54b), 2.50-0.03 Mev 
(Li 54: peak c=80415 mb, crs~30 mb), and at 
E,=3.06 Mev (He 54b). Correction for barrier pene- 


TABLE I(11). Resonances in Li’ (æ y)B". 


Ea" T (lab)a Partial widths, wry (ev), to states of BY at 
(Mev) (kev) Bu* 0 2.14 4.46 5.03 6.81 


0.401 <1 8.922 0.15 <0,003 <0.003 ~0.005 0.00 
0.819%0.C01 ~4 9.188 <0.05 <0.02 2.0 <0.1 ~0.3. 
0.958 +0.C01 7 9.277 3.5 <0.17 8.1 <0.4 24 


eee 
a (Be 51). Also see (He 54b). rt 
b (Jo 52d) and D. H. Wilkinson (private communication). (Be 51) repo! 
total gamma widths of 0.04, 0.6, and 4.7 ev for the three resonances. 
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p ; Fic. 15. Energy levels of B”: for notation, see Fig. 1. 


tration effects leads to values of Ep= 1.87 and 2.45 Mev The fast neutron and y-ray yields rise smoothly to 
and T (c.m.)=125=10 and ~155 kev, respectiv ely, for Ea=1.8 Mev with some indication of a broad resonance 
the first two resonances, corresponding to levels in B! at 1 Mev (Ev 49). See also BY, (Pr 53), and (Aj 5%). 3 
at 9.86 and 10.23 Mev. Calculation of the ratio of the 


e > 


reduced width to the Wigner sum-rule limit gives VI, Be®(d d)Be? E,=15.818 Bary 
J <5/2 for the 9.86-Mev and J <7/2 for the 10.23-Mev e 
An : +s See (El 52). ere 
states (Li 54). See also (Aj 52c). 
| Be®(d n)BY Qn=4.360 E,=15.818 VIL, (a) Be(d p)Be" Qu=4586 Br=15.8i8 —- 
| The Gross section has been measured for Ea=70 to (b) B&(da)Li? On=7.151 ` 
| 110 kev by (Ra 54b); it follows the Gamow function (c) Be’(dt)Be® Qe=4. .591 : 
| for these energies. (d) Be? (d 2a)H? * Qm=4.687 
1 Lg 
l 


e 


E 
ə» 


Fic. 16. Gamma-ray transitions in B": for notation, see Fig. 13. 


Absolute cross sections for reactions (a) and (b) are 


' reported by (Sa 53b) in the range Eg=30 to 250 kev, 
i and for reactions (4) and (c) by (De 52b) in the range 
f Ea=150 to 620 kev. In reaction (a), (Sa 53b) find a 
| simple Gamow dependence for both the ground state 
and 3.37-Mev state protons; o~25 ub for each at 


) £a=150 kev, assuming isotropy. (De 52b) find the 
i distributions markedlyzanisotropic even at this low 
i! energy (see also Be!) and quote o= 12.7 ub. The cross 
section at Ez=620 kev is 5.26 mb (De 52b). (Ca 52) 
j reports broad maxima in the @=90° yield of ground- 
{ state protons at Ea~0.9, (1.3), and 2.1 Mev. 

The cross section for reaction (b) shows a simple 
Gamow rise to Ea= 250 kev; o~0.8 mb at Ea= 200 kev 
(Sa 53b). The angular distribution for combined ground- 
state and 0.48-Mey state a particles is approximately 
isotropic for Hz=0.3 to 0.7 Mev [(Re 51a); also see 
Li7]. (De 52b) find the cross section for ground-state 
tritons [reaction (c) ] and protons [reaction (a) ] to be 
the same for Ea=150 to 450 kev. Above 450 kev, the 
: (d p) cross section rises more rapidly. For angular dis- 

i i tributions, see Be®. Relative yields for the various groups 
iig from reactions (a), (b), and (c) are given by (Ge 53). 
E The direct three-body reaction (d) does not appear to 
occur (Ge 53). 


kegi 


VII. Be? (He? p)B" Qn=10.325 
ih See (Mo 54b). 


E IX. Be (æ d)B" Qm= —8.016 
E See (Mc 51b). 
l i x X. B(x n)B! E,= 11.459 


Eo oa Tr epithermal cross section (free) is 3.340.5 b 
pi (Hu 54a). Broad maxima appear in the cross section at 
- - Z,=1.9 and 2.8 Mev, o~2.5 b; an additional peak near 
0.2 Mev may be indicated (Bo 51c). Nonresonant 
“scattering at 0.5 Mev is nearly isotropic. At E,=1.5 
Mev, ströng forward scattering is observed (Wi 54). 
For E,=6 to 9.7 Mev, the total cross section is nearly 
constant at 1.4 b (Ne 54c). At Z,=14 Mev, the total 

‘cross section is 1.47+0.03 b (Có 52h). From E,=14.1 
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to 18.0 Mev, o; is nearly constant at 1.450.02 } 
(Co 54f). 


XI. BY (2 2’) BY* 
See (Ph 52b). 


E= 11.459 


XII. B(x p)Be® Qn=0.226 E,=11.459 


The thermal cross section is <0.2 b (Hu 52e); the 
cross section for fast pile neutrons is 3 mb (Eg 48). See 
also (Ja 54). 


XII. B(x d) Be On=—4.359 E,=11.459 


At E,=14 Mey, the integrated cross sections (0° to 
90°, c.m.) for the transitions to the ground and the 
2.4-Mev states of Be are 21+3 mb and 16-42 mb, re- 
spectively (Ri 54a). See also (Ja 54). 


XIV. BY(na)Li? On=2.792 E,=11.459 


Recently quoted values for’ the thermal neutron 
absorption cross section in natural boron are 749-44 b 
(Ca 53c), 7553 b (Ha 53g), 744+20 b (Sc 54b). 
(Hu 53c) adopt c=750+4 b. From 7X10% to 200 ev, 
the cross section follows the 1/v law [(Hu 53c) and 
(Hu 52e) ]. Whether a resonance exists near E,=0.2 
Mev isnot clear [see (Go 47a), (Hu 52e), and (Hu 54a) J. 
A pronounced resonance exists at £,=1.9 Mev with 
T=400 kev, c=0.5 b (Pe 51c), which may actually be 
two resonances, at 1.81-L0.04 and 2.03+0.04 Mev 
(St 50d) and (Bi 52) ]. The ratio of transitions leading 
to the ground state and to the 0.48-Mev state of Li? 
varies markedly with energy: see (Aj 52c) and (De 54). 


XV. BY(d )BY Qm=9.234 
Qo=9.227-0.006 (El 54; mag. spectrometer). 


Proton groups reported by (Va 51a) and (El 53) are 
listed in Table II(11). All the levels listed, except the 
last, have widths less than 15 kev; an upper limit of 
4 kev is placed on the width of the 8.93-Mev level and 


TABLE II(11). Proton groups from B(d p)B". 


A B 
(Va 51a) (El 53) 
= Rel. 
any) dR any Int.b Bur 
9.23540.011 2 0 

7.097+0.009 0.4 2.138 
4.77640.008 1.2 4.459 
4.201+0.008 1.0 5.034 
2.47740.007 2 6.758 
2.427+0.007 0.2 6.808 
1.93740.006 1.2 7.298 
1.248+-0.009 0.088 7.987 
0.667+0.005 0.3 0.667 0.008 0.087 8.568 
0.309£0.005 5 0.30640.008 0.58 8.927 
0.045+0.005 8 0.042+40.008 1.0 9.191 
—0.0414-0.005 4 —0.042 0.007 9.276 

—1.08 +£0.02 0.6 10.32 


OENE AE E E re a 
III I 


a Approximate differential cross sections in mb/ster at 90°, Ea =1,51 Mev. 
b Approximate relative intensities at 90°, Ez =8.06 Mev. 


lection, Haridwar 
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of 10 kev on the levels at 7.99, 8.57, 9.19, and 9.28 Mev 
[see Li (a y)B"]. The width of the 10.32-Mev level is 
5417 kev (El 53); it does not appear to correspond to 
the 10.23-Mev level observed in Li’(a@a’)Li’ (Li 54). 
No other levels are observed below £,=11.46 Mev. 
For example, the upper limit for groups <50-kev wide 
is 15 percent’ of the intensity of the protons to the 9.19- 
Mey state (El 53). [The nonappearance of levels corre- 
sponding to those reported in Li’ (a a’) Li’ is presumably 
to be attributed to their large width.] At E4= 4.153 
Mev, four proton groups are observed corresponding to 
excited states at 6.79, 7.25, and 8.96 Mev [(Kh 54): 
error given as +0.02]. 

At Ea=7.7 Mey, the angular distributions of the 
protons to the ground state and to levels at 2.14, 4.46, 
5.03, 6.76, and 6.81 (unresolved), 8.93, 9.19, and 9.28 
Mev (unresolved) have been analyzed by stripping 
theory. The levels below 7 Mev seem to be fitted by 
!,=1 assignments, although in the case of the un- 
resolved levels, higher 7, are not excluded. The /,=1 
assignment restricts the states to odd parity and 
3<J<i7/2 [(Ev 54); see, however, (Pa 54a) ]. The 
variation of the distributions with energy is of particular 
interest and may indicate interference between stripping 
and compound nucleus formation. Relatively large 
neutron capture probabilities suggest that the ground 
state and one of the states near 6.8 Mev may have a 
single-particle character (Ev 54). 

Angular distributions have also been studied at 
Ea=0.20, 0.45, and 0.60 Mev by (Pa 54a), at Za=0.3 
Mev by (En 52), at Ea=1 to 3.7 Mev by (Re 50), 
at Ea= 1.06 and 1.43 Mev by (Bu 54), at Ea= 3.03 Mev 
by (Pr 54a) and at 8 Mev by (Ho 53c). It appears that 
the stripping process makes an appreciable contribution 
even at energies lower than 1 Mev for some proton 
groups [(Bu 54) and (Pa 54a) ]. Assuming that com- 
pound nucleus formation contributes an approximately 
isotropic background, (Pa 54a) find that the angular 
distributions may be fitted by /,=1, 3, 0, and 2 for the 
B" states at 0, 2.14, 4.46, and 5.03 Mev, respectively. 

Gamma rays reported by (Bent, Sippel, and Bonner, 
private communication: lens pair spectrometer) and 
attributed to the present reaction are listed in Table 
III(11) and indicated in Fig. 16. Studies on p—y 
coincidences and correlations are cited in (Aj 52c). In 
the correlation of the proton group leading to the 2.14 
Mev level and the y radiation due to its subsequent 
decay, (Th 53) finds 1(90°)/7 (180°) = 1.05=20.05, con- 
sistent with J=} for the excited state. See also (Th 54d). 


XVI. BY(y n)Be Om= — 11.459 
See (Sh 51d). 
XVII. B'(y)He'+He! Qn=—11.131 


The reaction apparently proceeds mainly through 
excited states of Be’, although events corresponding to 
B"(y a)Li? and B" (y @)Li™*—H*+ He‘ are also reported. 
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Taste IIE(11). 7 rays from BY(d pB". 


Yaeld Total o° Assignment 

Eyt 10-8 y/d (mb) piz 
Fe E T ERS 

4.74-0.04 Zee 6.9 9.19-4,464 
8.984-0.04 2.3 7.2 8.93 
8.62-£0.07 0.6 1.9 8.57 

5.87+0,.07 0.9 2.8 7.99-2.14 
7.32+0.04 1.3 41 7.30 
6.79=4-0,04 3.3 12 6.76 
5.03-£0.04 1.3 4.1 5.03 
4.50+0.10 1.9 6.9 4.46 


a Bent, Sippel, and Bonner, private communication: E4 =2.0 Mev, thick 
B” target. 

b Includes ~0.5 percent correction for Doppler shift. 

e Average, Ea =0 to 2 Mev. 

d May also be from B*(d n)CH, 4.77-Mev state. 


The cross section exhibits a maximum near E, = 23 Mev, 
o~0.25 mb, and possibly a second maximum near 27.5 
Mey. There is some indication of fire structure, corre- 
sponding to discrete levels of B" in the region 16 to 28 
Mev (Er 53). See also (Mu 52a) and (Lo 53). 


XVII. B"(n n/)Bu* 
See (Sc 54c). 


XIX. B"(p p)BU* 

Inelastic proton groups corresponding to levels at 
2.2, 4.8, 6.5, and 7.8 Mev are reported (Fu 48 and 
Co 52g). A 2.13-Mev y ray is observed for E,=2.5 to 
2.9 Mev (Hu 53 and Ba 54g). 


XX. CX(6+)B" 0, = 1.980 
See CH., 

XXI. C2(y p)B" Qa=— 15.949 
See CP. 

XXII. C2(da)B" 0n=5.163 


The weighted mean of Q-value measurements is 
5.164+0.004 Mev (Va 54). This number includes a 
recent value of 5.166+0.005 Mev [(Ph 53) and G. C. 
Phillips, private communication ; mag. spectrometer |. 

An alpha-particle group has been observed corre- 
sponding to a level at 2.107-+0.017 Mev (Li 51b). At 
E.=7 Mev, alpha-particle groups corresponding to B“ 
levels at 4.45 and 6.83 Mev are reported by (Sp 53c). 
At Ez=2 and 4 Mev, a 448-Mev y ray is observed _ 
which may be located either in B“ or in N“ [ (Be 54a), 
(Be 53b), and Bent, Sippel, and Bonner, private 
communication ]. 


ae 
XXOL N'(na)B"Y On,=—0.154 = Š 

At E,=14.1 Mev, evidence is found for states of B" 
at 2.1, 4.4, 5.0, 6.7, 7.3,¢8.0, 8.5, 8.9, and 11.8 Mev 
(Li 52). i ‘ 


~ 
~ = 
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qu IV. B°(p2)C® O,=—4.42 E,=8.697 
(Fig. 17) | Sao CU, 
I. CGB" Q,=1.980 _ 


The spectrum is simple. The half-life is 20.35=+0.08 
(Sm 41), 20.5040.06 (So 41), 20.00.1 (Di 51), 20.74 
+£0.10. min (Kú 53a). The maximum beta energy is 
9683-8 kev. Log fi=3.62 (using 7}= 20.4 min) (Wo 54a). 


Il. Be®(He? 2)C" On=7.563 
See (Po 52b, Ku 53a, and Mo 54b). 


Til. BY(py)C"  Qm=8.697 

The cross section has been measured for E,=0.5 
to 2.5 Mev. It indicates a broad resonance at E,=1.2 
Mev and, possibly, one at 2.4 Mev. In this energy 
interval, the y transitions are mainly to the ground 
state; at Ep=0.86 Mev, E,=9.9+0.5 Mev; at E,=1.21 
Mey, £,=9.4+0.5 Mev (Da 54). The thick B” target 
yield of 9.47=-0.12 Mev y radiation is 3.4X10-” y/p at 
E,=1.16 Mev (Wa 50a). See also (Kr53b) and 
(Wi 53c). A broad resonance, I'~0.5 Mev, is reported 
at E,=4.0 Mev [(Ba 54g) : preliminary ]. 


V. B(p p)BY Ey=8.697 

The elastic scattering cross section at 0= 138° rises 
from nearly the Rutherford value for Ey<0.9 Mev to 
4 times Rutherford at E,=1.6 Mev. Anomalies are ob- 


served at Hp=1.15 and 1.5 Mev (Br Sia). See also 
(Va 50). 


VI. B°(p p’)B* Ey= 8.697 

The cross section for 0.72 Mev y radiation at 0=90° 
rises monotonically from 0.04 mb/sterad at 650) 
Mev to 1.6 mb/sterad at 2.65 Mev [(Da52) and 
(Da 54) ]. At E,=2.19 Mev, 0=135°, the cross section 
for the corresponding proton group is ~3 mb/sterad. 
For E {4.2 Mev, the cross section for excitation of 
higher states in B” is <0.3 mb (Cr 52c). 


VII. B!(p d)B® m=— 06.211 E,=8.097 


See B°. 


TR TAT] 


mics 


SS 
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VILL. B(p He?)Be® Qn=—0.532 E,=8.697 
See (Cr 52c). 


IX. B’(p a)Be? Om= 1.147 Ey, = 8.697 

- The excitation function is a smooth exponential from 
E,=60 to 200 kev. At Ep=180 and 205 kev, the cross 
sections are 0.26+0.04 and 0.79-£0.12 mb, respec- 
tively (G. G. Bach and D. J. Livesey, private com- 
munication). 

The ground-state a particles exhibit broad resonances 
at Ep=1.1 Mev and 1.5 Mev, superposed on a con- 
tinuous background. At E,=1.1 Mev, 6=138°, the 
differential cross section= 16+3 mb/sterad, dropping to 
half this value at E,=1.6 Mev (Br 51a). At £,=3.33 
Mev, 0=135°, the differential cross section is 20 mb/ 
sterad (Cr 52c). 

The short range alphas, = 138°, and 0.43-Mev y rays, 
§=90°, exhibit only one resonance for Zp<2.7 Mev, at 
1.52 Mev, with a width of 0.25 Mev and a peak cross 
section of 0:14-£0.03 b (Br 51a), 0.2140.07 b (Da 54). 
See also (Aj 52c). 


X.-BY(d nC! On=6.472 


At Ea=3.40 and 3.64 Mev, 0=0° and 80°, neutron 
groups are observed corresponding to states in C" at 
0, 1.85, 4.23, 4.77-0.06, 6.40, 6.77, 7.39-0.04, 8.08, 
8.39, 8.62, 8.97(?), and 9.13-40.02(?) Mev. Levels at 
6.40, 8.39, and 8.62 Mev are believed to be formed by 
stripping, with /p=0 transfer, and to have even parity, 
J=5/2 or 7/2 (Jo 52a). Angular distributions have been 
studied at Ez=0.58 Mev (Pa 54b) and at E4=0.71 to 
1.43 Mev (Bu 54); evidence is found for participation 
of the stripping process even at 0.6 Mev. Assuming an 
isotropic conttibution from compound nucleus forma- 
tion, (Pa 54b) find /,=1, 3, and 0 for formation of Cl! 
states at 0, 1.85, and 4.23 Mev, respectively. 

Neutron threshold measurements indicate levels in 
C! at 8.14-+0.02, 846+0.02, and 8.70+0.02 Mev 
(Cook, Marion, and Bonner, private communication). 
Gamma-ray measurements at Ez=2 Mev indicate two 
lines with £,=6.52+0.04 Mev (caye-=12 mb for 
Ea=2 Mev) and 7.01+0.06 Mev (cave=6.0 mb for 
E4=2 Mev) (Bent, Sippel, and Bonner, private com- 
munication: corrected for Doppler shift). See also 
(Gi 53). 


XI. BY’(He? d)C" 0,=3.203 
The ground-state reaction has been observed for 
E(He’)=1 Mev (E. Almqvist, private communication). 


XII. B" (pn)C! Qn=—2.762 

At E,=5.96 Mev, neutron groups corresponding to 
the ground state and an excited state at 2.05+0.1 Mev 
are observed (Rubin, Mazari, and Ajzenberg, private 
communication). See also (Ri 50e) and (Aj 54b). 


Ci, B12 111 


XI. C?(n 2n)C" 


See C”. 


On 48.711 


XIV. C#(y njCh 


see CZ 


On* — 18.711 


XV. C2(p dC 
See N”. 


Om= ~16.486° 


XVI. C¥(He?a)C On = 1.855 
See (Fr 52e, Po 52b, and Ku 53a). 


XVII. N'4(p a)C# 
See 0'5, 


On= —2.916 


XVIII. O(y na)C! On = — 25.859" 
The threshold energy is 25.9+-0.2 Mev [P. Stoll and 
P. Erdés, private communication, and (Er 54c) ]. 
Bu 
(Fig. 18) 
I. B°()C”  O,n=13.370 


The spectrum is complex: see C”. 


Il. Bela p)BY 0,,= — 6.880 


At E.=21.7 Mev, proton groups are observed corre- 
sponding to the ground state of B? and to levels at 
0.95, 1.65, and 3.82 Mev (Mc 51b). 


UL. BY(ny)B? O,,=3.361 
gin<50 mb (Wa 50e). See also (Wi 53f). 
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TABLE I(12). Resonances in B"! (n n)B" (Bo Sic). E,=0 to 3.15 Mev with an intensity greater than 4 

$ a a z ae percent of the ground-state group, or from 3.15- to 3,5- 

i (fey) ony (kev) (Mey) ~—sE98/(34/2ua) J,» Mev excitation with an intensity greater than 8 percent 

i 0.43 29 40 3.76 i 0.036 2+ Association of these levels with known levels in C12 i 

i discussed (El 53). At Ea=4.153 Mev, proton groups 

i 1.28 2 130 4.53 2 0.28 Sa are observed to the ground state and to levels at 0.949 

| 1.78 04 65 4.99 1 0.012 1 and 1.664 Mev [ (Kh 54); mag. analyzer: =Æ 20 kev]. 

| 2 0.056 At Ea=8 Mev, the angular distribution of the pro- 

f (3) R tons, analyzed by stripping theory, indicate }„,=1 and ("A 

i; 2.45 0.5 120 5.61 1 0.017 2 therefore J <3, even parity for the ground state of Bz 

i 2 0.053 and the levels at 0.95 and 3.38 Mev, /,=0 for the 1.67- 

i 3 0.82 5 Mev state (J=1, 27) and /,=2 for the 4.53-Mev level te 

| 2.58 0.6 w wa a (0.00823 (J <4-) of B? (Ho 53c). At Ea=1.05 Mev, y rays with | 
2 0025 energies of 0.94 and 1.64 Mev are observed with an | 


a Widths corrected for variation of level shift with energy. 
b According to (Vo 54), 1 =3 yields an implausibly large value for 7?/D. 


IV. B” (n n)B" Ey=3.361 


Observed resonances are listed in Table I(12). The 
rise of the cross section at low energies may indicate a 
broad level formed by s-wave neutrons (Bo Sic). The 
angular distribution of scattered neutrons at the 0.43- 
Mey resonance is of the form 1+-4Pi(cos@) and indi- 
cates p-wave formation of a J=2+ level. The distribu- 
tion at the 1.28-Mev resonance agrees with d-wave 
formation of a J=3- level [(Wi54) and (Wi 54e) ]. 
Polarization of the neutrons has been observed at 
E,,=0.43 Mev (A. Okazaki, private communication). 

The total cross section varies from 1.5 b at 6 Mev 
to 1.4 b at 9.7 Mev (Ne 54c). The total cross section 
decreases monotonically from 1.43+-0.02 b (Co S4f), 
1.40+-0.03 b (Co 52h) at E,=14.1 Mey to 1.30 b at 
18.0 Mev (Co 54f). See also (Hi 54d). 


V. (a) B'(71d)Be® Q,=—9.008 E,=3.361 
(b) BY (n i)Be On=—9.561 
(c) BY(na)Lis Qn=—6.630 
Reactions (a) and (b) have not been reported. Cross 
sections for reaction (c) have been reported for nine 
energies in the range E,=12.5 to 19.9 Mev by (Fr 54d). 
See also (Fr 53d) and (He 54c). 


VI. B'(d p)B® Qm=1.136 
Proton groups observed at Ez=4.0 to 8.5 Mev are 


listed in Table JI (12). No other groups are observed for 


TABLE II (12). Levels in B” from B'(d p)B® (El 53 and Bu 50d). 


ay) pus 


1(90°)s 

1.136+0.004 0 1.0 
.189-+-0.004 0.947+0.005 0.78 

5340.008 1.674+0.011 2.85 

4780.007 2.618+-0.011 0.69 

. 2.723£0.011 0.04 

3.383 £0.009 1.86 


a 


7 ties at 90° (+20 percent) at Za =8.06 Mev. 
5 - AEZ 


EYA 
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intensity ratio of 2:1 (Th 54d). ` f 


VII. C2(n p)B? 
See (Je 48c). 


On= —12.588 


VIII. C¥(da)B? On=0.355 
See (Hu 50e) and N°. 


IX. N5(n0)B2 Qm=—7.627 
See (Je 48c). 
C12 |: 
(Fig. 19) | 
I. (a) Be®(He? z)C! Qm=7.563 Ey= 26.274 
(b) Be?(He® p)B" Q,,=10.325 


(c) Be(He? a)Be® Q,=18.900 
See Be’, BY, and C”. 


II. Be%(a n)C® Om=5.708 


The neutron spectrum at E.=5.3 Mev (thin target) | 
indicates levels of C! at 0, 4.2, and 7.5 Mev; in the | 
forward direction, the intensity of the latter group is | 
about 1/8 of that corresponding to the 4.4-Mev level | 
(Gu 52). Study of n—y coincidences indicates that fad 
60-6 percent of disintegrations lead to the 4.4-Mev f 
level; no indication of n—y coincidences involving 1 
other than the 4.4-Mev y ray is observed [(Di 54); d 
E.=5.3 Mev]. 

The energy of the y ray from the first level is 4.425 l 
0.020 Mev (corrected for Doppler shift). From the 
observed Doppler shift it is concluded that the life- 
time of the level is <310- sec (Mi 54d). The in- 
ternal pair conversion coefficient agrees with an 2 
assignment (Mi 54d) while the angular correlation 1S 
consistent with either E2 or M1 (Ha 54a). 

At E«=5.3 Mev, a y ray of energy 3.16+0.05 Mev 18 
observed with an intensity relative to the 4.4-Mev 
y ray of 3 percent (thick target). The intensity of 
7-Mev y radiation is <1/2500 (Be 53c). That the 3-Mev 
radiation results from a cascade transition is confirmed 
by the observation of y—y coincidences (3.14.4 Mev) 
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Fic. 19. Energy levels of C*: for notation, see Fig. 1. 


by (Ue 54) who finds an energy of 3.05+0.1 Mev and a 
relative intensity of 8-12 percent (Z.=5.3 Mev, thin 
target). Comparing this intensity with the reported 
intensity of the neutron group, (Ue 54) concludes that 
most of the transitions from the 7.6-Mev level are of 
this nature (see, however, C? (æ a’) C2). [The sum of the 
y-ray energies gives the level energy as 7.57+-0.05 Mev, 
ie. 0.2 Mev unstable with respect to Be*+He*: see 
N“ (d a)C!?.] Internal (or monopole) pairs (seven cases 
versus 72 from the 4.4-Mev radiation at Ea=5.3 Mev, 
thick target), corresponding to a transition energy of 
7.0+0.6 Mev, are reported by (Ha 54a) who sug- 
gests that the 7.6-Mev level has J=0. See also Be*: 


B” (b a) Be’. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


At E,=21.7 Mev, a 15.10.5-Mev y ray is observed, 
with a yield comparable to that of the Be*(a p)C# 
process. Assuming a cross section for the (a n) reaction 
equal to the cross section for the (a p) reaction, it is 
concluded that TaK 1001, for the level in question. So 
high a y-ray yield from a level which is energetically un- 
stable to æ emission indicates the operation of a strong 
selection rule forbidding a decay. It is suggested thet - 
the level involved is the expected first T=1, J=1* state 
of C? [see C? (p p’)C2 and B"(dn)C?} (Ra 54a). See — 
also (Gu 53) and (Ca 54g). > ; ; 
UI. B°(dn)C" Qn=6472 E,=25.182 - 

The thin-target excitetion function in the range 
Ez=0.3 to 2.0 Mev shows a continuous rise, with some 

2 


a > 
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indication of a broad resenance near Ea=0.9 Mev. 
Angular distributions indicate that the stripping process 
is of considerable importance at energies >1 Mey 
(Bu 54). See also (Pa 54b). « 


IV. B°(d p)B" Qn=9.234 E,=25.182 


The differential cross section (at 170°) has been 
measured for Ea= 175 to 700 kev. The relative intensi- 
ties of four proton groups show irregularities at La= 210 
kev (C2*= 25.36 Mev) (Pa 54a). Excitation functions for 
ground-state protons at 0=0°, 90°, 135°, and y radiation 
are reported by (Bu 54) in the range Za=0.5 to 2.0 Mev. 
The curves show indications of two broad resonances, 
at Ea~1.0 and 1.5 Mev, superposed on a general rise. 
Angular distributions suggest a stripping process. The 
absolute differential cross section at Eg=1.5 Mev, 
0=90°, is 3510:mb/sterad [(Bu 54): see however, 
(Va 51a) ]. 


V. BY(d d)BY 
See B®, 


Ey= 25.182 


VI. B°(da)Be® Qm=17.809 H,=25.182 


A resonance has been observed at Ea=1.05 Mev 
(Wh 51b). 


VIL. BY(a d)C# Om= 1.348 
Qo=1.39-£0.01 Mev (Sh 53b). 


See (Cr 49c) and N“. 


VIII. (a) BY(py)C2  Omn= 15.949 
(b) BY(pa)Be® On= 8.575 E,=15.949 


The capture radiation comprises two components: 
a 16-Mey line, representing the ground-state transition, 
and a 12+4-Mev cascade through the 4.4-Mev, J=2* 
level. Resonances for the 12-Mev radiation occur at 
E,=0.16, 0.68, 1.39, 2.6(?), 3.14, 3.6, 4.95, and 5.12 
Mey, while the 16-Mev radiation exhibits resonance only 
at E,=0.16, 1.4, 2.6(?), 3.18(?), and 3.6 Mev (Hu 53, 
Co 52e, Go 53d, and Ba 54g). Reaction (b) leads to 
œ particles to the ground state of Be®(a) and to the 
2.9-Mey excited state (a). The a: particles exhibit all 
the resonances listed up to 2.6 Mev, while the ao par- 
ticles show all except the 0.68-Mev resonance [ (Be 53) 
and (Pa 53b)]. Values for the cross sections and the 
derived partial widths are listed in Table ITI (12). 
The. resonance energy and width of the first reso- 
“nancé is given as 162+1 kev, T=5.3+1 kev (Ta 46); 
162.8+-0.2 kev, T=4.5+1 kev (Mo 49a); 163.80.3, 
[T=7.340.5 kev (Hu 53d). 
The angular distribution of the 12-Mev y rays has 


-—} een studied in the range E,=0.15 to 0.18 Mev by 


(Hu 52b), from 0.17 to 0.70 Mev by (Gr 54a), from 
0.2 to 1.1 Mev by (Je 53c), from 0.6 to 2.0 Mev by 
(Gi 54a) and (Gi 54c), and ‘from 0.7 to 2.6 Mev by 
(Go 53d). At E,=0.16 Mev, the distribution has the 
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form (1-+0.25 cos’); above this energy, a (cos) term 
sets in, whose coefficient rises to a value of 0.15-0.2 at 
0.4 Mev, passes through a minimum of 0-0.1 at 0.7 Mey 
rising toa maximum of 0.4-0.5 near 1.5 Mev and vanish. 
ing again at 2.6 Mev. Various reports disagree on the 
course of the (cos’@) term, but it appears to have a min- 
imum (~0.1) near 0.7 Mev [ (Gi 54c) and (Je 53c); see 
however, (Gr 54a) | and a peak value of 0.4-0.6 near 2 
Mev, falling to a value of —0.15 at 2.6 Mev. 

The angular distribution of the 16-Mev radiation at 
E,=0.168 is given by 1—A cosłð+B cos% where 
A=0.190.06, B= A+0.02+0.04 [ (Cr 54) and Craig, 
Cross, and Jarvis, private communication]. Values 
obtained by (Gr 54a) are consistent with these and with 
a theoretical distribution assuming J=2+ for the 
E,=0.163-Mev resonance with interference from a 
J=1 resonance at E,=1.39 Mev (see, however, 
GI 52b). From £,=0.6 to 2.6 Mev, the value of A 
varies between +0.05 and —0.09, while B varies from 
0.15 to 0.3, dropping to —0.43-at 2.6 Mev [(Gi 54a), 
(Gi 54c), and (Go 53d) J. 

The a-particle angular distributions are reported by 
(Ha 39) and (Th 52c). Again, the distributions depend 
upon energy, the long-range group exhibiting a (cos@) 
term which changes sign near E,=0.16 Mev (Th 52c). 

Since the C” level corresponding to the E,=0.16-Mev 
resonance yields a particles to the /=0* ground state of 
Be’, the angular momentum and parity must be either 
even-even or odd-odd. Formation by s waves is ruled 
out by the observed anisotropy. Of the possible states 
formed by p waves, J=0* is excluded for the same 
reason, leaving J=2+. This assignment is supported by 
study of the angular correlation of 12- and 4-Mev 
y rays [(Hu 52b) and (Le 52)'], by the ao-particle 
distribution (Th 52c), and by the angular-correlation of 
a and subsequent a particles (Ge 54a). (If the reported 
proton width of 5 ev is correct, formation by d waves 
cannot be ruled out on grounds of penetrability.) The 
very small values of the particle widths for œ emission 
from the 16.10-Mev C® level suggest the operation of a 
strong selection rule. It appears probable that the level 
has T=1 and that the difference in I'ay and Ta, reflects 
a difference in the isotopic spin purity of the Be® states 
(Be 53). 

The proton width, T =150 kev, of the 16.57-Mev 
level (Z,=0.68 Mev) admits formation by either s or 
p waves. Assignment of J=2- would account for the 
absence of a and 71 from this level. The 15-ev value of 
Tv2 suggests a relatively uninhibited £1 radiation, 
and hence T=1 for the level. An assignment of 3* 
is not excluded by these measurements (Be 53). The 
angular distribution of œ particles is said to indicate 
J=2- [ (Pa 53b); see, however, (Gr 54a) ]. 

For the third resonance, E,=1.39 Mev, C?*=17.22 
Mev, the large proton width suggests s or p wave forma- 


1 The angular correlation of (Le 52) was misquoted in (Aj 52c)- 
The corrected relation should read: W(100°, 100°, g)=1 
+ (0.41+0.06) cos’ (G. M. Lewis, private communication). 
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TABLE HAN Resonances in Ae vee and SAE k 
Ep (y 6)™ G 2)5 alan)? ax ola l a r at} ad ay “eit e ¥ 
(Mev) (kev) 2 ine ? Hi s i r jd ea a ee: m ee ee ia Cis Jiu T 
Soe eS ad eS Sites pn i teal a 
0.163% > DS 152 0.2 0 <3 70 6.1 5 0.005 16.10 ao 
0.675% 322 (<2.3)! 48 (<0.2)§ (<0.04)! 600 35.4 SOS 15 S005 150 150 16.57 (27-)(1) 
1,388" 1270. 35 18 6 1.0 150 8.8 40 20 7 200 1000 NAA EEY 
2.0? 150 non res. 1.4 23 17.8 (0E) 
2.65>s 300 ` (res.) res. 2.6 10.5 18.38 (2+) 
3.147 (res.) res. 18.84 
3.67 res. res. 19,3 
4.956 non res. res. 20.49 
5.12 non res. res, 20.64 
a (Hu 53). d An SE. ee 
b (Go 53d) and (Pa 53b). 
© (Be 53 3): values for Ep =0.16 estimated from earlier work. These values are subject to considerable uncertainty. 


4 (Pa 53b); relative maximum yields: thin ta arget. 


e (Be 53). 
1 Nonresonant. 
z (Ba 54g). 


tion, although in view of the uncertainty in Tp, d wave 
is not excluded (Be 53). On the assumption of s-wave 
formation of a J=1 state (2 is excluded } by the ob- 
servation of ao), the latge width for the £1 transition 
to the ground state of C” points to T=1 (Be 53). 
However, the a-particle and y-ray angular distributions 
are consistent with J=2* [ (Pa 53b) and (Go 53d); see, 
however, (Gr 54a) ]. An assignment of O+ is suggested 
for the 17.8-Mev level (E,=2.0 Mev) on the basis of 
the intensity of the a; group. The 18.39-Mev level 
(E =2.65 Mev) is stated to be 2+ by (Go 53d). 

A search for pairs in the range E,=6.5 to 9.5 Mev 
at E,=1.7 to 4.0 Mev led to a negative result. An 
upper limit for the cross section is 0.03 ub [ Bent, Sippel, 
and Bonner, private communication; see (Ph 51b): 
Be®]. See also (Ch 53d), (Wi53a), (Wi53c), and 
(Fo 54a). 


TX. Bu(pn)C" O,=—2.762 E,=15.949 
Resonances have .been observed at E,=3.18, 3.63, 

4.06, and 4.70 Mev by (Ba 54g), and at E,=3.7, 5.18, 

5.87, and 6.37 Mev by (BI 51a). See also (Aj 52c). 


Xe, B'(p Pp) Bu 

Resonances for 2.13-Mev y radiation are observed 
at E,=2.664 Mev [T=48 kev: (Hu 53); see also 
(Ba 54g) ], and at Ep=3.15, 3.4, 3.78, 4.26, 4.68, and 
5.13 Mev [(Ba 54g): C8*=18.40, 18.84, 19.2, 19.42, 
19.86, 20.25, and 20.64 Mev]. At the first resonance, 
the cross section is 31 mb, assuming isotropic distribu- 
tion (Hu 53). 


E,=15.949 


XI. BY(p d)BY 
See B". 


Qm=— 9.233 E,=15.949 


KI. B!(dn)C®? Qm=13.724 

Neutron groups corresponding to states in C'? at 0, 
4.44, 9640.1, 10.80.1, 1110.1, 11.74+0.08, 12.76 
+£0.08, 13.21+0.05(?), 13.36£0.05(?), 14.16+0.05(?), 
15.09-0.03, 15.52+0.03(?), and 16.070.053 Mev have 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


been observed by (Jo 52a). No group corresponding to 
a state at E,~7.6 Mev is observed with an intensity 
210 percent (Jo 52a). At Ea=8.1 Mev, the angular 
distribution of the neutrons to the 4.4-Mey state, 
analyzed by stripping theory, indicates /,= 1 and there- 
fore even parity for that state, J<3 (Gi 53). Neutron 
threshold measurements show a level in C? at 15.10 
£0.02 Mev (Cook, Marion, and Bonner, private 
communication). 

At low-bombarding energies, a single y ray is ob- 
served corresponding to the decay of the 4.4-Mev state; 
see (Aj 52c). At high energies (Ey= 10.8 to 50 Mev), a 
15.2+0.2-Mev y ray is observed which is attributed to 
the ground-state transition from the first 7=1 state of 
C? at 15.09 Mev [(Co 54d), (Wa 54a), (Ra 54a): see 
C¥(p p')C¥*]. See also (Th 54d). 


XIII. B?(8-)C?  O,,= 13.370 


The half-life is 0.022-+0.002 sec (Be 39), 0.027+0.002 
sec (Je 48c), 0.0220.001 sec (Br 53d). The spectrum 
is complex. Most of the transitions go to the ground 
state of C}; however a 4+1 percent branch to a 
y-emitting level, perhaps the 4.4-Mev state, has been 
observed in a study of 8—y coincidences (Ve 52a), 
and deviations from the Fermi plot below Ea=6 Mev 
may indicate a 5 percent branch to the 7.6-Mev and 
higher excited states of C? (Ho 50). Log ff for the ground- 
state transition is 4.17; assuming a 4 percent branch 
to the 4.4-Mev state, log/t for this transition is 4.8. 
Since both branches are allowed, J=1+ for B® is 
indicated (Ve 52a). 

a Ta 2 
XIV. C®(yn)C# O,=—18.711 

Discontinuities in the yield of C"! (Ka 54a) and of 

neutrons (Go 54f) as a function of bremsstrahlung 


energy indicate levels in C”? at 19.3, 19.8, 20.1, 20.5, — 


20.7, 21.1, 21.6, 22.4, and 22.8 Mev (+0.1 Mev to 
0.2 Mev) The integrated cross sections, assuming 
sharp levels, are also given (Ka 54a). The sum of these 
integrals is 55 Mev-mb,°while the integrated (yn) 
cross section over the same region, ignoring fine struc- 


i 
1 
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ture, is given as 29 Mev-mb by (Mo 53c) and as 27 
Mev-mb by (Na54). The approximate agreement 
implies that a substantial fraction of the observed 
(y n) cross section may be attributed to discrete states 
of C. On the assumption of £1 absorption, the states 
in question have J=17 (and, presumably, T= 1) 
(Go 54f). If the fine structure is ignored, the general 
form of the cross-section curve exhibits a broad reso- 


- nance at Ey~22 Mev with a width of 3 Mev and a 


peak cross section of 8.6 mb (Na 54), 9.5 mb (Ba 54c), 
13.1 mb (Ha 51b) and (Ka 51a). 

The integrated cross section to 38 Mev is 45 Mev-mb, 
while that from 38 to 200 Mev is 15 Mev-mb (Ba 54c). 
The integrated total nuclear absorption cross section is 
~100 Mev-mb to E,=26 Mev (Ha 53d). See also 
(Na 53, Ro 53b, and Bi 54). 


XV. C27 p)BY Qm=— 15.949 

The cross section exhibits a resonance at E,=21.5 
+0.5 Mev, T= 1.70.5 Mev. The peak cross section is 
34-8 mb; the integrated cross section to 24 Mev is 
6316 Mev-mb (Ha 51f). The angular distribution of 
protons at E,=23 Mev indicates interference of a one 
percent quadrupole component with the dominant 
dipole absorption (Ha 52g). See also (St 54i). 


XVI. C2(y 3a) Qn=—7.278 


Maxima in the yield of 3-prong stars are reported at 
E,=17.3, 18.3, 21.9, 24.3, and 29.4 Mev; some evi- 
dence of fine structure is also found. The integrated 
cross section is 1.21+0.16 Mev-mb for £,<20.5 Mev, 
9.8-+-0.4 Mev-mb for 20.5<E,<42 Mev, and <0.2 
Mey-mb for 42 <E,<60 Mev [(Go 53f): 2500 stars; 
see also (Wi55)]. (Da 53d) find pronounced narrow 
peaks at H,=18 and 29 Mev. (Mi 53c) report a single 
broad peak at E,=17-18 Mev with a peak cross 
section of 0.08 mb, a width of ~4 Mev and an inte- 
grated cross section (to E,=27 Mev) of 0.84 Mev-mb. 
Neither (Da 53d) nor (Mi 53c) excludes the possibility 
of fine structure. The mean cross section for the (Li+-p) 
y rays is 0.140.026 mb (Go 53f), 0.175+0.025 mb 
(Gl 52). 

The character of the a spectrum at £,=17.6 Mev 
suggests that £2 and M1 radiation is mainly involved, 
leading to states in C? with J=2* and 1* (Te Sib). See 
also (Aj 52c) and Be®. (Wi 55) find, on the other hand, 
that above E,=15.6 Mev, the major contribution 
comes from E1 and E2 absorption, with £1 comprising 
35 percent for E,=15.6 to 17.0 Mev, 70-75 percent 


_from 17 to 20 Mev, 35 percent from 20 to 22.6 Mev, 


increasing to ~100 percent for E,>26 Mev. It is 
inted out by (Ge 53a) that the threshold for 1 ab- 
sorption (into T=1 states) should be around 15 Mev 
and that the threshold for allowed « emission to the 


_ 16.8-Mev T=1 state of Be® should be near 26 Mev. 

3 Histograms of (Wi 55) indicate that the direct three- 

body disintegration docs not occur to any appreciable 
am ; 4 CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 
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extent [see, however, (Mi 53c) and (Ch 54e)]. See algo 
(Li 53b) and (Gr 54). 


XVII. C2(e e’)C?* 


At E,=150 and 188 Mev, inelastic electron groups 
are observed corresponding to states of C? at 4.5, 7.6 
and 9.7 Mev. The intensity of the 7.6-Mey line is 
0.052 of the elastic peak and,0.125 of the Q= —4.5-Mey 
group (E,=188 Mev, 6=80°) [(Mc 54) and J. H., 
Fregeau and R. Hofstadter, private communication]. 


XVIII. (a) Cenn) 
(b) C2(n n’)3Het On=—7.277 

A gamma ray of energy 4.42+0.02 Mev is observed 
at E,=6.5 Mev [(Da53c) and R. B. Day, private 
communication]. At E,=14 Mev, gamma rays of 
energy 2.8, 4.45, 6.0, and 7.0 Mev are reported to be 
produced, with cross sections of 36, 69, 2, and 2 mb, 
respectively (Sc 53); (Th 54b) report only a 4.4-Mey 
y ray, with a cross section of about 300 mb. An inelastic 
neutron group corresponding to excitation of the 4.4- 
Mev level is reported by [(Wh 53): £,=14 Mev]; 
there is evidence for a second group corresponding to a 
level near 11.5 Mev (Wh 53). 

At high neutron energies, E,=8 to 40 Mev, a-emitting 
levels of C? are excited. A level at 9.7 Mev, with an 
observed half-width of 1.6 Mev, appears to be involved, 
and to decay mainly to the ground state of Be’. Direct 
decay from this level into 3 «œ particles occurs less than 
25 percent of the time (Ja 53). There are indications 
of the participation of another state, at ~12 Mev, 
which decays predominantly to the 2.9-Mev state of 
Be® (Li 53b). See also (Ba 53a), (Fr 53d), (On 54), and 
(Sc 54c). í 


XIX. C2(p p’)C* 


An inelastic proton group corresponding to the 4.4 
Mev state of C! has been observed at E,=7.3 Mev 
(Go 52b), 8 Mev (Ar 52e), 9.5 Mev (Bu 53b), 10 Mev 
[ (Fi 54) and (Fi54b)], and 31.5 Mev (Br 52c). Near 
E,=10 Mev, the angular distribution exhibits a mini- 
mum at 0=90° and maxima near 0=40° and 150°. 
(Bu 53b) report the distribution approximately sym- 
metric about @=90°, while (Go52b) and (Fi 54b) 
find the forward maximum somewhat higher. The 
angular distribution of 4.4-Mev y rays is of the form 
1+3.6 cos’@—3.6 cos‘@, consistent with J=2 for the 
excited state (Go 53e). The angular correlation of y rays 
and inelastic protons is reported by (Go 53). Resonances 
for production of 4.4-Mev y rays are reported at 
E,=5.37 and 5.9 Mev [(Ma 53h) and (Ma 54k)]: 
see N¥. At E,=31.5 Mev, groups corresponding 1? 
levels at 4.4, 7.5, and 9.5 Mev are observed (Br 52c). 
At E,=96 Mev, inelastic groups are observed to levels 
at 4.4, 9.6, and 20.0-+1 Mev in C? (St 54d). (There 18 
no indication of a level near 15 Mev in this work. 
From E,=30 to 340 Mev, a 15.2+0.2-Mev y tay * 
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observed which is attributed to the first T=1 state in 
Ce (Wa 54a) and (Co 54d): see also B"! (d DSA 


XX. C2(d d’)\C2* 


At Ea=10 Mev, inelastically scattered deuterons 
corresponding to a state of C!? at 4.41 Mev are ob- 
served by (Ke 51b). At Ey=19.1 Mev, an additional 
group is observed corresponding to the formation of the 
9.6-Mey state; angular distributions of the groups are 
also shown (Fr 54e). 


XXI. (a) C2(ea’)C2* 
(b) C2(aan)CH 


At E«=22 Mev, inelastic groups corresponding to 
levels at 4.4, 7.65-£0.07, 9.6, and, possibly, 12.7 Mev 
are observed. C?* recoils have been observed corre- 
sponding to the ground state and that at 4.4 Mev, but 
not to the 7.6-Mev state; it is believed that this state 
yields. Be*+-He' with >80 percent probability [see, 
however, Be®(a )C* ] | (Mi 54c) and V. K. Rasmussen, 
private communication ]. For reaction (b), see (Li 53c). 


On=—18.711 


ce 


XXII. C8(dC? 0,=1.309 


The weighted mean of two Q-value determinations 
is 1.310+0.003 Mev (Va54). Absolute differential 
cross sections have been measured for the ground-state 
tritons from 0=5.8° to 137° at Eg=3.29 Mev and 2.19 
Mev. At the higher energy, the distribution agrees well 
with that expected from pickup theory, with /,=1, 
R=6.25X10- cm; discrepancies at the lower energy 
are attributed to effects of the Coulomb barrier (Ho 54c). 


XXII. N?(6+)C? On=17.7 


The decayeis mainly to the ground state via an al- 
lowed transition. Delayed a particles with a total energy 
of ~4 Mev are also.observed, suggesting that a state of 
C! in the region 11 to 12 Mev is involved (Al 50g). 


XXIV. N“ (nC? Qn=—4.007 
See N5 and (Li 52). 


XXV. N¥(da)C® Q,=13.570 

Alpha particles have been observed corresponding to 
levels of C2 at 4.431-0.013 (Ma51), 7.68+0.03 
[(Du 53) and (Sp 53c)], 9.61340.012 Mev (Ma 51). 
The intensity of the group corresponding to the 7.68- 
Mev state is 6 percent of the intensity of the group 
corresponding to the first excited state at Ea=0.62 Mev, 
6=90°: the width is <25 kev. No other group of in- 
tensity >1 percent appears for E.=3.1 to 7.4 Mev 
(Du 53). The 9.6-Mev state has a width >10 kev 
(Ma 51). The angular correlation of a particles and 
4.4-Mev y rays at Ea=0.63 Mev indicates J =2* for 
the 4.4,Mev state [(St 54f: see also 0*6]. A small yield 
of 15-Mey y radiation is reported at Ea= 10.8 Mev 
[(Ra 54a): see Be? (e n)C”]. See also (Th 54d). 
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XXVI. Ni*(pa)C? On=4.961 

The weighted mean of seven Q-value determinations 
is 4.961+0.003 Mev (Va 54). This includes a recent 
value of 4.962-0.004 Mev obtained by (Co 53; mag, 
spectrometer). 

Alpha particles have been observed to a state of 
C”? at 4.432+0.010 Mev (Sc 52). The y-ray energy, 
after a Doppler correction of 20 kev, is 4.443-+0.020 
Mev. The necessity for the correction implies a lifetime. 
for the 4.4-Mev state <3 10-7" sec. This upper limit is 
consistent with 2 decay of the 4.4-Mev level to the 
ground state and thus with J=2* (Th 52). The augular 
distributions of short-range alpha particles and 4.4-Mev 
y radiation indicate that the 4.4-Mev state has J=2+ 
or >4 (Kr 53). See also O". 


XXVII. (a) O8(ya)C?. On=— 7.149 

(b) O8 (74a) = On= — 147426 
There is evidence of the involvement of C” states at 
9.6 and ~11 Mey, which decay to the ground state of 
Be’, a state at 12-13 Mev, decaying mainly to the 2.9- 
Mev Be? state, and T=1 states at ~16 Mev, again 
leading mainly to the 2.9-Mev Be? state. The 4.4- and 
7.7-Mev states of C? seem to occur rarely, if at all 


(see O14). 
NY 
(Not illustrated) 
I. N2(6*)\C2 Qn=17.7 


The half-life is 0.0125-40.001 sec; E,(max)=16.6 
+0.2 Mev (Al 49a). The decay is complex; N”? decays 
both to the ground state and to @ unstable excited 
states of C? (Al 50g). Log ft=4.18 for the ground-state 
transition [ (Fe 51b); see also (Ki 52) ]. 


I. C2(pn)N? On=—18.5 
Exhresn, = 20.040.1 Mev (AI 49a). 


TI. N“ (y 2n)N® On=—30.9 

See N“. 

The following reactions leading to N® are not re- 
ported: B'’(He?)N® (O,=1.3), C?(He? tN? 
(OQm=— 17.7), N! (p NE (On=— 22.4). 

B! 
(Not illustrated) 
° D 


Mass of B!? 


(Ba 39e) computes a mass of 13.0207 amu for B”: 
mass defect=19.3 Mev. A rough interpolation from 
other A suggests that the first T=4 state in C¥ is- 
located ~13.5 Mev above the ground state. This means 
B¥—C%~12 Mev: mass defect=19+2 Mey. Assuming 
this value, B would then be stable by ~6 Mev to 
decay into B?--n, and by ~12 Mey to decay into 


2 
3 = 
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Be- and Li’-++a. (Hu 53a) have attempted to detect 
B® as a delayed neutron emitter with 5X10 sec 
<7;<1800 sec formed in spallation reactions. The re- 
sults were negative as were those of (Sh52) who 
searched for B! as a spallation product of 50 Mev 
bremsstrahlung irradiation. See also (Sn 48). 

The following reactions leading to B® are not re- 
ported: B'(@ p)B® (Qn=1), C(u p)B™ (Qn=—11), 
G2 He)BE (On — 12), C4%@aBe (,=—17), 
Cild He)BY (Qn=—14), CHU a)B¥ (Qn=0), 
NS (n He®)B® (0,,= — 22), CH (y p)B® (Qn=— 20). 


Cis 
(Fig. 20) 
I. Be(an)C? On=5.708 E= 10.656 


Resonances observed for E.=0 to 5.3 Mev are ex- 
hibited in Table I(13) [(Ha 49i), (Ta 53), (Be 54), and 
(Tr 54) ]. See also (Aj 52c) and C”. 


TABLE I(13). Resonances in Be? (a n)C”. 


Ea Ea 
(Mev) (Mev) Cue J, n° References 

0.53 0.53 11.02 (Be 54) 

0.61 0.61 11.08 (G+) (Be 54) 

1.91 1.9 11.98 (3) (Be 54), (Tr 54), and (Ta 53) 
2.24 12.21 (Tr 54) 

2.58 2.65 12.46 (Tr 54) and (Ta 53) 

3.40 13.01 (Tr 54) 
(4.4) (13.7) (Ha 49i) 


a Resonances in neutron yield. 

b Resonances in the 4.2 +0.2 Mev y-ray yield. 

¢ James, Jones, and Wilkinson, private communication; J, r from angular 
distributions of ground-state neutrons. The resonances appear to be super- 
posed on a broad J =7/2- background. 


TI. Be(e p)B? Qm=—6.880 E= 10.656 


See B”. 
MI. Be(« d)B" 
See BY. 


Qm=—8.016 E,=10.656 


IV. (a) Be?(aa’)Be* 
(b) Be(aa’n)Bee On=—1.666 
(c) Be(a2)3Het Qm=— 1.570 


For reaction (a), see Be®. For reactions (b) and (c), 
see (Aj 52c). 


E= 10.656 


V. BY(t a) Be? QOm=13.217 E,=23.873 
See Be. 
VIL. BY (e p)C# Om=4.071 


- The weighted mean of six Q-value measurements is 
4.08+0.10 Mev (Va 54). This does not include a recent 


7; —=yalue of 4.064+0.011 Mev by [(Fa 54d) and W. J. 


Fader, (private communication; mag. spectrograph) ]. 
Four proton groups are observed, corresponding to 
C! levels at 0, 3.09, 3.68, and 3.85 Mev [(Sh 53b), 
fa 54a), and (Fa 54d) ]. The relative intensities depend 
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strongly on bombarding energy (see N"), but generally 
the group to the 3.86-Mev level is the strongest, by a 
factor of 5 or more (Sh 53b). Upper limits for an earlier 
reported group corresponding to a level near 0.7 Mey 
are, relative to the ground-state group: 2 percent at 


Ea«=1.14 Mev, 0.5 percent at H,=1.54 Mev (Ma 53f),: 


7 percent at H,=1.64 Mev (Sh53b), 1 percent at 
Ea=4.8 and 5.8 Mev (Fa 54d). A limit for a 4.6-Mey 
level of 1 percent of the group leading to the 3.89-Mey 
state is given by (Fa 54d). 

Gamma rays of energy 3.730.06 and 0.21-40.03 
Mev are observed, the latter arising from the transition 
between the 3.86- and 3.68-Mev levels. It is estimated 
that the ratio of direct ground-state transitions and 
0.2-Mev cascades from the 3.86-Mev state is 7:3, and 
it is suggested from the large intensity that the soft 
radiation is £1 [ (Sh 53b) and (St 54c) ]. The observed 
angular distributions confirm this assignment (St 54c). 
The angular distributions and p—y correlations for the 
higher-energy radiation contain terms in (cos‘#), in- 
dicating J=5/2+ for the 3.86-Mev level (4 =3/2+ or 
5/2+ from C(d p)C#]. The distributions favor an 
M1 assignment for the 3.68-Mey radiation (St 54c). 

Detailed study of proton angular distributions con- 
firms the assignments J=} and }* for the ground and 
3.1-Mev states, and permits the selection J=5/2* for 
the 3.86-Mev state from the alternatives given by 
C#(d p)C8. An assignment of J=$-, rather than 3-, 
for the 3.68-Mev state follows from the observation of 
E1 cascade radiation (Sh 53b). 


VII. BY(dn)C?  Qn=13.723  E,= 18.671 


The neutron yield curve has been measured for 
E4=0.2 to 2.0 Mev while the angular distributions 
have been observed at 0.71, 1.00, and 1.5y Mev. Even 
at these low energies, the contribution of the stripping 
process is conspicuous (Bu 54). See also C®. 


VIII. BU(d p)B? Qn=1.136 E,=18.671 


The thin-target yield rises smoothly from E4=0.3 to 
1.8 Mev with no evidence of resonances. At Ea= 1.47 
Mev, the cross section is 4 mb (Hu 49j). See also BY. 


IX. BU(dd)B" 
See (Va 51). 


E,= 18.671 


X. BU a)Be® 
See (Va 51). 


Qm=8.015 E,=18.671 


XI. C®2@7)C® On=4.948 
Qo=4.949+-0.006 (Ki 53b; pair spectrometer) 

The thermal capture cross section in graphite is 4.7 
mb (Wa 50e). In addition to the 4.95-Mev ground-state 
transition, a y ray is reported with an energy of 3.68 
-£0.05 Mev and an intensity of 0.3 y/capture. If 3.1- 
and 3.9-Mev y rays occur, their intensities are less than 
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0.10 and 0.06 y/capture, respectively (Ba 53d). See also 
(Th 52b). 


RU Ca n')C?* © Ey=4.948 
C2(n n)3 Het On=—7.277 


Both y- and a-emitting states of C! are excited. 
At E,=14 Mev, the cross section for production of 
inelastic neutrons in the range H,=0 to 11.5 Mev is 
76040 mb (Ph 52b), in the range H,=0.5 to 12 Mev, 
é=520+:200 mb (Gr 53d). The cross section for pro- 
duction of y rays is ~300 mb (Th 54b), 190 mb (Sc 53); 
the cross section for a-particle production is 135-17 mb 
(Hu 54a). 


sanii (CADO E= 4.948 


The epithermal scattering cross section is 4.70=0.05 
bn [free atoms: (Hu 52e)]. See also (Hu 53c) and 
(Hu 54a). No resonances with width >5 kev appear in 
the scattering cross section in the range E,=20 to 
1360 kev (Mi 50c). In that region, the cross section 
decreases monotonically from 4.8 bn at 20 kev to 2.4 bn 
at 1360 kev [(Mi 50c); see also (Ki 53) and (Hu 54a) ]. 
The course of the cross section in this range can be 
accounted for by the broad s state at 3.09-Mev excita- 
tion (Th 52b). The nonresonant scattering at H,=0.5 
and 1.5 Mev is nearly isotropic (Wi 54). A resonance, 
<10 kev wide, is located at 2.08 Mev, o=2.7 b, 
corresponding to a level at 6.87 Mev in C with J>$ 
(Bo 5ic). The angular distribution of scattered neutrons 
at this resonance indicates d-wave scattering, J=3/2* 
or 5/2+ (Ri 53b). 

A structure at 3.0 Mev [(Bo 51c) and (Ri 51e)] is 
attributed to strong interference between a resonance 
at that energy with J=$ and either potential scattering 
or another resonance, possibly that at 3.5 Mev [ (Ri 50d) 
and (Ri 5ic)], of the same spin. The angular distribu- 
tions have been measured in the range E,=2.6 to 
4.15 Mev by (Hu 52d), who find, by a partial wave 
analysis, a D} resonance at E,=2.95 Mev and a S; 
resonance at E,=3.04 Mev. The rise in cross section 
at 3.5 Mev does not appear to be a resonance (Hu 52d).§ 
Another maximum appears at E,=4.4 Mey (Fr 50b). 
Accurate cross sections in this region are quoted by 
(La 50a). At higher energies, the cross-section curve 
shows structure at E,=6.3 Mev (c=2.8 b, T'~0.1 

Mey), 6.7(?), 7.25 (c=1.8 b, '~0.2 Mey) (Hu 53c) 
and ~8 Mev (broad) (Ne 52). The average total cross 
section varies from 1.1 b at E,=9 Mev to 1.4 b at 
E,=:; 14 Mev (Ne 54c). At 14 Mev, the total cross sec- 
ion iż 1.279+0.004 b (Po 52a), 1.32-+0.02 b (Co 52h), 
1.20+0.04 b (Go 52d), 1.29+0.02 b (Co 54f). From 
E, = 14 to 20 Mev, the cross section remains essentially 
constant [(Co54f) and (Da53f)]. At 20.07 Mev, 
—=3,= 1,52+0.05 b. Total cross sections for Z,=39.6 to 
Mev are listed in (Hu 54a). See also (Re 52a), 
3d), Ri 53c), and (Hi 54d). 
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NIV. C?(n 2n)C® Qn=—18.711 Ly, =4.948 


The cross section for C™ production is 5+1 mb at 
24.5-2 Mev and 101.5 mb at 27.5£0.5 Mev (Br 52e), 


XV. C2(dp)C® Qn=2.723 


The weighted mean of nine Q-value determinations is 
2.722-£0.003 Mev (Va 54); this includes recent values 
of 2.722+-0.004 Mev by (Fa 53; mag. spectrometer), 
and 2.720-+0.003 Mev by (El 54; mag. spectrometer). 

Measurements on the proton groups are summarized 
in Table II(13). A number of additional groups are 
reported by (Ca 52a), of which the most prominent 
appear to correspond to levels at 6.92+0.08, 7.58+0.08 
7.75+0.08, 7.87=0.07, 8.350.08, and 8.55-40.07 Mev 
(Ca 52a). The level assignments in Table II(13) were 
obtained by analysis of angular distributions by means 
of stripping theory. Absolute differential cross sections 
for the ground-state protons have been measured at 
Eq=3. 29 Mev by (Ho 54c). While the general shape of 
the distribution is consistent with the assumption of a 
stripping process as the dominant factor, discrepancies 
in detail suggest that compound nucleus formation also 
plays a role at this energy (Ho 54c). See also N". A 
careful search with Ea=5 to 8.5 Mev (@=90°) re- 
veals no further proton groups corresponding to levels 
in the range 0 to 4.9 Mev with intensity greater than 0.5 
percent of the ground-state group (Sp 54b). 

Gamma rays observed by (Ma 53e), (Th52), 
(Be 54a), and Bent, Sippel, and Bonner (private com- 
munication) are listed in Table III(13). Internal pair 
studies of the ground-state y transitions from the 3.84- 
and 3.68-Mev levels appear most nearly consistent 
with an M2 or E3 assignment for the higher transition 
(E1 is excluded, £2 or M1 are possible) and either £2 
or E1 for the lower transition (M1 not excluded). This 
information, together with the implication from the 
large intensity of the 0.17-Mev radiation that this is a 
lower multipole transition, supports selection of the 
values J=5/2+ and 3/2- for the 3.85- and 3.69-Mev 
states from the alternatives offered by the stripping 
analysis (Ma 53e; see also B!(a p)C"). The internal 
conversion coefficient of the 0.17-Mev cascade radiation 
is consistent with an E1 transition (R. J. Mackin, 
private communication). At Hz=2.4 Mev, the upper 
limit of the cascade transitions of the 3.84- and 3.68-Mev 


$A - 495, 79") 
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Fic. 21. Gamma-ray transitions in C13: for notation, see Fig. 13. 
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TABLE II(13). Levels of C? from C#2(d p)C”. 


Ciu* Ci ie “| 3% 
(Mev)a (Mev)> (Mev): (Mev)4 Tent 
0 0 0. o A 
_ 3.086+-0.006 3.0904-0.015 3.107 3.11 a i 
3.686+0.011 3.684+0.015 3.699 3.683 1/23/25 
3.855+0.015 3.869 3.884 


3/2*, 5/2* 


See Rees: C(St 51) and (Va 51a)). 

o 8.5 Mev, 6=90° [éSp 53b) and (Sp 54b)]. 
40 Mev (Kh 53). Lier seh PEG 

< Mev, 0 =15° to 160° [(Ro 51j) and (Ro Sik)]. 
e (Ro 51j), (Ro 51k), (Bl 52), and (Ca 53f). 


states to the 3.10-Mev level is 0.2 107° y/d or 3 per- 
cent of the ground-state transitions (Ma 53e). 

The lifetime of the 3.08-Mev state is <3 10-* sec. 
The pair spectrum indicates an £1 transition (Th 52). 
Branching ratios of the y radiation for the low levels of 
C”, as derived from the present reaction, from B" (a p)C” 
and from C!(7-7)C# are exhibited in Fig. 21. The re- 
duced widths of the ground, 3.1- and 3.86-Mev states 
are ~0. to 0.5 of the single-particle value, while that 
of the 3.68-Mev state is much smaller [see (Fu 54), 
(La 53b), (La 54), (La 54b), and (In 53)]. See also 
(BI 53), (Cu 53), and (Th 54d). 


XVI. C#(yn)C® Qm=— 4.948 
See (Se 49a) and (Go 53f). 


XVII. C®8(va)Be? Q,,=— 10.656 
See (Mi 53c). 


XVII. C¥(p p’)C* 

At E,=7. Mev, 9=90°, proton groups correspond- 
ing to levels at 3.14 and 4.03 Mev are observed (Co 52g). 
(This is the only one of the observed reactions which 
can directly lead to T=$ states in C’*.) 

XIX. N¥(67)C8 Qn= 2.221 

See N”. 


XX. N4(@zd)C® Qn=—5.316 
See (Li 52). 


Taste II1(13). Gamma radiation from C#(d p)C™. 


Ey” Ex’ Yield : 

(Mev) (Mev) 10-%/d Reference 
3.843-+0.014 3.838 4.3¢ (Ma 53e) 
3.86 +0.02 3.84 +0.03 4,04 (d) 
0.170-0.001 ~2° (Ma 53e): cascade 
3.684+0.020 3.677 5.8° (Ma 53e) 
3.76 +0.02 3.74 +0.03 3.54 (d) 

5 .082+0.007 13.2° (Th 52) 

aag : 62° (Be 54a) 


a Uncorrected for Doppler shift. 


b ler shift. DE ; 
2 SOR ee pore Mackin (private communication): E4 =2.4 Mev. 


d (Be 54a) and Bent, Sippel, and Bonner (private communication): 


Ea=4.0 Mev. 
e (Th 52): Ea =1.46 Mev. 
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XXI NU(da)C®8 0,= 7.684 


Qo=7.6814-0.006 (Ma 51; mag spectrometer}. Alpha 
groups are observed corresponding to states in C™ at 
3.083-40.005 and 3.677-£0.005 Mev (Ma 51). 


XXIL. Of a)C* O,=—2.201 | 
Oo= — 2.38-+0.16 [ (Hu 51f) ; ion chamber]. 
At 14.1 Mev, alpha tracks corresponding to levels of 
C at 3.0 and 3.7 Mev have been observed by (Li 52). 
See also OF, 


NY 
(Fig. 22) 

I. N®@*)C? On=2.221 

The half-life is 10.05+0.03 min {Ch 53c), 10.05=-0.1 
min (Ho 50), 10.13-40.1 min (Si 45a), 9.93+-0.03 min 
(Wa 39), Eg(max)=1.2024+0.005 Mev (Ho 50). The 
positron spectrum is simple: the Fermi plot is straight 
down to <150 kev. Logft=3.67 (Fe 51b). See also 
(La 54b). 


Il. B"(He? a) B Om=12.130 E,=21.634 


See B®. 

HI. B! (a n) N!’ On= 1.068 
See N“. 

IV. (a) C2(py)N@ On=1.945 


(b) C2 yp) Ce 

Observed resonances are exhibited in Table IV (13). 
The reduced width for the 0.46-Mev resonance is of the 
order of the Wigner limit, even for formation by s waves. 
This large width and the difference in boundary condi- 
tions at the nuclear surface account for the displacement 
between the 2.37-Mev state of N” and the 3.10-Mev, 
J=3*, state of C”? [(Th 52b) and (Eh 51) |. Considera- 
tion of the cross section at E,~100 kev (Fo 51) and of 
the capture cross section in C!?(n y)C™ leads to an 
estimate of the reduced width of 2 percent of the sum- 
rule limit for the ground states of C’ and N” [ (Th 52b); 
also see (Fu 54) |. 

-The angular distribution of the y radiation of the 
1.70-Mev resonance (ground-state transition) has the 
form 1+A cos%, where A varies with energy (Da 51 
and Da 51b). 

In the range E,= 1.2 to2.5 Mev, a y ray whose energy 
varies with bombarding energy as E,=0.88 (E3—0.43) 
is observed. The radiation represents a transition to the 
2.37-Mev level (E,=0.46 Mev) which then decays” 
preferentially by proton emjssion. Analysis of excita- 
tion functions at @=0° and 90° indicates that inter- 
ference between direct p wave capture from the con- 
tinuum and resonant capture at E,=1.70 Mev are 
involved. At resonance, the relative amplitude of the | 
resonant and nonresonant y rays is 1.10.1, and the- A 


a ~ 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


122 F. AJZENBERG AND T. LAURITSEN 
may 
wa 
N?+n ; 
18.431 | 4 
Cl''+d 
10.021 loime 
2.504 | N"“+He™a B'+He*-n 


p, yp’ (90°) 


Fic. 22. Energy levels of N8; 
for notation, see i'ig. 1. 


1.945 ‘ilps 
2 
CD a lets 
Za B’°+a-n 
Zn 3.236 
- TThresh. 
72.239 
-3003 C+ Het 
N's p-d 12 254 C?+p-n -3.549 
-4.288 C!'%+ He*-d 
N'4d-t 


relative phase, excluding the resonance phase, is 
214°+10°. The absolute yield indicates a width I, of 
0.04 ev for the transition from the 3.51- to the 2.37-Mev 
level (Wo 54). At E,=1.57 and 1.68 Mev, the distribu- 
tion for reaction (b) has the form (0.14+0.03)+sin?ð 


TABLE IV(13). Resonances in C#(p y)N". 


Ores wr 
<= (head (kev) (ub) (ev) 72/3 (h?/pa) NI* Reference 
456.8+0.5 39.5+-1.0 2.37 (Hu 53d) 
45642 35 ee A49b) 
450 35 127 0.67 0.5* Se 51c) and 
a (Se 51e) 
1697412 _ 74+9 3.51 (Va 49) 
16955 702-10 35 1.39 0.04%:> ene ma 
` 2 e e 


Th 52b): a =4.9 X107 cm. 


ssuming incoming p waves. . 


e 3 CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


(Heiberg, Alexander, and James, private communica- 
tion: thick natural target). See also (Wi 53c). 


V. Ce n)N? 
See N”. 


VI. C2(p pn)C! @Qm=— 18.711 E=1.945 
See (Ch 47c and Mc 48). 


VII. C2(p p)C® Ey=1.945 


The elastic scattering exhibits anomalies at E,=0.46 
Mev [ (Ja 53c) and (Mi 54)], 1.7 and 1.75 Mev (Ja 530) 
Further structure is reported by (Ma 53h) at Ep=3.2, 
4.8, 5.37, and 5.9 Mev: (Ja 53c), however, find only 4 
smooth variation in the range E,=2.0 to 4.4 Mev. 
Angular distributions are reported by -(Ja 53c) and 
(Mi 54). 


Om=—18.5  Ey=1.945 


9 
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ABLE V(13), Resonances in C2(p p)C®. 


Ep* Ta Nb 

(kev) (kev) +2/](h2/pa)> (Mev) J, x Reference 
402 35 “(Mi 54) 
46143 34 0.54 2.369 1/2* (Ja 53b 
1.698 .60 0031 3.51 3/2- Ha 53b) 
1.748 66 0.21 3.5 5/2* (Ja 53b) 
3.2(?) 4.9(?) (Ma 53h) 
4.8(?) 6.4(?) (Ma 53h) 
5.37 * 6.90 (Ma 53h) 
5.90 7.40 (Ma 53h) 


a Laboratory system. 
b a =4.77 X1078 cm. 


The s-wave phase shift changes by nearly 180° in the 
neighborhood of the 0.46-Mev resonance and thereafter 
remains constant to Ep=1.3 Mev; a further change of 
8° in the range 1.4 to 3.6 Mev is indicated by the ex- 
perimental data and is attributed to a broad SS level at 
higher energy. Detailed analysis confirms earlier assign- 
ments of [=%4*, $-, and 5/2* for the three levels of N¥ 
at 2.37, 3.51, and 3.56 Mev, and leads to the level 
parameters exhibited in Table V(13). The great breadth 
of the two even parity levels suggests single-particle 
excitation (Ja 53b). 

At E,=5.37 and 5.9 Mev, strong resonances for 
4.4-Mev y radiation are attributed to inelastic excita- 
tion of the first level of C!. Cross sections for y radia- 
tion measured at 0= 105° are 6.343 and 3.01.6 mb/ 
ster, respectively, at the two resonances [ (Ma 53h) and 
(Ma 54k): see also (BI 53a) ]. 

Angular distributions of elastic and inelastic protons 
(see C?) have been measured at E,=9.5 Mev by 
(Bu 53b), at 10 Mev by (Fi54b), at 30.6 Mev by 
(Wr 53a), and at 31.5 Mev by (Br 52c). Characteristic 
maxima ané minima are observed, indicating inter- 
ference between Coulomb and nuclear scattering. See 
also (Co 54a). 


VIII. C:(p d)C# 
See (Br 52c). 


Qm=— 16.486 E,=1.945 


IX. C2(p a)B? Ey= 1.945 


See B®. 


On=—7.559 


X. C2(dn)N® On= 0281 
Qo= —0.281-£0.003 Mev [ (Bo 49c); neutron threshold J. 


Neutron groups have been observed corresponding 
to excited states of N? at 2.29+0.12 (Gr 49a), 2.38 
+0.05 Mev (Mi 53) and 3.480.12 (Gr 49a), 3.53£0.05 
Mev (Mi 53). At Ea=8 Mey, the angular distributions 
of the three neutron groups have been studied by 
(Mi 53); considerable deviations from the theoretical 


curves are noted. 


XL C2(He d)N® Qn=—3.549 
See (Fr 52e). 
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XIL C8(pn)N®  Qn=—3.003 
Qo= —3.003-£0.003 Mev [ (Ri 50e) ; neutron threshold}. 
[This is one of only twp observed reactions (see Sec. ` 
XIII) which can directly lead to T=} states in N®] 
See also N". 
XI. CPH YN" O,=—2.239 
See (Fr 52e). 


XIV. N“Gn)NB  On=—10.545 
See (Bi 54) and (Ho 50b). 


XV. N¥(p d) NB On= — 8.320 


At Ep=18.7 Mev, the angular distribution of the 
ground-state deuterons, analyzed by pickup theory, 
indicates /,=1 and therefore opposite parities for the 
ground states of N? and N". No group corresponding 
to the first excited state was found; see N™ (St 54a). 


XVI. N¥G@)N® On=—4.288 
See (Bo 42). 
cu 
(Fig. 23) 
L CHIN On =0.155 


The mean of reported end points is 15541 kev 
(Li 51a). A recent value is 1551.5 kev (La 53c). The 
half-life is 5568+30 years (An 51), 5400-200 years 
(Ma 51e), 5900-250 years (Ca 54f): log /t=9.03. The 
spectrum does not appear to deviate from the allowed 
shape down to <3 kev (Mo 54e, E. Jensen, private com- 
munication, La 53c, and Wu 53). 

Since the parities of the ground state of CH and N" 
are both even, it is not obvious why the half-life should 
be so large. It may be that a strong L forbiddeness is 


= 
~ 


(pS 


Fic. 23. Energy levels of C¥: for notation, see Fig. 1. 
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involved (transition 4S to-3D1) [see e.g. (No 51) and 
(Ge 51); see, however, (Me 52) and (St 54a) ]. With 
tensor forces of appropriate magiitude, the calculated 
ft value may be made as large as necessary (Ja 54a). 
See also (Ki 52), (In 53), (Fe 53), and (Mi 53b). 


II. B'(e p)C“ Qn =0.781 
See (Fr 51c), (Ma 53f), and N”. 


Om= 4.635 
See (Po 51). 
IV. C®(ny)C# Qn =8.169 


The thermal cross section is 0.9-+0.2 mb (He 54d). 
See also (Br 53g) and (Wi 53f). 


V. C8(1)C¥ 
See (Ko 52). 


Ey,=8.169 


VI. C8(za)Be® Qm=— 3.845 E,=8.169 
See (Sa 54a) and (Hu 47). 


VII. C8(d ~)C# Qn=5.944 

The weighted mean of six Q values is 5.9423-0.004 
(Va 54). A recent determination gives 5.9423-0.010 Mev 
[ (Sp 54b); mag. spectrometer ]. 

At Ez=5 to 8.5 Mev, 6=90°, proton groups are ob- 
served corresponding to states of C! at 6.0910.015, 
6.7233-0.015, and 6.894+0.015 Mev. In the region 0 to 
8.1 Mev in C* no other levels have been observed with 
an intensity greater than 0.25 of the group correspond- 
ing to the 6.1-Mev state (Sp 54b). At Ea=4 Mev, the 
angular distribution of the ground-state protons, 
analyzed by stripping theory, indicates /,=1 and there- 
fore even parity for the C“ ground state, J <2 (known 
to be J=0) [ (Br 52), see also (Be 53d) and (Ko 53a) ], 
while the angular distribution of the protons to the 
6.1-Mev state may indicate odd parity, J=0 or 1 for 
that state (Be 53d). 


TABLE 1(14). Gamma rays from C¥(d p)C. 


Ey* Ey. Yield 
(Mev) (Mev) 10-*y/d Reference 
0.811=-0.003 1.4 g 
6.110=-0.030 6.080 2.30 © 
6.119-£0.025 - z 
o 614 +0.03 6.11 £0.04 2.34 f 
6.14 +0.03 6.10+0.04 3.8° f 
6.730-+0.040 2 
6.73 +0.03 6.70=-0.04 0.384 f 
6.72 +0.03 6.48+0.04 2.4° f 


rrected for Doppler shift. 
d for Doppler shift. 
d¢=1.21 Mev. 


Ti 
, and Bonner (private communication). 
Ea=25 Mev. 


é 
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Observed y radiation assigned to C is exhibited in 
Table I(14). The internally-formed positron distribu- 
tion suggests that the 6.10-Mev line is £1 in character 
and hence that the level has J/=1-; E2 and M1 cannot 
be excluded, however. It appears probable that this level 
is the T,=1 component of the 8.06-Mev, J=1-, level] 
of N" (Th 52). The 0.811-Mev radiation is attributed 
to a cascade transition from the 6.894-Mev levei to that 
at 6.091 Mev. The absence of the direct ground-state 
transition can be accounted for if the level has zero 
spin. In this case an association with the 8.70 Mey 
J=0-, level of N“ is reasonable [(Th 52) and R. i 
Mackin and W. R. Mills, private communication. ] 
That both the 6.1- and 6.9-Mey levels show a consider- 
able shift with respect to their N" counterparts is not 
surprising in view of the great reduced widths [see 
C8(p p)C'*]. The 6.72-Mev level has about the right 
energy to match the narrow 9.17, J=2-, level of N“. 


VII. N“(n p)C¥ Qm=0.627 « 


The weighted mean of eight Q-value determinations 
is 0.624+-0.004 Mev (Va 54). For angular distributions 
see N!®. See also (St 51b) and (Li 52). 


IX. O%(1a)C# Qn = 1.825 


The isotopic cross section is 0.46-+0.11 b for thermal 
neutrons (Ma 47). 


Nw 
(Fig. 24) 
I. B8a@n)N® QOn=1.068 E,=11.613 


Resonances reported by (Sh 53b) and (Sh 54f) occur 
at Ba=1.51, 1.64, 2.16, and 2.27 Mev [see Table I1(14) ]. 
The reduced neutron width at the 1.51-Mev resonance 
is 0.1750.012 of the reduced proton width, while at 
the 2.16-Mev resonance the ratio is 0.90.3. Angular 
distributions have been measured at Z,=1.51 Mev 
(1—0.9 cos’) and 2.16 Mev (1—(0.5-0.1)cos’é) 
(Sh 54f). See also BY (a p)C#. i 


Il. B°(@p)C® On=4.071 E,=11.613 


Observed resonances in the yields of y rays (from. 
C8*) and of various proton groups are listed in Table 
I1(14). Studies of the angular distributions of protons 
lead to assignment of 4-, 37, 4+, 4-, and 4+, respectively, 
for N“ Jevels at 12.42, 12.69, 12.79, 12.82, and 12.92 
Mev (Sh 53b). The first three are confirmed by the y-ray 
angular distributions and (p—y) correlations (St 54c). 
In the cases of the 12.69- and 12.79-Mev levels 
(Ea= 1.51 and 1.64 Mev), the reduced widths for ground- 
state protons are relatively small [(Sh 53b); see also 
(Sh 54f)]. Angular distributions of the protons at the 
1.51- and 2.16-Mev resonances are identical with those 
of the neutrons at the same resonances; see (Sh 54f) and 
BY (a n)N®. 


Se 


to 
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TABLE IT(14). Resonances in B°-+a. 


Ea Te.m. Outgoing® particle or T: 
(Mev) (kev) (x) - (mb) (kev)  y:2/3 (?/pa)t Nite J, xe Reference 
0.95 » Po 12.29 (Ma 53f) 
1.09 Po (Ma 53f) 
1.13+-0.01 4344 Po ps, d 12.42 4- (Sh 53b), (St 54c) 
1.20 Po (Ma 53f) 
1.24+-0.01 36225 po 12.50 (Sh 53b) 
1.39+-0.01 S05 Pry Ps 12.61 
1.51-£0.01 144 a 1.7 6.0 12.69 3z (Sh 53b), (St 54c), (Ta 53) 
po 4.7 0.62 0.012 : 
pi 13 0.17 0.29 
ps 5.3 0.70 0.31 
p3 42 5.6 0.47 
d 7.0 0.93 0.26 
n 32 4.3 0.19 
1.64+0.01 1444 a 1.0 8.2 12.79 4+ (Sh 53b), (St 54c), (Ta 53) 
Po 0.98 0.18 0.012 
pr 0.46 0.085 2.7 
pe 2.4 0.44 3.0 
Ps 52 9.6 4.9 
d 11 2.0 3.9 
n 3.2 0.59 0.16 
1.68+-0.01 52 bi, P2, Ps, d 12.82 4- (Ta 53), (Sh 53b), (St 54c) 
1.83--0.01 21+4 Po, pi, £2, Pa, d 12.92 4+ (Ta 53), (Sh 53b), (St 54c) 
2.16 sharp bo, Y, n 13.16 (Tar53), (Sh 54f) , 
2.27 broad Po, yn 13.24 (Ta 53), (Sh 54f) 
2.95 broad Po, y," 13.72 (Ta 53) 


a po, T pz, correspond, respectively, to protons going to the ground state, the 3.1-, 3.7-, and 3.9-Mev states of C8, 


ba=1.45 (Ai!3+A 2/3) X10733 cm. 
e From angular distribution measurements: (Sh 53b) and (St 54c). 


TH. BY@d)C? On=1.348 E,=11.613 


Observed resonances are exhibited in Table II(14) 
(Sh 53b). 


IV. Bi@n)N¥  Qm=0.154 


A y ray of 2.2+0.2-Mev energy is observed at 
E«=5.3 Mev. See (Be 50h) and (St 39). 


V. (a) C®(dn)N® O,,=—0.281 E,=10.264 
(b) C2 p)C® 0,=2.723 

In the range E4=1.86 to 3.45 Mev, resonances are 
reported at 2.05, 2.35, 2.51, 2.67(?), 2.74, 2.92(?), 3.01, 
and 3.10 Mey. The angular distributions of protons at 
and off resonance show interference of stripping and 
resonant contributions (Jo 54h). See also (Aj 52c) and 
(Ta 54b). 


VI. C®(dd)C¥ E,= 10.264 
See (Ke 51b) and (Cu 47). 


VIL. C2(da)B”® QO,=—1.349 E,=10.264 
4 See B”. a 


VII. C#(p p)C# E,=7.542 


Differential cross sections have been measured at five 
angles for Z,=0.45 to 1:60 Mev. Observed resonances 
are exhibited in Table IV(14) (Mi 54), together with 
the assignments of l, J, m which result from the scatter- 
ing analysis. The broad, 1.25-Mev resonance [see 
C3(p -y)N™] appears in the form of a slow increase in 


a s CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


the s wave, J=0, phase shift starting near 1 Mev 
[Mi 54) and R. F. Christy and G. Speisman, private 
communication ]. 


IX. C8(py)N" Qn=7.542 


Observed resonances are exhibited in Table V(14) 
[ (Se 52) and (Wo 53c) ]. The decay schemes of various 
levels of N", as derived from the y spectra, are exhibited 
in Fig. 25 [(Wo 53c) and (Cl 53a) ]. Actual values of 
y-ray energies observed at each resonance, as well as 
certain angular distributions are tabulated by (Wo 53c). 
See also (Cl53a) and (Hi53). The width of the 
E,=0.55-Mev resonance (8.06-Mev state) indicates 
s wave formation (J=0-, 1-) (Wo 53c). The observed 
isotropy of the radiation supports this assignment 
(De 49d). The level is established as J=1~ from 
C(p p)C. The y width for the ground-state radiation 
indicates an E1 transition, and hence T=1 for the 
8.06-Mev state; this assignment is confirmed by the 
absence (<0.7 percent) of the transition to the 2.31- 
Mev, J=0*, T=1 level [(Cl 53a); see also (Ge 53a) 
and (Wi 53c) ]. 


TABLE IV(14). Resonances in C(b p)C! (Mi 54). 


Ep (Mev) 1 72/3 (i2/pa) Jia Nut 
0.55 0 0.13 iF 8.06 
1.16 1 0.013 o+ 8.62 
1.25 0 0.23 0- 8.70 
1.47 2 0.167 F 8.90 
1.55 1 0.005 i+ 8:98 


n A ee 


* a =1.41 (131341) X10713 cm [see (Wo 53c)]. 


tæ 


te 
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The £,=1.25-Mev resonance is also established as 
due to s waves by its width (Wo 53c); its assignment 
as J=0 comes from C#(p p)C™, Again the large Y 
width is consistent with E1 radiation and T=1 (Wi 530). 
The y-ray angular distribution at the B= 1.76-Mey 
resonance (N"*= 9.18 Mev) indicates J= 1+, 2+, or 2-. 
The large y width suggests an Æ1 transition and hence 
J=2- (and therefore T=1) [(Da 51), (Da 51b), and 
(Wo 53c); see also (Ch $3d)]. If the transition to the 
6.44-Mev level is dipole (wI',= 1.5 ev), the angular dis- 
tribution requires J=3 for the latter. Arguments based 
on the reduced width of the Ep=2.10-Mev resonance, 
the anisotropy and strength of the radiation, indicate 
J=1", 2+, or 2~ for the 9.49-Mev state and the same J 
value for the 5.10-Mev state (Wo 53c). At the E,=3.11- 
Mev resonance, the angular distribution of ground-state 
y rays is of the form 1—(0.40+0.02) Ps (cos#) (H. B. 
Willard, private communication). 


3.945 t0.005,1*) 
TO 


2313 10004,10") 
TH 


CAR 


nit 


Fic. 25. Gamma-ray transitions in N": for notation, see Fig. 13. 


X. C8(pn)N8 Qn=—3.003 Ey=7.542 

Observed resonances are exhibited in Table VI(14) 
[ (Ba 53e), (Bl51a), and (Ad 50b) ]. The comparison 
with results from C#(d p)C# in the same range is dis- 
cussed by (Ba 53e). (Ignoring the possibility of isotopic 
spin impurity, only 7=0 states should be formed in the 
latter reaction.) 


XI. C8(dn)N* Qn=5.317 

Neutron groups corresponding to levels in N“ at 
2.2340.10, 3.850.08, 4.80+0.07, 4.97+0.07, 5.5 
+0.1(?), 5,760.05, 6.1+0.1(?), 6.23-0.05, 643 
+0.04, 7.00-+0.04, 7.50-+0.04(2), 7.720.04, and 
8.08-£0.06 Mev have been observed by (Be 53d; 
E,=3.89 Mev). Analysis of the angular distributions 
of the neutrons indicates /,=1, and therefore even 
parity for the ground state of N“ (Br 52), /,=1, even 
parity, J <2 for the 2.3 and 3.9-Mev states, and 1,=0, 
J=0 or 1, for the 4.9-Mev state (Be 53d). 


odd parity. ; 
A ]d measurement indicates a state of 


A neutron thresho 
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Ep Yonar o oly 
(Mev) lke (10-*7/), (mb) fev} DR Nie 
0.55 0.9 1.44 8.6 p 8.06 
1.16 0.12 0.56 1.3 8.62 
1,25 1.13 0.062 13.8 0,17 8.70 
1.47 20 0.074 0.72 8.90) 
1.55 7 0,037 0.13 8.98 
1.76 2.10.2 1.15 12.6 14.8 918 
2.10 t543 OAS 0.20 6.2 9.49 a 
3.11s 10.43 


aH, B. Willard, private communication. 


TasLe VI(14). Resonances in C4¥(p n) N? 
[(Ba 53e), (BI 51a), and (Ad 50b 


Ex (Mev) 

3.78 > 11.05 
4.01 11.26 
4.10 11.35 
4.18 11.44 
4,52 11.74 
4.8 12.0 

6.20 13,23 


Í 
| 


N" at 5.683+-0.010 Mev (Cook, Marion, and Bonner, 
private communication). 

ý Observed y rays attributed to transitions in N are 
shown in Table VII(14) [(Th 52), (Be 54a), (Ma 54), 
and Bent, Sippel, and Bonner, private communication |. 
See also (Be 53b). 


NIT. CHEN” 
See C*. 


On = 0.155 


XIM. C“(pn)N“ Qm=—0.627 
See N®. 


TaBie VII(14). Gamma rays from C¥(d njN". 


a b 
Ey? Yield Ey Yield Ass 
{kev) 10-%y/d (kev) 10-%/d j; 

72544 1.6 2 5.829—+5.104 
16388 155 3.945—2.313 
2310+12 SE 2.313 
3381415 1.8f, 1.857 3410+40 1.8! 5.6386—2.313 


3920+70: 0.31! 3910-50 3.945 
4460-+508 0.32? 
4940+49 0,90! 4930+40 4.910 
5100+50 1.0! 5130+30 5.104 
5720440 0.84 5730-30 5 
6490+50 0.90) 6450450 64 
20.31% < 
7050+40 0.763 7. 
7300+50 0.29! 7 


5052+25 1.565 
5690+50 0.69 


a (Th 52). F 

> e 54a) and Bent, Sippel, and Bonne”, private communication; values 
include ~0.5 percent Doppler correction. : ř 

e (Ma 54) and (Ma 55a): At Ea=1.4 Mev, the intensity of a 5.82-Mev 
line is <0.15 that of the 5.73-Mev line. - 

d Not corrected for Doppler shift. 

o Yield =1.0 X10-%y/d at E4 =2.5 Mev. 

{ E4 =1.58 Mev. 

z Ez =1.21 Mev. f 

b Assignment not certain. 

i E4=2.0 Mev. 

1 E: =4.0 Mev. 
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XIV. (a) N“(yn)NB Qm=— 10.545 
(b) N¥(y p)C® Qn=— 7.542 


(c) N“(y np)C?  Qn= — 12.489 
(d) N4(ya)B® n=- 11.613 
The cross section for neutron production [reactions 
(a) and (c)] exhibits a maximum at £,=22.5 Mev, 
T=3.2 Mev, c=15.3 mb (Fe 54). This cross section is 
considerably larger than that obtained by measurement 
of the N” produced (Jo 51b). For relative yields of the 
various reactions, see (Wr54). See also (Pe 52e), 
(Mi 53c), (Sp 53a), (Gr 54), and (St 54h). 


XV. N" (y 22)N? 
See (Pa 52a). 


XVI. N“ (n n)N"* 


At E,=3.9 Mev, a 2.30+0.05-Mev y ray is ob- 
served (R. B. Day, private communication). See also 
N" and (Sm 54). 


[N“(p ON" @m=-— 8.320 


At Ep=18.7 Mev, the angular distribution of ground- 
state deuterons indicates a pickup process, with /,=1 
and hence opposite parity for the ground states of N” 
and N". The intensity of a possible group leading to the 
first excited state of N'8(S;) is less than 4, 4, and 15 per- 
cent for /, values of 2, 1, and 0 respectively. The result 
is interpreted as excluding any appreciable admixture 
of s or d configurations in the ground state of N" 
[ (St 54a): see (Me 52) and (Fe 53) ].] 


Qn= —30.9 


XVII. N“(p p’)N¥* 
N¥(d d')N4* 


Observed inelastic proton and deuteron groups are 
shown in Table VIIT(14) (Co 52g, Ar 52e, and Bo 53). 
At E,=9.5 Mey, protons to the 2.31-Meyv state are not 
observed: o <0.4 mb [(Ro 53d); see also (Fu 48)]. 
At E,=6.98 Mev, the ratio of the intensities of the 
proton groups corresponding to the 2.31 and 3.95-Mev 
levels to the elastic group is 5 and 10 percent, respec- 
tively. For deuterons, the ratio for the 3.95-Mev level 
is 10 percent while an upper limit of about 0.5 percent 
is set for the 2.31-Mey level. The failure of the 2.31-Mev 
state to appear with deuterons indicates that it is the 
T=1 analog of the ground states of C“ and O“ (Bo 53; 
see Ad 52a). 


TABLE VITI(14). N" levels from N4(p p’)N™* and NH (d d’)Nu* 
er ee 


Nu (iek Nuk Ni 
( p)* (p> (pe (d d’): 
2.35 2.32+0.02 2.313+0.005 
3.95 3.96+0.02 - 3.945+0.005 


4.910+.0.010 
© - — $.09=+-0.02 5.104+0.010 


reference d 
.95 


reference e 
reference e 


52g); Ep =7.4 Mev. 
2e); Ep =8 Mev. 
3); Ep =6.98 and 7.6 Mev. 
y <0.5 percent of elastic group at Ep =6.98 Mev, 0 =90°. 
mpt to observe. p 2 
e 
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See (Ca 53a). 


XIX. O4(6t)N" Qm= 5.170 

The decay proceeds by emission of 1.835-+0.008 
(Ge 54), 1.830-+0.030 Mev (Pe 54a) positrons with an 
allowed shape (Ge 54), followed by a 2.300.03 Mey 
y ray (Sh 53a). The direct ground-state transition is 
<0.3 percent if it exists at all: log ft> 7.3 [(Ge 54); see, 
however, (Pe 54a) ]. The half-life is 72.1-40.4 sec: 
ft=3275-475 sec (Ge 54). This is an example of a Fermi 
0+—0+ interaction: see (Ja 54a), (Ro 53a), and (Ko 53). 


XX. Ol(da)N" Q,,=3.116 

The weighted mean of six Q-value measurements is 
3.115+0.0025 Mev (Va 54); this includes recent values 
of 3.1130.0035 [(Cr52c); elect. analyzer], and 
3.119--0.005 Mev [ (Fa 53); mag. spectrometer ].- 

No alpha group corresponding’ to the 2.31-Mev state 
is observed at Eg=2 Mev (Va 52b), 2.88 Mev (Cr 52c), 
6.77 Mev (As 51), 7.73 Mev (Bu 51f), 8 Mev (Sp 53c), 
and 19 Mev (Fr 53e). This is consistent with the assign- 
ment T= 1 to this state [see N“ (d d')N“* and (Ad 52a) ]. 
Groups have been observed corresponding to levels at 
3.9 (Fr 53e), 3.95 [(As 51) and (Sp 53c) ], 3.98-40.04 
Mev (Bu5if), 4.93 Mev (Sp53c), 5.01 (As 51), 
5.06-£0.05 Mev (Bu 5if), 5.7 Mev (As 51), and 5.98 
Mev (Sp 53c). 


Low-Lying Energy Levels of N” 


[See (Be 53d), (Wo 53c), (Cl 53a), and (In 53). ] 


1. Ground state: J=1; parity same as C and C" 
(Br 52). K 

2. 2.3-Mev state: E,=2.31340.005 Mev from 
N“(p p')N“; gamma rays [C!(p y)N¥ and 
C8(dn)N"] average 2.3134+0.007 Mev (without 
Doppler shift); J<2+ from C8(dn)N“; T=1 from 
N"(dd’)N* and O'6(d a)N“. The general assumption of 
J=0*, a consequence of the hypothesis of charge inde- 
pendence is not contradicted by any of the y-ray 
evidence, and is given some support by the absence of 
any transition from the 8.62 Mev, J=0t, level. 

3. 3.95-Mev state: E,=3.945+0.005 Mev from 
N'(p p‘)N¥; gamma rays give 3.944-+0.012 Mev; 
J <2* from C¥(dn)N"; J=0+ eliminated by observa- 
tion of transition (1.63-Mev) to 2.31-Mev level; J=1* 
favored by intensity of 1.63-Mev relative to direct 
ground-state transition. The strong £1 transition from 
the 8.06-Mev, T=1, level suggests T=0. 

4. 4.9-Mev state: E,=4.910+40.010 Mev from 
N" (p p’)N™; gamma rays give 4.920+0.020, including 
the Doppler shift; J=0- or 1- from C¥8(dn)N¥. 
J=( is favored by the absence of the transitions to the 
2.31-Mev (J=0+) state and from the 8.62-Mev 
(J=0+) state. 


5. 5.l-Mev state: E,=5.104-40.010 Mev from 


te 


ta, 


te 


a 


ga 
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N“(p p')N"; gamma rays give 5.100+0.025 Mev 
(including the Doppler shift). There is some evidence 
from C¥(d7)N"“ that J<2+: J=0 is excluded and 
J=1 is favored by 2.7-Mev transition to 2.31-Mev 
level; J=1 or 2 is suggested by CE (p y) N“. 

6. 5.7-Mev state: E.=5.683+0.010 from neutron 
threshold in’C¥(d n)N"; gamma rays give 5.700+0.020 
Mev: mean 5.6860.009 Mev. J=1 suggested by 
relative strength of transifion to 2.31-Mev state. 

7. 5.8-Mev slate: E,=5.829--0.02 from addition of 
energy of cascade y (725-+4 kev: not corrected for 
Doppler shift) to Z, of 5.1-Mev state. Seen in C#(p y) N" 
and in C8(d 2)N", 


ou 
(Not illustrated) 
Mass of O14 


From the measurement of the positron energy as 
1.83520.008 Mev (Gé@54) and of the energy of the 
first excited state of N” as 2.313-0.005 Mev (Bo 53), 
cone obtains O“—N¥=5.170+0.009 Mev; mass de- 
fect=12.168-L0.015 Mev. 


I. OX(B4)N“- 0O,=5.170 
See N“. 
IL. N¥(p2)O% On=—5.952 


Qo= —6.030.2 (Aj 54; photoplate). 

At E,=17.340.1 Mev, neutron groups are observed 
to broad or unresolved states of O“ at ~6.2, 7.5, and 
9.3 Mev. No levels were observed below E,~5.5 Mev; 
the upper linsit to the intensities of such groups is 0.25 
of the ground-state group (@=30°, 60°, and 150°) 
(Aj 54). 5 


cis 
(Not illustrated) 
Mass of C! 


The C¥(d p)C™ Q value is given as 0.12+0.05 Mev. 
This gives C8—N'=8.64-£0.06 Mev; mass defect: 
13.17+0.06 Mev. 


I. C(6-)N5 Qn=8.64 

The maximum f~ energy is 8.8+0.5 Mev. There is 
evidence of 5.5-Mev delayed y radiation. This radiation 
is presumably due to partial decay of C™ to either or 
both of the N“ states at ~5.3 Mev. There is some 
indication of weak components at higher energies. The 
half-life is 2.40.3 sec (Hu 50a, Hu 50e, Hu 52, and 
Sp 54) -(logft=S.6 and ~3.6 for transitions to the 
ground state and the 5.3-Mev states, respectively) 


(Ri 54b). 
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U. C#(d p)C® Qn=0.12 


The excitation curve shows a single resonance (see 
Nt‘) superposed on a rising background. On the assump- 
tion that the background arises from nonresonant com- 
pound nucleus formation and that C™ has either 
J=}* or 5/2*, it is found that the latter assignment is 
the more probable, and that Qo=0.15-40.15 (Ri 54b). 
A direct measurement of the proton energy gives 
Qo=0.12-+0.05 Mev [K. Spearman, preliminary result;. 
see (Ri 54b) ]. See also (Ri 54) and (Sp 54). 


TL. C#(ny)C® On=2.35 

The capture cross section is <10 mb (Ya 51). 

The following reactions leading to C! are not re- 
ported : C(t p)C! (O,,= 2.03), C#(td)C¥ (Om= —3.91), 
CH (a He) C (0,,= — 18.22), N” (n p)C!® (Omn= —7.86), 
O8(1a)C® (On= —3.89). $ 


N: 
(Fig. 26) 

I. BY(an)N5 Qm=0.154 E= 10.988 

Resonances are observed at E,=0.60, 1.03, 1.18, 1.30, 
and 1.49 Mev [(Be 54) and W. E. Bennett, private 
communication: N'*= 11.43, 11.75, 11.86, 11.94, and 
12.08 Mev]. A double resonance at E,~1.58 Mev and 
a single resonance at E,=2.05 Mev are reported by 
(Sh 54£), while (Tr 54) observes resonances at Ea= 1.54, 
2.09, 2.63, 3.00, and 3.26 Mev (NM*=12.53, 12.91, 
13.19, and 13.38 Mev). See also (Ho 50b). 


Il. B'(ap)C4 On=0.781 E,= 10.9388 


A resonance is observed at E,=1.58+0.03 Mev 
(Sh 53b: Sh 54f report that it is a double resonance). 
See also CH. 


Il. C2@p)N™ 0,=—4.961 
At E,=21 Mev, §=90°, proton groups are observed 
corresponding to the ground state and to states at 5.4 
and 6.5 Mev (Bu 51). 
IV. CH 4) C# 
See (Po 51). 


On=4.635 E,=14.841 


V. C8(dn)N¥  QOn=5.317 E,=16.150 

Resonances for neutrons with E,>3 Mev are ob- 
served at E,=0.58, 0.85, 1.55, and 1.78 Mev (Ri 50c). 
See also N". e = 


VI. C8(dp)C4% Q,=5.944 £,=16.150 

Resonances for ground-state protons appear at 
E,=0.65 (Cu 50) and 1.55 Mev (Be 41). Study of the 
angular distribution in the range Ez=0.28 to 0.65 Mev 
indicates strong stripping effects at the lower energy, 
and some evidence of resonance near 0.63 Mey. At this 
energy, the cross section is 0.84 mb (Ko 53a). 


~ > 
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: Ə -4873 
H 5 4 z = t 
i A 
1a ] J P 
fi | Š too, 4,841 | 
| all oma Jaro Cert 
7 : iV a 
| “B89 38 
| q ae -|2 26 383 
: eo 205 
i 142 58 " 
i 2 130 
S te: 11.235 = 4 is 
q j S104 6: =053 x 0.86 
H 9.807/ 
i L26 O''+d-a ae! | 
? aS | 
i j S | 
758 i y i 
1376H A (i 
6.33 HIN llez | 
te 5/27) | 
531 | | 
4713 096 5.28 (1/2, 3/2 i} l- | 
14 3 
Eie ems 
O"+p-a 
2.705 0° 
2m,c* 
AP 
y R | 
15 J=1/27 
N -4.961 
12154 C®+a-p 


= Fic. 26. Energy levels of N15: for notation, see Fig. 1. 


VIL. C8(de)B" 0,,=5.163 E,=16.150 


( IX. CH(p y)N8 Qm= 10.207 
an Oe e snooty from Ea=0.4 to 0.9 Mev Resonances observed for producti 
bw E, 2 a=0.99 Mev, 6=90°, the differential cross Say eee ic 


section is 7 mb/sterad (Li 51b). radiation are listed in Table I(15); transitions to the 
$ ; Re levels are also observed. Level assignments are 
VI. C3(d)C® On=1.309 E,=16.150 ased on y-ray angular distributions, neutron distribu- 


tions [from Cu 
See C”. N“ (n 1)N¥ 


(7)N"“] and comparison of (pn) and 
cross sections (Ba 54f). See also (Go 54g). 
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X. H(p n)N" O,=—0.627 


E,= 10.207 

Resonances reported by (Ba 54f) and (Ro 51h) are 
given in Table I(15) [sce also (Aj 52c) |. The location 
of the levels and the relative yields are in good agree- 
ment with those from the inverse reaction [ (Ro 51h) 
and (Ka 53)]. See also (Li 54a). 3 


© 


XI. C¥(dn)N® Qm=7.982 


Neutron groups have been observed corresponding 
to levels-in N!’ at 5.34, 6.32, and 7.46 Mey (Hu 50e). 


XII. C¥(6-)N15 Omn= 8.64 
See C}5, 
SUT. N"'Giy)N On= 10.834 


Qo= 10.832+0.008 (Ki 53b; pair spectrometer). 


The thermal cross section is 100-450 mb (Hu 52e). 
Observed y rays are listed in Table II(15) together with 
the N'* levels with which they are presumed to be 
associated (Ki 51). 


XIV. N“ (nn) N“ Ey= 10.834 


The coherent scattering cross section is 11.1-£0.3 bn 
(Pe 52b); the total scattering cross section is 11.4-0.5 
bn (bound atoms, epithermal neutrons) [(Hu 52e) and 
(Me 49b) ]. 

Resonances in the range E,=0.4 to 1.8 Mev are 
tabulated in (Aj 52c). Angular distributions at several 
of these resonances have been studied by (Ba 50g), 
(Fo 53), and (Jo 54a). From E,=1.8 to 4.0 Mev there 
is evidence for considerable structure in the cross sec- 
tion curve, but little agreement as to the exact location 
of the levels,involved. One level, at Z,=2.25 Mev, is 
quite definite, and inspection of the curve of (Hu 52e) 
would suggest further levels at ~2.9, 3.2, and 3.55 Mev. 
(Me 53a) list 14 resonances in the range £Z,=1.9 to 


TABLE I(15). Resonances in C4(p n)N™ and CH (p y0)N™ 
[(Ba 54f) and (Ro 5th) ].* 


D Yields Pr r r 
ER) 10-y/p 10-82/p ine kev eee ait Jia Niet 
10.54 
525 515 0.12 3/2- 10.697 
643 0.49 0.01 3/27 10.807 
1170 3.14 19.9 12 1.6 10.4 0:25 T 11.294 
1310 7.96 121.5 41 33 8 2.3 1/2*> 11.430 
1500 49.3 88.2 475 5 470 f 28 1/2 11.57 
1664 A 38 37.5 0.5 3/25 11.760 
1789 1.46 18 17.9 0.03 (5/2*) 11.877 
1883 5.55 15 14.7 0.3 (1/27) 11.964 
2024 40.3 18 17.2 0.8 (8/24) 12.096 
2079 286 54 44.5 9.5 (3/27) 12.147 
2272 10.5 22 21.8 0.2 (5/27) 12.328 
2451 45 12.495 


dths are tabulated by (Ba 54f); all are of 
S 3 Fp =1500 kev which appears 

, percent or less except that at Ep =15 ey whi pea 
Hees bal ON DE equal to the sum-rule limit. Yields are +29 percent. 


k, private comm ation. There is some 
P See also (Ka 53) and H. Mark, P ictributions of the neutrons that the 


/2+, respectively. 


a Ratios to single-particle wi 


evidence from a study of the angular dis 5 
states at 12.10 and 12.15 Mev may be 5/2* and 3, 


LEVELS OF 


LIGHT NUCLEI N15 131 


Taere I1(15). Gamma radiation from N" (a y)N' (Ki 51; 


Ey Rel, intens, Nu 
r 1.00 
0.09 9.156 
0.19 8.278 
0.56 7.356 
0.19 7.164 
0.9 6.325 
1.5 5.275 
5.287 4-0.010 2.3 5.275 a 
4.485-£0.010 0.8 6.325 


* See, however, (Ki 53b). 


3.6 Mev, while (Sp 54c), analyzing angular distribu- 
tions of (Hu 54b), list 7 levels between £,=2.5 and 


_ 4.4 Mev. 


The average total cross section (AE~10 percent) 
from E,=3 to 14 Mev indicate brodd maxima near 4, 
5.5, and 8 Mev (Ne 54b). At 14 Mev, o,=1.59+0.03 b 
(Co 52h); of this, 0.82 b is due to elastic scattering, 
0.27 b to two-particle disintegrations, 0.016 b to 
three-particle disintegrations, and 0.48 b to inelastic 
scattering exclusive of disintegrations (Sm 54). (Ph 52b) 
give 0.790.05 b for the total cross section, excluding 
elastic scattering, at F,=14.5 Mev. The elastic scatter- 
ing at this energy shows a strong forward maximum 
and smaller maxima at §=70° and 180°, indicative of 
diffraction scattering. The inelastic scattering is roughly 
isotropic (Sm 54). See also (Hi 54c). 


XV. N” (n 2n)N¥® O,,=—10.545 E,= 10.834 

At £,=14.5 Mev, the cross section is given as 
5.67+0.8 mb by (Pa 53a), while (Du 54) find 34+1.1 
mb at 14 Mev (preliminary value). 


XVI. N“(n p)C* 0,=0.627 E= 10.834 


The thermal cross section is 1.70+0.05 bn; the total 
absorption cross section is 1.78+0.05 bn (Hu 52e). 
Observed resonances are tabulated in (Aj 52c). (Gi 53a), 
using a continuous neutron spectrum and observing the 
distribution of total pulse sizes, reports values generally 
in agreement with those of (St 51b) and earlier workers, 
in addition to a number of new values which, if properly 
attributed to the present reaction, can be identified 
with N“ levels found in other reactions. 


XVII. N" (n a) B" m=—0.154 E= 10.834 
Observed resonances are tabulated in (Aj 52c) and 
by (Gi 53a): see N¥(n p)C™. ae 
XVII. NYQ@zd)C®8 Q,=—5.316 E,=10.834 

See (Li 52). F 


XIX. N"(@z)C® Qn=—4.007 £,=10.834 


The cross section is 11452 mb averaged over a fission 
neutron spectrum (Fi 53). 
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10.811=-0.015 


a en 50i): Ea =1.41 Mev, 8 =90°. 

Sp 53) and (Sp 54b). Ea =5 to 8 Mev, 0 =90°. 
e (Gi 52): Ea =8 Mev; analysis by stripping theory. 
a (St 54f) finds J =1/2 or 3/2 for these states. 


Cd 


XX. N!(d H)N5  Qm=8.609 


The weighted mean of eight Q-value determinations is 
8.6143-0.007 Mev (Va 54). Levels in N!* corresponding 
to observed proton groups are listed in Table ITI(15) 
[(Sp 53), (Sp 54b), (Ma 50i), and (Gi52)]. Study of 
the angular distribution of ground-state protons at 
Ea=0.4, 0.5, and 0.6 Mev indicates both stripping 
(,=1) and compound nucleus formation (Jo 54c). 

Gamma rays observed at Ez=4.0 Mey are exhibited 
in Table IV(15) (Bent, Sippel, and Bonner, private 
communication). At Ez=0.63 Mev, a study of (p—+) 
angular correlations indicates J=% or 3 for the 5.3-Mev 
levels (St 54f). See also (Fu 54) and (Th 54d). 


XX. NG pn)N™  Q,,= 2.352 
See (Cu 52a). 
E XXI. 0% (6+)N15 Qn= 2.705 
; See 0'5. 
Ej 
= XXIL OGN" Ọn=—12110 
; See (Wa 49i). 


XXII. O%(nd)N% Q,,=—9.884 
See (Li 52). 


TABLE IV(15). Gamma rays from N4(d pyN1.2 
a —— —————————————— 


b to 5 
a Git) aoo 
10.083-0.03 10.04+-0.04 0.15 
9.173-0.05 9.13+0.06 0.15 
8.374.0.03 8.333-0.04 0.71 
7.3440,03 ` 7.31+0.04 2.3 
6.36+0.04 6.332-0.05 0.75 
5.29 +0.03 5.26+0.04 3.34 
| Sippel, B R kalone pis 
reas slope Pad BE pirate gomniration) E: =4.0 Mey. 


ted for Doppler shift. 
also from Nu (d n)O15. 
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t 
j Taste IIT(15). N" levels from N¥(d p)N". XXIV. O!(da)N  Qn=9.807 
i y F Qo= 9.807+-0.012 [ (Pa 54); mag. spectrometer]. 
a a sm 
f 0 1/2, 3/2 or 5/2, odd XXV. OB(pa)N®  On=3.969 
f . .280+0.015 F À 
; SaO aaae Jrg Qo=3.961+0.009 [(Mi 54a); mag. spectrometer]. 
6.32840.006 6.330+0.015 3/2 or 5/2, odd $ 
7 1640.006 7.1650.015\ 1/2 or 3/2, even See also F! and (Aj 52c) and (Ro 53c). 
7.309=-0.006 7.314+0.015) (one or both) 
7.575£0.015 os 
8.3153-0.006 8.316+0.015 1/2 or 3/2 even 
8.571+-0.015 £ 
AID 1/2 or 3/2, ev (Fig. 27) 
9.165Æ0.015 SD aha 
9.8344-0.015 I. O(N" Qm= 2.705 
1 OE The maximum positron energy is 1.683-40.005 Mev; 
10.544+0.015 the half-life is 118.0+0.6 sec (Br 50d), 123.441.3 sec 
10.705-40.015 


(KI 54). The Kurie plot is linear down to ~300 kev, 
indicating a simple allowed transition (Br 50d) (log ft 
= 3.57: Fe 51b). 
I. C®(@2)O" 
See (Ki 39). 


Il. N“ (py)O5 Q,=7.347 


Observed resonances are exhibited in Table V(15) 
[see also (Du 51) and (Aj 52c) ]. The 8.0- and 9.8-Mey 
states are believed to be formed by s-wave protons 


TABLE V(15). Resonances from N“ (p y)O'5. 


Om= —8.448 


Ep (Mev) T (kev) wTy (ev) I Jia Ole 
277 <2" 0.02 7.606 
70030 10030 0.02 0 WP, eye 8.00 

1064+-2 4.8+1 0.63 8.340 

1550-20 50-+20 0.16 1/2+b 8.79 

174845 1143 0.21 (3/2-)P 8.978 

181544 741.5 0.52 9.041 

2356-8 14424 2.4 9.546 

2489-7 11-3 33 9.670 

260050 1270450 46 ® Wage» O77 


S 


a (Ta 46). Other values from (Du 51). 
b Assignment from N4 (p p)N. 


(J=3, $*) (Du 51). The reduced widths of these states 
are of the order of 20 percent of the single-particle 
value [(Vo 54) and (La 54a)]. 

At the 277-kev resonance, capture y lines resulting 
from cascades through the states in O! at 5.27, 6.14, 
and 6.82 Mev are observed; the direct ground-state 
transition has not been observed (Jo 52b). The ground- 
state transition is observed from all the other resonances. 
At the Z,=1.06-Mev resonance (O!5*=8.34 Mev), 
y rays with energies 3.040.03, 5.27--0.03 (strong) 
and 1.46--0.03, 6.824-0.08 Mev (weak) are also found 
(Li 53). See (Ba 55b). 


IV. N¥(p)O*% Qn=—5.952 E,=7.347 
See O4, 


V. N4(p p)N¥ E3=7.347 


Anomalies in the scattering are observed at E,=1.07 
and 1.60 Mev by (We 53a) and at E,=1.53, 1.72, and 
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Fic. 27, Energy levels af 0": 
for notation, see Fig. 1. 


1.78 Mev by (Go 54). The anomalies are superposed on 
a background which decreases less rapidly than the 


“Rutherford cross section [(We 53a); see also (Ta 55) ]. 


Angular distributions have been measured at ten ener- 
gies near the resonances. These are said to indicate 
=1+ for the 1.53-Mev resonance and possibly J=} 
for the 1.72-Mev resonance. The background shows a 
larger s-wave phase shift and a smaller p-wave shift 
than is expected from hard-sphere scattering (Go 54). 


VI. N4(p d)N® On= —8.320 Ey=7.347 


See NE. 


VIL. N¥(pa)C# Qm=—2.916 Ey=7.347 
d i i d at 
Broad resonances in the yield of C” are observe 
E,=4.94 (c=10 mb), 5.3 (28 mb), 5.6 (35 mb), and 
6.15 Mev (66 mb) [ (BI 52a): stacked-foil method]. 
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VI. N¥(dn)O® Q,=5.122 

Neutron groups observed at Ey=7.7 Mev are listed 
in Table VI (15), together with J, m assignments ob- 
tained from the angular distributions, analyzed by 
stripping theory. Except for the ground state and that 
at 6.84 Mev, the fits to theoretical distributions are not 
completely satisfactory, and the assignments must be 
treated with some reserve (Ev 53). 

At Ez=4.0 Mey, y radiation has been observec>witk= 
energies (corrected for Doppler shift) of 6.81-0.04 Mev 
(1.6X10-* y/d), 6.12+0.06 Mev (0.5X10-* y/d) and 
5.26+0.04 Mev (3.3X10-* y/d; probably includes 
N"(d p)N®) (Bent, Sippel, and Bonner, private com- 
munication). z 


IX. N8(p”)O® Qa=— 3.487 


The threshold is at Ep=3.183+0.008 Me: 


Q= — 3.547 Mev. This value is 60 kev higher than tl 
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Tapte X(15). Neutron groups from N" (d n)O': (Ev 53). 


O (Mev) Out lp Assignment 
5,150.16 0 1 J=5/2, 3/2, 1/2, odd 
—0,14+-0.11 5.294+0.17 2 J=7/2, 5/2, 3/2, 1/2, even 
—1.04+-0.06 6.19-40.16 1 J=5/2, 3/2, 1/2, odd 
— 1.69-40.08 6.844-0.16 0 2, 1/2, even 
—2.334-0.09 7.45-0.16 1 2, 3/2, 1/2, odd 
—3.272-0.09 8.42+-0.16 1 , 3/2, 1/2, odd 
9.06+0.16 1 i 


—3.91+0.07 


| 


given by the beta decay (Ki 54a). See also (Du 38) and 
(Ba 39). 


X. O (y 17)O! 
See (Ba 45). 
XI. O'8(He?a)O® Qn=4.969 
See (Po 52b) and (Ku 53a). 


Qm=— 15.597 


N:6 
(Tig. 28) 
Mass of N'6 


From the preliminary ground-state Q value for 
O'(da)N?: 4.237+0.010 Mev (R. Pauli, private com- 
munication), a mass defect of 10.40+0.03 Mev is 
obtained for N14. 


I. N (6-)0!8 Qmn= 10.40 


From the character of the beta decay, it is concluded 
that N! has J=2- (see O14), 


N 


Fic. 28. Energy levels cf N16: for notation, see Fig. 1. 
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TI. C (d n)N"® 
See (Hu 506). 


Qm=7.982 E,=10.47 


IIT. C" (d pyc’ Qm=0.12 E,=10.47 


The excitation function has been studied for E4=0.6 
to 3.0 Mev. A resonance is observed at Ba=2.15 Mev, 
T~400 kev (Ri54 and Ri 54b). See also C! and 
(Hu 50e). 


IV. C"(d a) B? 
See (Hu 50e). 


V.N"(Cp)N On = 4.84 
See (Cu 53c). 
VI. N5 (n y)NE  Qm=2.49 


The thermal cross section is 24+8 ub (Hu 52e). 


VIL. N! (n a)B? m=— 1.627 E,=2.49 
See (Je 48c). 
VII. N(d pyN'® Qm=0.27 


At Ea=2.0 Mev, proton groups have been observed 
with Q values (preliminary) of 0.158, —0.022, —0.118 
Mev, corresponding to excited states of N'6 at 0.11, 0.29, 
and 0.39 Mev. The ground-state group was not ob- 
served (Th 54a). A group with Q= —0.034+-0.005 Mev 
is reported by (Ma 50i). See also (Wy 49c). 


IX. O(n 4)N On=—9.62 
See O" and (Li 52). 


X. O8da)N% 0,=4.24 

Thin separated targets of O!8 bombarded with 
0.855-Mev deuterons yield alpha groups with Q= 4.237 
0.010, @ı=4.121+0.010, Q.=3.93740.010, and 
Q3=3.846-+0.010 Mev. No other a group with Q=4.23 
to 5.60 Mev was found at 135°: The upper limit to the 
intensity is 5 percent of the weakest group observed. 
Assuming Qp to be the ground-state group, this work 
indicates excited states at 113, 300, and 391 kev 
(R. Pauli, private communication). 


XI. FO(na)N Q,,=—1.49 


At E,=4.87 Mev, an a group has been observed with 

=—1.77+0.13 Mev, probably corresponding to a 
transition to an excited state of N16 (Ja 54). See also 
(BI 47c). : 


O16 
(Fig. 29) 
I. C¥@y)0% 0,,=7.149 
See (Jo 53e). 
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CCO. In Public De 


a ee RI bi eter 


Digitized by Arya Samaj Foundation Chennai and eGangotri ae 


136 F 
TT. C£(a nO" Ey=7.149 


See (Ki 39). 


On= — 8:448 


Tl. C®8(@an)C" Qm=—18.711 Es=7.149 


See (Li 53c). 


IV. C®@p)N® Qn=—4.961 £,=7.149 
. See N". 
V. C®(@a)C® E,=7.149 


Resonances derived from a phase-shift analysis of the 
scattering at several angles are exhibited in Table I(16) 
[(Hi 53a) and (Bi54a)]. At the upper limit of these 
experiments (O'*=12.5 Mev), the existence of higher 
J=0* and 2* levels is indicated by a pronounced in- 
crease in the =0 and /=2 phase shifts. The observed 
levels, which include several whose great width suggests 
a single-particle character, agree well with predictions 
based on the a-particle model. It is noted that only 
(J,) even-even or odd-odd levels would appear in the 
present reaction [ (Bi 54a) and (De 54c) ]. 


VI. C8@n)O'% On= 2.201 
See (Jo 51d) and O". 


TABLE I(16) Resonances in C! (a «)C!2.2 


Er (Mev) Tiab (kev) y?/} (hi?/pa) Ols* (Mev) Jix 
3.24 860 0.85 9.58 i= 
3.582 1 0.0015 9.835 2 
4.28 36 0.26 10.36 4+ 
5.27 10 11.10(?)> 
5.47 3300 0.76 11.25 ot 
5.82 106 0.03 11.51 25 
5.96 1600 0.73 11.62 Om 
7.04 230 0.04 12.43 les 


` The first two entries are from (Hi 53a); the remainder from (Bi 54a). 
Assignment to present reaction not certain. 


VII. N” (d n)O! m=9.122 E,=20.718 
See (Ne 37b) and 0%, 


VIII. N¥(dp)N™ 0,,=8.609 E,=20.718 
Angular distributions show strong indication of 
stripping even at E,=0.4 Mev; see also (St 54f) 
and N}¥, 
Ix. N“(d d)N¥ 
See N”. 


Ey=20.718 


X. N” (d) N! 
See (Bo 42). 


Qm=— 4.288 E,=20.718 


XI. N“(da)C?  Qm=13.570 E,=20.718 
The cross section rises gradually for Ez=0.45 to 0.90 


‘Mey. Angular distributions have been studied in this 
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range by (Ca 54d). At Za=0.63 Mev, the angular dis- 
tribution indicates interference between states of 
opposite parity (St 54f). See also C”. 


XII. N" (d 4a) E= 20.718 


See (Fo 47b). 


On= 6.292 


XIII. N¥(py)O Qn=12.110 


A resonance for capture radiation appears at E,= 1.05 
Mev (I'~150 kev, peak cross section=1 mb; T,~150) 
ev) (Sc52). The large radiative width implies an al- 
lowed £1 transition to the ground state, and therefore 
J=1- [(Sc52) and (Wi 53a) ]. The angular distribu- 
tion of 13-Mev y rays is isotropic within 7 percent 
indicating J=1. The assignment T=1 would be indi- 
cated but for the fact that the level also appears to 
have a large a width (Wi 53a). A search for y transitions 
leading to the 6.06-Mev, J=0*, state of O'8 yields a 
branching ratio <1.3X10~ [(Go 54h) and (De 53d)]. 
The capture radiation is not resonant at E,=340 key 
[see N!°(p a)C"]; the yield is less than one percent of 
that for the Z,=1050-kev resonance (Kr 54b). 


XIV. N¥(pa)C®? Qn=4.961 E,=12.110 


Resonances for ground-state (ao) and 4.4-Mey state 
(a1) æ particles are listed in Table II(16). Cross sections 
at low energies have been measured by (Sc 52). The 
resonance for ap and a; at E,=1050 kev is believed to 
be the same as that exhibited in the capture radiation 
and is therefore J=1- [see N" (p y)O'*]. Support for 
this assumption comes from the angular distribution 
of long-range a particles which is inconsistent with 
J=2* or 3~ (Co 53k). With s-wave formation, the re- 
duced “widths are Yr =1.8X10-" Mev-cm, y.2=0.04 
10% Mev-cm (Fo 53a). There is some question 
whether the a-particle emission violates an isotopic 
spin selection rule (Wi 53a). 

The resonance at E,= 338 kev is attributed by (Sc 52) 
and (Fo 53a) to s-wave protons, forming a state with 
J=1-. The reduced widths are y,2=1.3X10-" Mev- 
cm, Ye =0.07X 10-8 Mey-cm, the former amounting 
to about 10 percent of 342/2ua (under the assumption of 
p-wave formation, J=0+, a value for yy? of about 
1.5X3h?/2ua is obtained.) (Co 53k) and (Ne 53b) find, 
on the other hand, that while the angular distribution 
of a» particles is isotropic for E,<380 kev, a strong 
(cos@) term sets in above this energy. The presence of 
the (cos) term clearly indicates interference between 
states of opposite parity, and a satisfactory fit is ob- 
tained with J=0+ for the E,=338-kev state and 
J=1" for the E,=1050-kev state (O'*= 12.43, 13.09 
Mey) [see, however, C®(a a) ce]. 

Assignments indicated for the E,=429-, 898-, 1210-, 
Pee ee Tesonances derive from angular distribu- 

ms of the a1 particles (Kr 53), 4.4-Mev (C2*) y rays 
[(Kr 53), (Kr 54b), (Ba 52g), and (Ne 2 a4 aY 
correlations (Se 52a). See also (Ch 53d). 
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XV. N" (p 2)O'% 
See O15, 


Qm=— 3.487 E,=12.110 
XVI. N“ (dn)OS Qm=9.885 


Neutron groups corresponding to levels at 0, 6.1, 9.3, 
and 10.7 Mev are reported: see (Hu 49f) and (Wo 50). 


ig “TABLE II(16). Resonances in N! (p a)C#? (ao) 
and in N4 (p awy)C® (ca). 
Pon r a(as)* ola) 
te (kev) (kev) (mb) (mb) J, x Ou 
338 94b 151 nonres. 0+, 17 12.43 
429° 0.9 300 2 12.51 
898 Z 2.2 800 23 12.95 
1050° ~150 500 15 Ta 13.09 
1210° 2235 600 300 4+ 13.24 
16409 ~150 non res. res. ik 13.65 
a (Sc $2): assuming isotropy. 
es Dee for onacion of penetration (s wave); (Mi52b) 
¢ (Se 52). 
d (Kr 54b).% s 
© See text. 


"XVII. N*(6-)O" Qm= 10.40 


The 8 spectrum exhibits components to the ground 
state (~18 percent) and to the excited states at 6.1 
and 7 Mev (~40 percent each) [ (B147) and (So 46) ]. 
Gamma rays of energy 6.1330.011 and 7.10+40.02 
Mev, with intensities in the ratio 1:0.08=0.02, are 
observed (Mi 51a). The ratio of the intensity of the 
7.12-6.14-Mev transition to the 6.14->ground transi- 
tion is <0.05 (Bo 53a). This result implies that the 
B decay to the 7.12-Mev state must be weaker than 
indicated by the absorption measurement results. The 
preferred direct ground-state decay of the 7.12-Mev 
state is not ecessarily a violation of the isotopic spin 
electric dipole rule [see F(p a)O'*] (Ge 53a). 

The half-life is 7.35-0.05 sec (Bl 47), 7.380.05 sec 
(Ma 54d). The ground-state transition is first forbidden 
(log ft=6.8) while the transitions to the excited states 
are allowed. From this it is concluded that N'® has 
J=2- (Mi 51a). 


XVIII. O8(y 2)O% QOn=—15.597 

pe y The cross section exhibits a slow rise for ~3 Mev 
above the threshold (Mo 53b) followed by the usual 
giant resonance. Characteristics of the giant resonance 
are: E,=22.5 Mev, T=3.5 Mev, Omax=/./ mb, inte- 
grated cross section 31 Mey-mb (Fe 54), £,=21.9; 
Omax=8-9 mb, integrated cross section to peak=19 
Mey-mb (Mo 53c), Ey= 24.2, Omax= 11.4 mb, integrated 
cross section to peak= 31.2 Mev-mb (Jo Sib). 

Discontinuities? in the activation curve indicate 
| absorption into discrete levels’ of or" [ (Ka 54a) and 
(Sp 54a) ]: see Table II(16).¥ The sum over the ob- 
| served resonances yields an integrated cross section of 
| 14 Mev-mb to E,=22 Mev; it thus appears that a sub- 
| stantial fraction of the absorption takes place through 


LEVELS OF 


LIGHT NUCLEE 01$ 
discrete levels (Ka 54a). Detailed analysis of the 15.98- 
Mev break indicates a level position of 16.02 Mev and 
a width of 22+8 kev. Radiative widths for the levels 
below 19 Mev, derived from the individual integrated 
cross sections, range from 2 to 30 ev, and are consider- 
ably smaller than would be expected from #1 absorp- 
tion (A. S. Penfold and B. M. Spicer, private communi- 
cation). The region 17.6 to 17.9 Mev has been investi- 
gated with monochromatic y rays by (Ca 54c) who 
reports evidence for a level at 17.71-40.01 Mev with 
[~20 kev. See also (Pe 52e), (Ha 53d), and (Mo 53c). 


XIX. O(y p)N™ On=—12.110 
See (Wa 49i). 


XX. OM(ya)C2 On=—7.149 


For E,,<27 Mev, the cross section for production of 
C® (indicated by observation of @-+recoil tracks) ex- 
hibits a broad (~5 Mev) maximum near 17.5 Mev; the 
peak cross section is ~50 ub (Mi 53c). (Na 52b) find 
at £,=17.6 Mev, less than 10 percent of transitions 
leading to the 4.4-Mev state of C®: they report a cross 
section o(17.6)=195+8 yb. (St 54h) reports evidence 
for excited states of O! at 14.2(?), 16.75, 17.3, 22.6, 
23.15(?), and 24.6 Mev with J=2*, T=0. See also 
(Er 54b) and (Gr 54). 


XXI. OM%(y4a) On=— 14.426 


The cross section for production of 4-pronged stars 
shows maxima at £,=22.6, 25.8, and 29.5 Mev, with 
possible indication of finer structure [(Go 52): 700 
stars; see also (Mi 53c), (Li 53b), and (Hs 53) ]. For 
E, near the first maximum, two classes of stars are 
reported by (Mi 53c) ascribed to O'(y 2a)Be*® (ground 
state) and O! (y 4a). It is considered improbable that 
excited states of Be? or C? are involved (Mi 53c). 
(Li 53b) and (Go 50h), on the other hand, believe that a 


TaBLe III(16). Levels in O from O” (+ n)jO*. 


dE cs ode 
ay a ote one EER 
15.98 0.078 15.9 0.06 
16.44 0.045 16.4 0.05 
16.67 0.071 16.7 0.18 
16.85 0.142 = 
16.92 0.222 102 Des 
17.04 0.575 17.4 0.80 
17.44 0.236 
17.54 0.185 
17.692 0.185 K 
17.88 0.07 T 
8.48 0.117 
18.76(?) rene 
18.91 0.511 18.9 0.66 
19.3 15 
` 20.7 2.1 
21.9 7.5 


a (Sp 54a) and A. S. Penfold and B. M. Spicer (private communication), 
quoted to +20 kev. ‘ 

b (Ka S4a). a 

e (Ca 54c): 17.71 +0.01 Mev, T ~20 kev. ge 8 
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large fraction of the stars involve levels of C? at 9.6 Mev 
(Go 50h) and 11.3 Mev (Li 53b), yielding Beê in the 
ground state. 

In the range Z,=23 to 25 Mev, the reaction is be- 
lieved to proceed mainly via a 12- to 13-Mev level of 
C.f (Li 53b) and (Hs 53)], possibly leading to the 2.9- 
Mey state of Be® (Li 53b). For E,>25 Mev, the domi- 
nant mode by far is via levels in C? near 16 Mev, yield- 
ing Be® in the 2.9-Mev state [(Li 53b) and (Hs 53)]. It 

-is suggested that the process here involves E1 absorp- 
tion into 7=1 states of O! which then decay preferen- 
tially to T=1 states of Cl. A few cases involving a 
17-Mev, T=1, state of Be® are reported [(Hs 53): 
see (Ge 53a) ]. See also (Li 52c). 


XXII. (a) O8(y na)C" On=—25.859 
(b) O%(y)N8  Qm=— 25.006 
See (Er 54c). 


XXII. O'8 (7 n’)015* 


At E,=14 Mev, gamma rays have been observed 
from the excitation of the 6.13- and 7-Mev levels of 
O!8; o~0.2 bn (Th 54b). See also OY. 

XXIV. O!(p p’)O!S* 

Inelastic proton groups are observed corresponding 
to levels at 6.05, 6.13 [(Ar 52e) : E,=8 Mev: (Bu 53b): 
E,=9.5 Mev], 6.9 and 7.1 Mev (Bu 53b), 8.6 and 9.7 
Mev [(Fu 48): E,=15 Mev]. At E,=93 Mev, a broad 
inelastic group is observed corresponding to unresolved 


levels at 6 and 7 Mev (K. Strauch and W. F. Titus, 
private communication). See also (Co 52g). 
XXV. F¥(pa)O% Qn=8.124 

In addition to the ground-state group, four a-particle 
groups are observed, corresponding to levels at 6.06 
(nuclear pair emitting), 6.14, 6.91, and 7.12 Mey 
[(Bu 50c), (Ch 50), and (Fr 50e) ]. The reaction ex- 
hibits a large number of resonances (see Ne). 

The existence of pairs and absence of y radiation from 
the 6.06-Mev level clearly implies J=0 for this state. 
The energy spectrum and the angular correlation 
[(De 54a), (De 49b), and (Ph 51a)] are consistent 
with J=0*. The half-life is 5.00.5 10-1 sec, con- 
siderably longer than predicted by the a-particle model 
(De 54a). Two-quantum decay amounts to less than 
10 percent [R. Latham and others, cited in (De 54a) J. 

The angular correlation of internal conversion pairs 
is consistent with J=3- for the 6.14-Mey state 

< (De 54b), as is the (ayy) angular correlation [ (Ar 50) 
and (Ba SOf)] and the polarization of the 6.1-Meyv 
y radiation (Fr 52a). The half-life of this state is <10-4 
‘sec (C. A. Barnes, private communication), 10-">7,>5 
X102 sec (S. Devons, preliminary value, private 


communication) ; see also (Go 53g). 


Study of (asy2) and (asy) angular correlations leads 
to assignments of J=2+ and J=1- to the 6.91- and 
12-Meyv levels of O', respettively (Se 52b); polariza- 
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tion measurements support these assignments (Fa 53a), 
(Wi 53) find the 6.91—0 transition (E2)> 200 times 
more probable that the 6.91—6.14 transition (£1), 
Since on the single-particle model (ignoring isotopic 
spin), the expected ratio is about 1/6, this observation 
Suggests that the isotopic spin selection rule inhibits 
the latter by a factor>1200. The 7.120 transition 
(£1) is found to be > 120 times more probable than the 
7.12 to 6.14 transition (£2). Since a ratio of ~2.4% 107 
would be expected, the observation indicates an in- 
hibition of <2X10% (Wi 53). According to (Ge 53a), 
the ground-state transition should be favored by a 
factor of ~200, even if the states are pure T=0, The 
half-life of the 6.91-Mev state is <1.210-"4 sec, while 
that of the 7.12-Mev state is <8 10-5 sec (S. Devons, 
preliminary values, private communication). 

A search for a 2- level at E,=873 kev reveals no such 
level in the range E,=7.1 to 8.7 Mev and no evidence 
of degeneracy with the 6.91- and 7.12-Mev levels 
(Pe 54d). This result appears to be in agreement with 
predictions of the a-particle model of (De 54c). See also 
(In 53). 


Fie 
(Not illustrated) 
Mass of F'6 


The mass of F!6 may be roughly estimated from that 
of N'®, assuming charge symmetry and making the 
necessary corrections for the difference in Coulomb 
energies and the neutron-proton mass. On this basis the 
mass defect of F!§=10.44-5.5~15.9-++1 Mev, and Fé 
is then ~1.1 Mev unstable with respect to proton 
emission. See (Al 50g). 

The following reactions leading to F!6 ave not been 
reported: N'(He? 2)F'® (Q,,=—1.5), O! (p nu) FS 
(Qm=—16.7), and O'8(He? HNE (Qm=— 15.9). 


Nu 

(Not illustrated) 
I. N7 (6-)O* O16 QOn= 8.8 
The decay is complex. See O17, 


IL C¥@p)N7 0,=—9.8 
See O18, 

UI. O"(p)N" Q,=—8.0 
See O!8, 

IV. O8(y p)N7 0,=—16.1 
See (Re 54). 


V. Photospallation reactions 


A number of spallation reactions yielding N" have 


been observed by (Re 54); the integrated cross sections 
to 180 Mev are given. 
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(Fig. 30) 
I. C8(an)O Q,,=2.201 £,=6.344 
Resonances are reported at Ez=1.066-£0.005 (T~3 
kev) and 1.344+0.010 Mev (Jo 53e), and at Ea=2.44, 
2.66, 2.76, and 3.30 Mev (Tr 54). The narrowness of the 


first resonance may imply a large J for the 7.16-Mev 
state (Jo 53e). 


II. N¥(@p)O!" Qm=— 1.198 


Qo= —1.16 Mev [(Ro 51g); photoplate]. 

Qo= —1.16 Mev [ (Hj 53) and (Hj 53a) ; photoplate]. 

Proton groups corresponding to an excited state of 
ON at 0.8 Mev (Ro 51g), 0.86 Mev [(Hj 53) and 
(Hj 53a) ] have been observed. See also (Ka 52a). 


II. N®(dx)O 0,,=9.885 E,=14.027 
X 2 

See O18, 

IV, NU(d p)N' O,,=0.27 E,=14.027 
See N!6, 

V. N'(da)C# On=7.683 E,=14.027 
See (Ho 40). 

VI. O(n y)O" O,=4.143 
capt <0.92 mb (Sa 47). 

VIL. O'8( 12) O18 E,=4.143 


At £,=8.2&%10~ ev, the cross section is 11.7 b 
(Hu 54a). The epithermal scattering cross section 
(free) is 3.76-+0.02 bn (Hu 52e). The cross section is 
almost constant from thermal energies to Z,=0.3 Mev 
(Hu 53c). The course of the nonresonant cross section 
up to #,=2 Mev can be accounted for by an s-wave 
interaction involving the 0.88-Mev level, with a reduced 
width of the order of the single-particle value (Th 52b). 

Observed resonances are indicated in Table I(17). 
The level assignments are based on peak heights, peak 
shapes, and angular distributions. Polarization studies 
with Li*(p 2)Be’ neutrons in the range E,=0.25 to 
0.6 Mev indicate strong interference between the P; 
level (E,=0.43 Mev) and the S, background. The 
absolute magnitude of the polarization at 0=90° is in 
good agreement with the known level parameters 
(Wi54c). The E,=1.0-Mev resonance (O"*=5.08 
Mev) may be the D; component of the *D term of which 
the ground state is the other member (Ad 53). A search 
in the range E,=1.0 to 1.3 Mev reveals no indication of 
a level corresponding to that reported at E.=3.229 Mev 


` in F9(@a)O" (R. L. Becker, private communication). 


At E,=14.1 Mev, the elastic scattering cross section 
is 0.7 b (Co53b), while the total cross section is 
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(Mev) (kev) íb} la P/BGM/2pa) J,” QF 
0.443% 45 16.5 1 0.038 IJET 4.56 
1.00 100 7.9 Z 0.26 ofan 5.08 
1322 35 6.7 i 0.009 . 3/275 5.39 
1.66" <7 SA >O 23/2 3.71 
1.84 <10 38 SO 23/2 5.87 
1.91° 30 3.0 1 0.0056 1/2- 5.94 
2.374 120 0.3 0 0.013 1/2* 6.37 
3.33° 220 3.6 2 9.049% 3/27 7.28 
3.80° 800 3.0 1 0.080% Side 7.72 
4.35! 280 2.2 1 0.025" 1/2- 8.28 
4.6% 8.5 
4.8% 8.7 
5.157 8.98 
5.7% 9.5 
(5.9)z (9.7) 
(6.4)z (10.2) 
(6.7 )# (10.5) 
(6.8) - (10.6) 


a R. L. Becker, private communication. 
b (Bo 50f) and (Bo Sic). 

e (Bo Sic) and (Ba 52b). 
4 (Bo 51c), (Ba 52b), and (Ri 510). 
e (Fr 50b) and (Ba 52b). 

! (Fr 50b), (Ba 52b), and (Hu 53c). 
e (Hu 53c). 

b (Vo 54). 

i (Ad 53). 


1.650.05 b (Co 54f), 1.590.03 b (Co 52h), 1.64-£0.04 
b (Po 52a), 1.68-40.09 b (Ag 53). The total cross sec- 
tion has been measured from E,=14.1 to 18 Mev; it 
remains essentially constant in that region (Co 54f). 
See also (Go 52d), (Hi 54c), (Ne 54c), and (Wi 54e). 


VIII. O' (1 n’)O'* E,=4.143 


The inelastic scattering cross section at E,=14.1 Mev 
is 0.5 bn; the angular distribution is roughly sym- 
metric (Co 53b). The cross section for production of 
6-7 Mev y rays is 0.25 b (Th 54b). See also (Ba 53a) 
and (Sc 53). 


IX. O(n 22)O% On=—15.597 E,=4.143 
See (Je 44) and (Sh 45). 
X. O(n p)N  On=—9.62 By = 4.143 


The cross section rises from threshold to a peak of 
8930 mb at E,=14 Mev, falling to ~60 mb at 18 Mev 
(Ma 54d). The cross section is given as 35 mb at 
E,,=14.1 Mev (Hu 54a) and as 49425 mb at £,=14.5 
Mev (Pa 53a). 


XI. O(n d)N™ O,=—9.885 E,=4.143 
At E,=14.1 Mev, the cross section is 15 mb (Hu 54a). 


XI. O(na)C® Qn=—2.201 Es=4.143 

Thirty-two resonances are reported by (Gi 53a), 
corresponding to levels in O" between 6.8 and 12.8 Mev. 
At E,=14.1 Mev, the crosssection is 310 mb (Hu 54a). 
See also (Wi 37e). a 
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18 
a 
g 
b 
a2 
410 
6.8 
46.7 
JE 4 
te ee 5.9 
45.7 
4515 
— 748 
1/2- 34.6 
JE 35 
3.80 
73.33 


3.06 
32 2er (a 
2 moc? _1.918_| 
O'* +d-p 
0.872 vet Z 
B 
-0.603 Q” J=5/2* 
Ne*°+n-a -1.198 
N'*+ a-p TA -1.825 
C" a-n 
E a Fic. 30. Energy levels of O!7: for notation, see Fig. 1. 


XMI. O¥%(d £)O"” Q,=1.918 


The weighted mean of nine Q-value determinations is 
1.919+-0.004 Mev (Va 54). A recent result is 1.915 
0.010 Mev [(Sp 54b); mag. spectrometer]. Recent 
values for the energies of the first three excited states 
are: 875, 3055, 3840 kev h(Sp{54b): +12 kev], 893, 
3005, 3853 [(Kh 53): mag. analyzer, no errors stated]. 


i CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Other values, including higher levels, are tabulated in 
(Aj 52c). See also (Bu 49c). 

At Ea=8 Mev [(Bu 50e), (Bu 51f), and (Bu 5ih)] 
and 19 Mev (Fr 53e), the angular distributions, an- 
alyzed by stripping theory, indicate /,=2, and there- 
fore J=}+ or 5/2+ for the ground state, and /,=0, and 
therefore J=1+ for the 0.88-Mev state of O'7: (see 
also F'8.) Reduced widths derived from the absolute 


na atc 


to 


s6 


~ cross sections are about 0.1 and 0.2X3h?/ 


U. N” (æ n) F” 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


ENERGY 


/2ua for the 
ground state and 0.88-Mev state ET see 
(Fu 54) and (Fa 54c). 

The energy of the y ray from the first excited state is 
870.5-£2.0 kev (without Doppler shift); the internal 
conversion coefficient is consistent with £2 radiation 
(Th 52). The p—y angular correlations are isotropic 
at Ea=0.8 Mev (Th 53) and at 1.7 and 2.0 Mev (Ph 52), 
consistent with J=3 for tle excited state. The lifetime 
of (2.541)X10 sec is considerably shorter than 
predicted by the extreme independent-particle model 
and appears to indicate some participation of core 
excitation in the transition (Th 53a). See also (Be 54b), 
(Ho 54d), (Kh 54), and (Th 54d). 


XIV. N'7(6-)O"##0% +n Qn=8.8 


The half-life is 4.14+-0.04 sec (Kn 48), 4.15-£0.1 sec 
(St 51a). Eg(max)=3.740.2 Mev (Al 49c): log fi=3.8. 
The £ decay proceeds to an excited state or states of 
O" which in turn decay to O! by neutron emission. 
The neutron spectrum has a maximum at 0.92-40.07 
Mev [(Al49c) and (Ha 49c)] and a half-width <0.5 
Mev. 


XV. O" (p p')O17* 
See (Wa 54). 


KVIE (Sis) OL Gan Oar OM 
See Fl’. 
XVII. F8(a)O"” Q,= 10.042 


The weighted mean of four Q-value determinations 
is 10.039+-0.0f0 Mev (Va 54); this includes a recent 
value of 10.028 by (Wa52b; mag. spectrometer). 
Observed a-particle groups are tabulated in (Aj 52c). 


XVII. Ne®(2a)0"™ Qm=— 0.603 
See (Jo 51e) and Ne”. 


F" 
(Fig. 31) 


I. FY(6HO" Qn=2.767 

The decay proceeds to the ground state of O": 
E;+(max) = 1.748+0.006 Mev, log/f=3.38. The spec- 
trum has the allowed shape down to 570 kev and an 
upper limit of one percent is placed on possible transi- 
tions to the 875-kev state of O" : logft>4.3 (Wo S4a). 
Recent values of the half-life are 66.0+-1.8 sec (Wo 54a), 
62.5+1.0 sec (Wa 54), 66.00.5 sec (Ko 54). See also 
(Aj 52c). 


Om=—4.747 
See (Li 37) and F®. 
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Fic. 31. Energy levels of F": for notation, see Fig. 1. 


II. O%(p po E,=0.594 


Observed anomalies in the scattering are exhibited 
in Table III(17) [(La 51d), (La 51e), (Ep 53), and 
(Se 54)]. In addition, sharp resonances (I <25 kev) 
are reported by (Se 54) corresponding to excited states 
of F"* at 5.05, 5.30, 5.50, 5.70, 5.90, 6.15, 6.75, 6.90, 
and 7.40 Mev. The differential cross section at Ep =2.00 
Mev, 0=167.2° (lab) is 94+2 mb/sterad. The low- 
energy cross section indicates a large S and a relatively 
large D phase shift (Ep 53). Differential cross sections 
have been measured at E,=9.5 Mev (Bu 53b). See also 
(BI 54a). 


IV. OM(py)EFY  On=0.594 


Nonresonant capture has been studied for £,=0.8 
to 2.1 Mev. This work indicates, in addition to the 
direct ground-state transition (yı), a transition (y2) toa 
state of F at 487-410 kev which then radiates (73) 


Taste III(17). Anomalies in O+-p scattering. 


7 s~ = 
Ep (Mev) T (kev) Fim fae References 
2.66 19.9 3.10 1/2> (La 51d), (La 51e), 
and (Ep 53 
3.47 <3.5 3.86 hem „(La 51d) Rnd (La 5ie) 
(3.99) ~500 4.35 (3/2*) (La 5id) and (La 51e) 
41 400 45 (3/27) (Se 54 A 
(4.39) 240 4.73 Bian (La 51d) and (La 51ef 
4.2 350 4.6 372+) (Se 54) > 
48 200 5.1 1/27 (Se 54) 
6.4 150 6.6 4/25 (Se 54) 
el 


° had 
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to the ground state. The ratio of ys to yı is about 10 
over the energy region studied. The yı and ys radiations 
are approximately isotropic, while y2 has an almost pure 
(sin’6) distribution (Z,=1.9 to 1.9 Mev). At E,=1.35 
Mev, the cross section for production of yə radiation is 
6-3 pub. The fact that most transitions involve the 
0.51-Mev state (yə) indicates that the nonresonant 
yield is not to be attributed to this state; it is suggested 
that direct, one-stage capture is involved (Wa 54). 

' he relative F” activity has been measured from 
E,=1.1 to 4.1 Mev. The cross section increases almost 
linearly with proton energy from 1.1 to 3.75 Mev 
except for a sharp resonance at 3.47 Mev [ (La 51d) and 
(La 5ie) ]. See also (Wi 53c). 


V. O(a n) O,=—1.631 


A neutron group has been observed corresponding to 
an excited state of F” at 0.536+0.010 Mev (Aj 51b), 
0.53&0.06 Mev [(El 51a) and (Mi 53)]. Angular dis- 
tributions of neutron groups to the ground and first 
excited state of F!’, analyzed by stripping theory, indi- 
cate J=3 or 5/2+ for the ground state and J=3* for 
the first excited state [(El 51a), (Mi 53), and (Aj 51b): 
Ea=8 and 3.1 Mev]. 

A neutron threshold determination indicates that 
the first excited state of F!” is at 5105 kev (Cook, 
Marion, and Bonner, private communication). 


VI. F8(y 2n)" Qm=— 19.567 
See F. 
ow 
(Fig. 32) 
I. C” (a p)NY Om=— 9.8 E,=6.238 
See (Su 51b). 
I. O'(¢ £)O8 
See (Gu 53c). 


Qn=3.724 


IM. O"(7 p)N" Q,=—8.0 E,=8.063 
See (Ch 49d). 


IV. OW (@a)C4 0,=1.825 E,=8.063 
The thermal cross section is 0.46-+0.11 b (Ma 47). 


V. O72 ~)O" Qn =5.838 


~~ Proton groups have been observed corresponding to 
the ground state (Qo=5.821+-0 010 Mev) and to a state 
at 1.9863-0.013 Mev. No other groups are observed 
corresponding to levels below 4.8 Mev [(Ah 54d) and 
(Ah 54e) ; mag. spectrometer: Ea=0.9 Mev ]. However, 
at Ea= 1.39 and 1.98 Mev, [ (Ho 54£); mag. spectrom- 
eter] report two states with Q=3.39340.016 and 
3.861-+-0.016 Mev correspgnding to states of O!8 at 
1.98 and 2.45 Mev. 
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Fic. 32. Energy levels of O18: for notation, see Fig. 1. 


VI. O87 p)N” 0O,=—16.1 


The cross section rises slowly for ~3 Mev above 
threshold (Mo 53b) and then shows the usual giant 
resonance at E,=24+2 Mev (c=37 mb). The inte- 
grated cross section to 25 Mev is 150--100 Mev-mb 
(St 51a). The integrated cross section to 180 Mev is 
500 Mev-mb (Re 54). 5 


VIL. F8(8+)08 Q,=1.671 
See F'8, 
Fis 
(Fig. 33) 
I. F8@408 Q,,=1.671 


The positron end point is 635-415 (Bl 49a), 649-49 
kev (Ru 51). The spectrum is simple. The half-life is 
112+1 min (Bl 49a): log ft=3.62. [The fact that the 
B transition to the ground state of O!8 (J=0t, T=1) is 
allowed suggests that F!8 (assumed T=0) has J=1*, 
since 0—0 transitions in 8 decay require AT=0 
(see C? and O%).] 


Il. N“ (ay) 0,=4.412 


Resonances have been observed at E«= 1.530+0.003 
Mev (T<1.5 kev) and 1.6170.003 Mev (T <1 kev). 
In both cases, transitions have been observed to an 
excited state at 1.0754-0.010 Mev with wI',=2.20.3 
ev. The transition from the first resonance to the ground 


ve 
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state has been found with wl',=0.7+0.1 ev. Angular 
distributions of the y rays are also reported (Pr 54c). 


I. N'(an)F! Qm=—4.747 E,=4.412 


Eleven resonances, from E«= 6.53 to Ea=8.58 Mev, 
are reported by (Fu 38). 


IV. N” (a p)O" Qn=—1.198 #,=4.412 
Observed resonances in the range E«a= 1.5 to 4.2 Mev 


are exhibitéd in Table I(18) [(He 53), (Ro 51g), and 
(Ch 47b) ]. See also (Aj 52c), (Ka 52a), and (Ch 53d). 


V. N¥(aa)N¥ E,=4.412 


Observed anomalies in the elastic scattering are 
exhibited in Table 1(18) [(De 39a), (Br39), and 
(He 53) ]]. The scattering cross section has nearly the 


Taste I(18). Resonances in N" (a p)O" and N¥(a@a)N¥. 


Eq (Mev) T(c.m.) (kev) Fist 
2.935% 2744 6.694 
3.140" 9348 6.854 

~3.50b ~460 ~7.1 
4.2b 7.7 
4.60 30 
5.28 oS 


b Nu (a p)OU: (Ro IEE 
° Nu (a a) NM: Dana 


5 Nula Bore and N(aa)N¥: (He 53) (widths are from first reaction). 
a NH (a SN De 39a) and, (Br 39). 


Fic. 33. Energy levels of F'8: for notation, see Fig. 1. 


Rutherford value for Ea<2.5 Mev; there is no indica- 
tion of an anomaly near Ea= 1.7 Mev [see N“(a p)O": 
(Aj 52c) ]. 


VI. O¥(dy)F8 Qn=7.527 
At Eaz=1.1 Mev, the cross section is <0.5 mb (Si 54). 


VIL. O8(dn)FY O,=—-1.631 E,=7.527 


The excitation function has been studied for E4=1.8 
to 5 Mev [(Bo 51b) and (Ne 37b) ]. 


VIII. O'8(d p)O” QOn=1.918 B,=7.527 

Resonances for both the ground and the excited-state 
proton groups are reported at E4= 1.7, 2.2, and 3.0 Mev 
by (He 48a). Broad maxima in the ground-state proton 
yield curve are found at 2.5, 2.9, 3.4, and 3.7 Mev, 
while the short-range protons show broad maxima in the 
yield at 2.6 and 3.3 Mev (Ea=2.4 to 3.8 Mev). The 
angular distributions at these energies indicate fhat™ 
the emission of short-range protons proceeds almost 
entirely by stripping while the ground-state protons 
(l,=2) show more the influence of, compound nucleus 
formation [(Va 54a) and T. F. Stratton, private com- 
munication]: [Compare (Be 54b)]. At Ea=19 Mev; 
the cross sections for the formation of the ground and 
0.87-Mev states of O” are, gespectively, 35.543.5 and base 
22.743.5 mb (Fr 53e). ‘ = 
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IX. o(d dO" 
See (Fr 53e). 


Ey=7.527 


X. OM(da)N“ Qn=3.116 Ey=7.527 
At Ea=19 Mev, the cross section for long-range 


alphas is 2.60.3 mb; the cross section for formation 
of the 2.31-Mev, T=1, level of N” is <0.1 mb (Fr 53e). 


XI. O" (tn) FS Qm= 1.270 
See (Po 51). 
XII. O'(He? p)F8 @m= 2.034 


See (Ku 53a) and (Po 52b). 


XM. O"(æ pn)F'S Qn=—18.532 
See (Te 47). * 

XIV. O" (dn) Fs Om=3.384 
See F”. 

XV. O8(pn)FS  Qm=— 2.453 


The threshold is E,=2.590--0.004 (Ri50e), 2.584 
+0.010 Mev (Ma 55). 


XVI. O8(He? NES Q0,,= —1.689 
See (Ku 53a). 

XVII. Ne’(64)F 0,,=4.2 
See Ne’. 

XVII. F?(yn)F8 Q,,=—10.408 
See F”, 

XIX. F8(n2n)F®§ Q,=—10.408 
See F”. 

XX. F8(pd)F8  Qm=-—8.183 


At #,=18 Mev, the angular distribution of ground- 
state deuterons, analyzed by pickup theory, indicates 
7=0 and therefore even parity, J=0+ or 1+ for F!8 
(Re 54a). 


XXI. FP9d)F8  Q„=—4.151 
See (Bo 50d) and (Sh 51a). 


a 
XXII. Ne*(da)F® Q,=2.781 
Qo=2.791+0.009 [(Mi 54b); mag. spectrometer]. 


At Ez=7.8 Mev, alpha-particle groups have been 
observed corresponding to levels in F! at 1.05, 1.83, 
“2.20, 2.61, 3.23, 3.92, 4.42, 5.01, and 5.61 Mev (Mi 51d). 
(No levels of F'8 have yet been observed in reactions 
which could lead to T= {states without violation of 
isotopic spin conservation.) 


c 
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Ne!’ 
(Not illustrated) 
Mass of Ne!’ 
From the beta decay (Go 54d), Ne!8— F!8= 4.20.2 
Mev. The mass defect is then 10.4+0.2 Mev. 
I. Nel8(64)F8 Om=4.2 


The maximum energy of the positrons is 3.20.2 
Mev, the half-life is 1.60.2 sec: logft=2.9+0.2 
(Go 54d). 


Il. O'8 (He? 2)Ne!® Q,=—3.0 
See (Ku 53a) 
IU. F(p 2n)Ne® Qm=— 15.4 
See (Go 54d). 
o: 
(Fig. 34) 
Mass of O! 


On the assumption that the proton. group with the 
highest Q value, observed by (Th 54a), (Ah 54b), and 
(Ho 54f) in O! (d p)O*, corresponds to the ground state 
of O”, the mass defect for O= 8.930+0.024. 

I. O%(6-)F¥ Qnm=4.781 
The decay is complex: see F”. 
Il. O8(ny)O® On=3.956 
The thermal cross section is 0.21-+0.04 mb (Hu 52e). 
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Ill. O8(d p)O” O,,=1.731 


Qo= 1.730-+0.008 [(Ah 54b); mag. spectrometer ]. 
Qo=1.732=£0.008 [ (Th 54a); mag. spectrometer ]. 
Qo=1.735=£0.008 [(Ho 54f); mag. spectrometer ]. 

A search for higher energy protons reveals no further 
groups with Q in the range 1.7 to 2.3_Mev or 2.6 to 3.4 
Mev; the Q=1.73-Mev group is therefore presumed to 
represent the ground state (Mi 54e). Proton groups 
corresponding to excited states of O have been ob- 
served with Q=1 6360.008 and 0.262-0.006 Mev 
[(Ah 54b) and (Mi 54e) J, 1.632-0.008 Mev and 0.263 
+0.010 Mev (Th 54a), 1.641-+0.008 Mev and 0.264 
+0.013 Mev (Ho 54f); O%*=0.096 and 1.470 Mev. 
No other proton groups are observed at Ea=2.18 Mev, 
= 140°, corresponding to states in O" below 2.4 Mev 
(Th 54a). 

Absolute differential cross sections have been meas- 
ured at Ey=3.01 Mev for the Q=0.3-Mev group. 
The angular distribution is peaked in the forward 
direction (c=213-16 mb/sterad at 5.4°, c.m.), and 
indicates /,=0 and therefore J=}* for the 1.47-Mev 
excited state of O" [T. F. Stratton, private communica- 
tion and (St 54g) ]. 


IV. F8(72 p)O® Qm=— 3.999 
See F” and (Je 50d). 
Fis 
(Fig. 35) 
I. O'8(¢ 2) FS Qm= 1.270 
See (Kn 45) and (Ho 50b). 


Ey= 11.678 


Il. O(a p)FÊ Q,=—8.124 

At E,.=20.6 Mev, proton groups are observed corre- 
sponding to excited states at 1.360.05, 2.67-0.05, 
and 3.92-40.05 Mev (Bu Sic). 


II. OV(dn)F® Qn=3.385 E= 13.792 
See (Da 40) and (Ho 50b). 

IV. Od p)OB On= 5.838 E= 13.792 
See O18. 

V. O%(da)N Qn=9.807  Es=13.792 


See N". 


VI. O8(pn)F® Qn=—2453 Ey=7.955 


Observed resonances are listed in Table I(19) 
[(Ma 55), (Ri50e), and (Bl 51a) ]. See also (Ma 53d). 


VIL. O'8(pa)N® Qn=3.969 Ey=7.955 
The excitation function indicates a small anomaly 
at E,=560 kev (Co 53h), and resonances at Ep= 6405 
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Tage 1(19). Resonances in O8(p nj)F*. 


Ep (Mev) P (key)® Rel, yield s J4 
2.657£+0.002* 4042 0.399 4 
Dale t 0.109 10.542 
2 3545 0.160 10.586 
Ki 604-2 0.807 10.839 3/2 
F 45-10 0.420 10.957 1/22) 
3.268-+-0.002" 65-62 1.000 11.050 3/2 
3.386-+0.002* 45-2 0.932 11.162 1/2(?) 
3.495-+6,004" 0.580 11.265 
3,600+0,020" $520 0.966 11.37 r 
3.755-+0.020* 0.966 11.51 
33° 50° 
3.8 150 
4.25 250 12.0 
5.08 500 12.8 
5.63 330 13.3 
6.20 300 13.8 
6.67 450 14.3 
—— = = — — = anna mera i 
a [(Ma 55) and (Ri 5Ge)]: thin target neutron yields. 
b (Ma 55). 
e Th 1 following values from (BI Sta}: stacked foil method; estimated 


cross sections in mb. ` 
4 (Ma 55): neutron angular distributions, 


kev, T~15 kev [(Se51b) and: (Co 53h)] [680 kev, 
(Mi52b): T=40 kev], and 850+5 kev, [~45 kev 
[(Se 51b) and (Co 53h) ] superposed on what may be 
one or more broad resonances. From a study of angular 
distributions in this region, (Co 53h) finds /=} and 4, 
respectively, for the levels in F” corresponding to the 
resonances at 640 and 850 kev. The absolute parity of 
these levels was not found but is stated to be opposite 
to that of the broad J=} level(s) which give the main 
contribution to the yield away from the location of the 
sharp resonances. See also (Ro 53c). 


VIII. O8(d2)F! Qn=5.729 


At Ea=2 Mev, @=0° and 20°, neutron groups are 
observed corresponding to levels of F” at 0.2, 0.9, 1.4, 
1.6, 2.2, 2.75, 3 85, 4.5, 4.8, 5.2, and 5.5 Mev [(Se 53c): 
+0.1 Mev]. 


IX. O8(6-)F¥ On=4.781 


The decay branches 30-10 percent to the 197-kev 
state (Eg=4.540.3 Mev) and 70 percent to a state at 
1.57 Mev (E3=2.9+0.3 Mev) [ (Bl 47a) and (Jo 54g) ]. 
With a half-life of 29 sec, log ff=5.55 and 4.33 respec- 
tively (Fe 51b). Gamma rays of energy 71= 136648 
kev, y2=199.6£1.5 kev, and ys=111.541.5 kev with 
relative intensities 0.67:1.00:0.04 are reported, as is 
the transition, ys, from the 1570-kev level to the 110- 
kev level. The y:—v2 coincidences indicate that yi 
arises from the transition 1570-197. The transition 
197-110 has an intensity <0.5 percent, and the transi- 
tions 1570-ground state and 157071350 are <3 and 4 
percent respectively. The lifetime of the 197-kev 
state is (1.04+0.2)X 107 sec, while that of the 110-kev 
state is K10-® sec [compare F!(p p’)F%*]. Lower 
limits on log ft for 8 transitjons to the ground state and 
those at 110 and 1350 kev are 6.5, 7.3, and 5.3 (Jo 54g). 
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Fic. 35. Energy levels of F”: for notation, see Fig. 1. 


The present observations, together with Coulomb 
excitation data [F¥(aa’)F"*], require J=} and 
§/2* for the 110-kevy and 197-kev levels [see also 
F9(p p')F*]. The assignment J=3+ for the 1570-kev 
state is obtained from thé O! beta decay, from the 
yı— Y2 angular correlation, and a study of 4. If the 
beta transitions to the 1570- and 197-kev states are 
allowed and that to the ground state is forbidden, it 
follows that O” has J=5/2+ (Jo 54g). See also (Ho 50b). 


X. F"(yn)F® Qn=—10.408 
Discontinuities in the activation curve are observed 


a at several y-ray energies in the range Z,=10 to 15 


* CC-O. In Public Domain. Gurukul Kangri Collection, Haridwar 


Mev [(Go 54f) and (Ta 54)] [see Table II(19)]. A 
“giant” resonance appears at E,=22.2 Mev, T=5.6 
Mey, integrated cross section=77 Mev-mb [(Fe 54): 
see also (Ho 52a) ]. See also (Pe 52e), (Pe 53c), and 
(Sc 54). 


XI. Fy 2n)F" Qn=—19.567 
See (Ho 52a). 
XII. F? (n n’) F9* 


For £,<2.0 Mev, gamma rays are observed with 
E£,=110, 200, and 1380 kev. The threshold for the 1380 
kev y ray is at E,~1.7 Mev, indicating that it arises 


ee 


%¢ 
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from the 1.57-Mev, rather than from the 1.35-Mev, 
state. The excitation functions for the 110-kev and 
200-kev gamma rays are consistent with J = 4 and 5/2* 
for the 0.110-Mev and 0.197-Mev states of F” [J. M. 
Freeman, private communication]: (Va 54b) find the 
threshold for Z, = 1.37-Mey radiation to be at E,=1.57 
Mev. 

At £,=2.5 Mev, gamma rays are observed with 
E,=11041, 19742, 1254+20, 1349-15, 1460+30, 
and (1560-30) kev. The upper limit to the intensity 
of the cascade decay of the 0.2-Mev state is 2 percent of 
the direct ground-state transition [(Da 53c) and R. B. 
Day, private communication]. See also F”. 


XIII. F(p p) F* 


In the range E,=0.6 to 1.9 Mev, two inelastic groups 
appear, corresponding to I" levels at 110 and 197 key 
[(Pe 54b), (Pe 54d), (Ba 55), and (Mi55)]. See Table 
IIL[(19). Ground-state transitions are observed for both 
levels; the’intensity of the 197-110-kev cascade is less 
than 1 percent of the 197-kev transition (Ba 55). 
The mean lives, measured by the recoil technique, 
are (1.0-+0.25)10-*® sec for the 110-kev state and 
0.8 10-7 sec (with an uncertainty of a factor of two) 
for the 197-kev state [(Th 54): compare O(6-)F]. 
These lifetimes limit the 197-kev transition to £2 or 
faster, and the 110-kev transition to #1 or M1 [ (Ba 55) 
and (Sh 54e) ]. The K-shell internal conversion coeffi- 
cients agree within experimental error, +25 percent, 
with an £1 assignment for the 110-kev transition and 
with £2 for the 197-kev transition. Since the ground 
state has J=4*, these results indicate J=4~ or 3~ for 


the 110-kev state and J=5/2* for the 197-kev state’ 


(Mi 55). If the 197-kev state were J = 3*, the M1 transi- 
tion would be expected to outweigh the £2 by a factor 
~10'. If the 110-kev state were J=}, the cascade 
transition would be E1 and should be >10‘ times 
stronger than observed [ (Ba 55) and (Sh 54e) ]. 

The reaction exhibits a number of pronounced reso- 
nances (see Ne”); in all cases, the angular distribution 
of 110-kev y rays is isotropic, suggesting J=} for the 
110-kev state. At known J=2- resonances, the 197-kev 
radiation has the distribution 1+ A cos?@, with A=0.17 
to 0.44 (Ba 55). Such a distribution requires £2 radia- 
tion, J=8+ or 5/2+, and excludes M1 (Sh S4e). The 
angular distribution of protons leading to the 197-kev 


Taste II(19). Discontinuities in F" (y n)F!8 cross section. 


Ey (Mevy)* Ey (Mev)? Yield (Mev-mb) 


109 11.0 0.2 
9 

3 11.5 0.3 

11.9 11.9 0.9 

2 G3 15.3 0.5 


b ue oy et iat Yields are integrated cross sections for indi- 


vidual levels. 


LEVELS OF LIGHT NUCLEI F19, 


CC-O. In Public Domain. Gurukul Kangri Collectior 


Nets : 147 


TABLE suit Energies of the first two excited states of F", 


a b c d e 


108.8 +0.48 1101 
196.0-b1.4 197.5415 


109,1 41.0 HO4e1 11342 I 


f 
E 2 te 
196.8 £1.5 1972 196-42 195 #1. 


ic protona, mag. spectronfeter. 

+ y rays. 

(M , internal conversion. 

OR, B, Day, priy ate communication: Fian wF, y raya. 
e (He 54 d (Te 54b): F* SEIN 198, y rays. 

1 (Jo 54): laa) F, y raya 

e (Jo 54g): O19 decay, y rays. 


state from 27 levels of Ne” is consistent with J=3* or 
5/2* [(Pe 54b), (Pe 54d), (Ch 54b), and (Sh 54e)]. 
The assignment J=5/2* for the 197-kev state is con- 
sistent with, and the assignment J=} for the 110-kev 
state is required by, the Coulomb excitation results. 
See F(a a’) F'%*, 

At £,=8 Mev, proton groups are observed corre- 
sponding to states of F” at 1.37, 1.59, 2.82, 3.94, 4.06, 
4.41, 4.48, 4.59, and 4.76 Mev. No other groups of 
comparable intensity are observed for E.<6.7 Mev 
(Ar 52e). See also (Ch 54d) and (Co 52g). 


XIV. HY (dds) Re 


At E,=1.05 Mev, y rays are observed from the ex- 
citation of the 0.11- and 0.20-Mev states of F (Th 54d). 


XV. F(a a!) F* 
Gamma radiation of energy 110 kev and 197 kev 
[see Table II1(19) ] is observed for Ea=0.6 to 3.5 Mev 
[(He 54b), (Sh 54e), (Jo 54), and (Te 54b)]. The ex- 
citation function is characterized by a slow increase 
with energy over the entire range, with resonance struc- 
ture superimposed for E.>2 Mev. The shape of the rise 
( (He 54b), (Sh 54e), (Jo 54), and (Te 54b) ], the abso- 
lute cross sections at low energies, and the y-ray angular 
distributions agree well with Coulomb excitation, as- 
suming £1 and £2 excitation for the 110- and 197-kev 
levels respectively, and the derived transition probabili- 
ties are in satisfactory agreement with direct lifetime 
measurements (Sh 54e); see F(p p')F"*. See also x 
(Te 54) and (Sh 54c). ? 
A 


XVI. Ne®(6*)F® 0,,=3.256 
See Ne”. 

XVIL Ne (da)F® On=6.434 
Qo=6.432+0.010 Mev [(Mi52) ; mag. spectrograph |. 


At Ea=2.13 Mev, alpha-particle groups are observi 
corresponding to SRA states of F? at 113482 
192+12 kev (Mi 52). 


Ne! 2 
(Fig. 36) 
I. Ne(B)F® Qn=3.256 


The positron end point 
The half-life is 18.50. 
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4.261 
O+ Hen 
3.658 
3.435 _ Ne®°+He®-a 
O"+a 


joass.0202 — 
19 (J=1/2*) 


4557 
TS 521 
4.253 


| Ne 
2moc | 
n2354 
3.274 
FM He`-t 
-4.038 
F"? +p-n 
) Fic. 36. Energy levels of Ne”: for notation, see Fig. 1. 


(Wh 39). The absence of low-energy y rays (see F") 
indicates that the transition takes place to the ground 
state, J=}*, of F”: log ft=3.30; log ft for the 110-kev, 
J=}, state of F® is >6.0; logft to the 200-kev, 
J=5/2*, state is >5.5. It follows that the ground state 
of Ne” is J=3+* (Jo 54g). 

The spectrum of Ne’ recoils suggests that the Fermi 
part of the beta interaction is scalar [(Al54) and 
(Ma 541) J. 


If. O'(a@m)Ne® 0,,=—12.162 
See (Te 47). 


Il. F'(pn)Ne® @m=— 4.038 

The threshold for the ground state is 4.253--0.005 
Mev (Wi52a). Slow neutron thresholds are observed 
corresponding to states of Ne! at 25545 and 289-+5 
kev (Cook, Marion, and Bonner, private communica- 


tion). 
YT, Nen (y n)Ne® Q,=—16.908 
See Ne”. 
O70 
(Not illustrated) 
i Mass of 0% 


(Ba 39e) calculates the mass of O” to be 20.0168 
amu; mass defect=15.6 Mev. By an extrapolation 


o 


AND T. 
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from heavier A =4n isobars, (Sh 54d) predict a mass 
difference O?—]°=7.4 Mev, mass defect O”%=13.3 
Mev. We adopt: O” mass defect= 13.342 Mev. Then 
O” would be stable to neutron emission by ~4 Mev. 

O% has not been observed. Two possible reactions 
leading to O” are O'8(/ p)O” (On = — 0.5), Ne”? (n He?)0” 
(Qn= —22): 


E2" 
(Fig. 37) 
I. PASNE Qn =7.052 


See Ne”, 


IW. O8d2)FP 0,=5.729 H,=12.329 


See F”. 


III. O(g p)O” 
See OP, 


IV. O8(da)N'® Qm=4.24 
See N16, 


V. F9(ny)F2  Qm=6.600 


The thermal cross section is 9.4+2 mb (Se 47). 
A y ray of energy 6.63-40.03 Mev is observed, with an 
intensity of 0.35-40.10 photon per capture (Ki 51a). 
Two resonances for F® production are observed at 
E,=280 and 590 kev, with widths ~20 kev and peak 
cross sections of 1.2 and 1.3 mb, respectively; wI'y~15 
ev [(He 50b) and (Hu 54a) ]. 


VI. F? (n )F¥ E,=6.600 € 


The coherent scattering cross section is 3.8-40.3 b; 
the total cross section (epithermal, bound) is 4.00.1 
b (Hu 52e). The cross section is constant at 3.8 b 
from 600 ev to 20 kev (Hu 54a). Resonances in the 
total cross section are listed in Table I(20). If the 100- 


TABLE I(20). Resonances in F(x )F¥. 


En (kev) T (kev) Smax (bn) Fick 
28.28 3 29.0 6.63 
49.78 5 30.0 6.65 
99.5 16 25.0 6.70 

Lo} 
275» 30 10.0 6.86 
340> ~200 8.0 6.92 
420» 25 8.8 7.00 
510° 35 5.7 7.08 
600° 30 6.0 7.17 
950° 3.3 7.50 
1240° 3.0 7.78 
1660° 3.4 8.18 
2040° 3.2 8.54 


a (Hi 54d). Cross sections i i i 
of a si. Sense eet Areas estimated from published curves 


è (Bo , 
e (Wi 51) and (Hu 54a). 
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Fic. 37. Energy lévels of F”; 
for notation, see Fig. 1. The pro- 
ton binding energy is 10.599 Mev. 


Ne2° Js0* 


kev resonance is assumed to arise from s-wave neutrons 
forming a state of J=1, the reduced width is 0.006 of 
the sum-rule limit [ (Bo 50f) and (Vo 54) ]. The average 
total cross section has been measured for Z,=3 to 14 
Mev; it decreases from 2.2 b at 3 Mev to 1.7 b at 
6 Mev and thereafter remains approximately constant 
at that value (Ne 54b). At Z,=14 Mev, a= 1.70+0.05 
b (Co 52h). At 19.0 Mev, «= 1.84+0.06 b (Da 53f). 


VIL. F?(n 0/)F* E,= 6.600 


Observed resonances in the yield of 0.1- and 0.2-Mev 
y radiation (see F") for Z,<1 Mev are exhibited in 
Table II(20). A satisfactory fit to the excitation func- 


tion for E,<0.35 Mev is obtained on the assumption 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar. 


of an s-wave resonance at E,=0.100 Mev and a p-wave 
resonance at E,=0.270 Mev (J. M. Freeman, private 
communication). The cross section for production of 
1.3-Mev radiation is 0.52+0.18 b at Z,=2.5 Mev 
(Gr 51a). See also (Va 54b) and (Da 53c). 

R D amn 


VII. F!(22n)F2 Q,=—10.408 E,=6.600 


At E,=14.5 Mev, the cross section is 60.620 mb * 
(Pa 53a). See also (Aj 52c). 


IX. F9(1 p)O® Qn =— 3.999 E,= 6.660 Si 


At E,=14.5 Mev, the cross section is 13550 mb 
(Pa 53a). See also (Ho 30b). - 
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TABLE IT (20). Resonances in F” (n n IES, 
(J. M. Gisela private communica ine) 


Er) = §(0.1-Mev E,@) — o8(0.2-Mev 
EA Q (bn) 2 (kev) : (bn) P Fret 
270 S2 300(?) 0.7 6.86 
390(?) 1.5 420 2.0 7.00 
500(?) 1.1 7.08(?) 
780(?) 0.6 780 0.8 7.34 
830 0.8 7.39 
880 0.7 7.44 
950 0.7 950 1.3 7.50 


a Cress sections 440 percent. 


X. F¥(n aN QOn=—1.49 E= 6.600 


Resonances are reported at H,=4.3 and 4.8 Mev by 
(Wi 37e). See also N+’. 


XI. F!(d p)F® - Qm=4.375 

Energy levels of F* derived from the proton groups 
[see, in particular, (Wa 52b) J are tabulated in (Aj 52c). 
At Ea=3.6 Mev, the angular distribution of the ground- 
state protons, analyzed by stripping theory, indicates a 
mixture of 7,=0 and 2 and therefore J=1+* for the 
ground state of F. The 0.65- and 2.05-Mev levels 
(unresolved) appear to be formed by /,=2: J=1, 2, 3*; 
the 3.49- and 3.53-Mev levels (unresolved) by /,=0: 
J=0, 1+. Possible configuration assignments consistent 
with these data and with the 8 decay are discussed 
(Br 53c). See also (B1 53). 


XII. Ne®(da)F® Q,,=2.672 


Alpha-particle groups have been observed at Ey=7.8 
Mev corresponding to the ground state and to a level 
at 0.570.13 Mev (Mi 51d). 


XII. Na®(nc)F2 Qn=—3.898 
See (Je 50d). 


Ne? 
(Fig. 38) 
I. O(a 2)Ne® Om=—12.162 E,=4.746 
See (Te 47). 
Il. O(a p)F# Om=—8.124 E,=4.746 
See F". i 


Til. O(a pn) F# 
See (Te 47). 


Qn=—18.532 E,=4.746 


IV. O'%(aa)0' E,=4.746 


_ The elastic scattering has been studied in the range 
= E.=0. 9 to 4.0 Mev by (Ca 52b) and from Ea=3.9 to 
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Taste II1(20), Levels in Ne” from O'8(@ a)O". 


ater) TAG) -43/3M2/2ua) New Tie 
2.490" 24 0.22 6.738 oF 
3.045 10 0:36 7.182 3- 
3,090 5 0.011 7.218 ò+ 
3.380 10 0.047 7.450 2+ 
3.885 3 0.006 7.854 2+ 
5.6 9.2 (1-) 
6.7 10.1 (1-) 


a The first five values are from (Ca 53b), and H. T. Richards, private 
communication; the last two are from (Fe 40). 


6.9 Mev by (Fe 40). Observed resonances and assign- 
ments are exhibited in Table III(20). 


V. F(p y)Ne* 0O,,=12.870 


Resonances for capture radiation are listed in Table 
IV(20). In all cases the radiation is of about 12-Mev 
energy, indicating transitions to the 1.64-Mev, J=2* 
state of Ne” (Si54). For the &,=669-kev resonance 
this assumption is confirmed by direct measurement of 


the y-ray energy, H,=12.09+0.28 Mev (Ca 51a), and , 


the observation of 1.66-40.02-Mev radiation in coin- 
cidence, to the extent of approximately one quantum 
per capture (Jo 52e). The fraction of capture processes 
at this resonance is 1.72--0.25 percent of the (6+7) 
Mev radiation [from F!°(p ay)O'*] (Ca 51a). The angu- 
lar distribution of the hard radiation is isotropic within 
2 percent (Ha 54e). The y-y angular correlation is 
1—(0.39+-0.08) cos’@, consistent with J=1 for 
the E,=669-kev state [known to be J=1+ from 
F(p ay)O!*], J=1 or 2 for the 1.63-Mev state, and 
J=0 for the ground-state; the high-energy component 
is dipole (Clegg, Jones, and Wilkinson, private com- 
munication). It is not clear why the direct ground-state 
transition does not occur. 

A search for resonant capture radiation at #,=598 
kev (J=2- level) and E,=874 kev (J=27) leads to an 
upper limit of 0.04 percent and 0.05 percent, respec- 
tively for the fraction of such transitions relative to 
F®(p ay)O!8; Py<15 and <2 ev. For the upper level, 
assumed to be T=0, the expected £1 transition is 
inhibited by a factor of at least 30, presumably by the 
isotopic spin selection rule (Wi53g). (Ha 54e) report 
weak resonances at #,=874, 935, 1280, 1355, and 1380 
key [yield relative to F(p ay)O!®~0.1 percent] as well 
as a relatively strong resonance at E,=980 kev, in 
addition to those found by (Si 54). See also (Wi 530). 


TABLE IV(20). Resonances in F!9(p y)Ne*®* (Si 54). 


Ep Ty Yields 3 a 
(kev) (kev) (percent) (mb) Ne2* 
669 7.5 1.8 0.48 13.505 
1092 <1.2 1.3 >0.05 13.907 
1324 4.0 >1.5> 0.081 14.129 
1431 15.7 >1.5> 0.19 14.230 


re a ee nee Ps eee 
a Relative to F?(p ay) O18, 
b No resonance is observed for F(p ay) O1, 


~S 
oe 
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Ne ; 
12154 
Fic. 38. Energy levels of Ne”: for notation, see Fig. 1. v peaa 


VI. (a) F9(p p)F# E,=12.870 


(b) F! (p TAO h 
The elastic scattering exhibits marked anomalies at 
energies corresponding to the F(p ay)O' resonances 
at E,=669, 874, 935, 1355, 1381, to the F'(p y)Ne® 
resonance at 1431 kev and to the F 1(p a) O** resonance 
at Ep=845 kev. Unresolved structure is also evident 


near E,=1700 key. Analysis of the angular distribu- 
tions confirms the assignment [see F?(pa)O'*] of 
J=1+ for the E,=669-kev and 935-kev resonances 
and permits assignments of J=0* for the 845-kev 
resonance, of J=2- (17 not excluded) for E,=1381 


kev, and of J=1* for E,=1431 kev [(WeS4a), 


(We 55), and (Ba 55a): see also (Pe 54b) J. me 


> 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


152 F. 


Resonances for inelastic scattering involving the 
110-kev, J=}, and 197-kev, J=5/2", states of F” are 
listed in Table V(20), In general, the resonances ob- 
served are identical with those reported for other FY+ p 
reactions, although the relative intensities differ greatly. 
The known J=1* levels formed by s-wave protons at 
E,= 669, 935, and 1431 kev lead to the 110-kev state 
of F" in preference to that at 197 kev, while for the 
p-wave, J=2-, 873-kev resonance the reverse is true; 
the J=2-, 1355-, and 1381-kev resonances lead to both. 
The nonresonant yield of the 197-kev radiation appears 
to be mainly due to Coulomb excitation, while that of 
the 110-kev radiation suggests a contribution from 
broad, unresolved s-wave resonances [(Ba 55) and 
(Pe 54b): for angular distributions, see FJ. Cross 
sections [see Table V(20)] and angular distributions of 
the inelastic protons at several resonances are reported 
by (Pe 54d) [see F? and (Ch 54b) ]. 


VII. F8(pa)O% Q,=8.124 E,=12.870 


Five a-particle groups are reported. All show reso- 
nance effects with relative intensities varying greatly 
with bombarding energy. The long-range group (ao) 
leaves O'S in the ground state (J=0*) ; the next longest 
(az) results in the formation of a J=0* nuclear pair- 
emitting state at 6.06 Mev, while the three remaining 
groups (c1,a2,a3) lead to y-ray emitting states at 6.14 
(J=3-), 6.91 (J=2+), and 7.12 Mev (J=1-). Reso- 
nances for ao and œr [Tables VI(20) and VII(20) ] are 
generally identical and different from those for ay, a2, as 
[Table VIII(20)]. The resonances for a and a, are 
required to have even J and even parity or odd J and 
odd parity, while the a1, a2, and as resonances, insofar 
as their assignments are known, are all odd-even or 
even-odd. 

At E,=224 kev, the angular distribution of 6-Mev 
radiation is anisotropic (Ne 54d). The 6-7-Mev radia- 
tion produced at the Z,=340, 483, 669, and 935-kev 


TABLE V(20). Resonances in F!(p p’)F!* (Ba 55). 


(kev) Toa (kev) ay Gey 
669 1+ 7.5 2043 (<0.2) 
780 ~10 02 ~5 
831 8.3 <0.2) AR 
845 or 30 ~2 (<0.3) 
873 2 5.2 ae 8720 
900 4.8 <0.3) ~20 
935 it 8.0 147422 (<1) 

1092 <1.2 >13 >100 
1137 3.7 (<0.4) ~20 
IR) ~86 1743 (<1) 
1290 19 (<1) 31 
1355 Qe 4.5 1744 274-7 
1381 I 15 26-44 40+-6> 
1431 JE 14.6 1894+30: 7424 
1620+10 ~5 1233 (<2) 
1670+10 3 (<2) 
171020 3647 1745 


ee ——————————————— EET 


a Values in parentheses are upper limits of resonance contributions. 
be.d (Pe 54d) report 42.7 +4, 187 +15, and 7+2 for these cross sec- 
tions, from the inelastic proton yields. 
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Taste VI(20). Resonances for ground-state @ particles 
(ao) in F!9(p «)O'.* 


Ep (kev) 


T (kev) o (mb) pce Ne? 

720 35 0.2 13.56 
780 <10 >0.3 13.62 
840 25 0.5 Ose 13.68 
1100 ~60 2 ý 13.92 
1370 ~25 16 14.17 
1367 54 46.5 De 14.17 
1720 142 54.7- oF 14.50 
1865 134 77.3 il 14.64 
2125 <80 6.7 4 14.89 
2325 <80 22.6 Dis 15.08 
2600 ~300 37.3 or 15.34 


a The first five items are from A. L. Schardt and W. A. Fowler (private 
communication); cross sections include only the resonant part. See also 
(Ch 50). The remainder are from (Pa 53), (Pa 53c), and R. L. Clarke, 
private communication. 

b From F9(p p) FY. 


TABLE VII(20). Nuclear pair resonances (a,) in 
F(p a)O!* [(Ch 50), (Ph 51a), and (De 54a)]. 


Ep (kev) Ts (kev) T> (kev) > o® (mb) > Net 
710 35 13.55 
780 13.61 
842 30 24 162 13.67 

1130 60 43 1.0 13.94 

1236 75 58 PEG 14.04 

1367 25 26 8.5 14.17 

(1.75)? 14.53 

(1.9) 14.67 

a (Ch 50). 
b (Ph 51a). 


resonances is isotropic [(De 49d), (Da 51b), (Sa 52), 
and (Sa 53c) ]. At Ep=340, 669, and 935 kev, the ai—y 
correlations establish that J=1+ for the corresponding 
Ne” levels [(Ar 50), (Ba50f), and (Se52b)]: this 
assignment is confirmed for the 935-kev ‘resonance by 
the a;-angular distribution (Pe 54d). Correlations and 
angular distributions at the H,=874-kev resonance 
establish J=27- for this level [ (Se 52b) and (Pe 54d) J, 
and the observed polarization of the y rays is con- 
sistent with this assignment (Fa 53a). The level 
corresponding to E p=598 kev is assigned J=2— by 
(Ch 50a) on the basis of y-ray angular distributions; 
this assignment is confirmed by J. Seed and G. Dearnley; 
see (Wi 53g). Levels corresponding to resonances at 
E,=1290 and 1355 kev are assigned J=3+ by (Ch 50a). 
Angular distributions of the a groups at E,=1290 kev 
are not inconsistent with J=3+, but lead to J=2- for 
the 1355-kev resonance (Pe 54d). Gamma-ray angular 
distributions at E,=1381 kev indicate a pronounced 
anisotropy, and are consistent with J=2- [(Sa 53c); 
see also (Pe 54d) and F(p p)F"9]. This is evidently not 
the same as the 1.38-Mev ap resonance. 

An analysis of certain Ne” levels into partial widths, 
based on information from F(p p), (p £), (py), and 
(p a) is given by (Ba 55) and (Ba 55a). It is noted that a 
number of T=1 levels may be expected in this region 
(compare F”). See also (Aj 52c) and (Be 53b). 
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VIII. F” (pn) Ne” O,=—4.038 £,= 12.870 


Observed resonances are listed in Table [X(20) 
[(Wi 52a), and (BI 51a)]. i 


IX. F(dn)Ne® Qm=10.645 


Neutron ‘groups corresponding to levels at 1.5, 2.2, 
AD oven 71, 7.8, 9.0, and 10.1 Mev are reported 
[(Bo 40b) ” and (Po 42}]. Gamma rays of energy 
9 340.1, 9.9740.1, 10.614-0.1, and 11.51+0.2 Mev 
are reported by Bent, Bonner, and Sippel (private 
communication). See also (Te 51a). Thresholds for 
y radiation are observed at Hy=1.15 and 1.35 Mev, 
corresponding to excited states of Ne” at 11.69 and 
11.87 Mev (Bu 54b). See also (Aj 52c) and (Th 54d). 
At Ea=4 Mev, the angular distribution of ground-state 
neutrons, analyzed by stripping theory, indicates 1,=0 
and therefore the same parity for Ne” and F, J=0 or 1 
for the former (Br 52b). 


X. O(N Om=7.052 

The decay is to the 1.6-Mev state of Ne”: Eg(max) 
5.4060. 017 Mev 
(Al 52a). The energy of the subsequent gamma ray is 
1.627+0.005 (Wo 54b), 1.6310.006 Mev (Al 52a). 
The relative intensity of the ground-state transition is 
<3.2X10-4 (Wo 54b). The Fermi plot is straight to 
E,=1 Mev. With a half-life of 11.4 sec, log/t (excited 
state) =4.99, log ft (ground state) 29 (Wo 54b). 


XI. Ne®(y 2)Ne” Q,,=—16.908 


A giant resonance is observed at 21.5 Mev with a 
width of 6.6 Mev and a peak cross section of 7.3 mb 
(Fe 54). 


TABLE VIII(20). Sara am 6-7 Mev y rays* (aranaz) 
F” yOu 
Nole added in proof —Recent work indicates that the E = 1355 
and 1381-kev resonances are actually located at Zp=1346 and 
1372 kev: see (Ba 55). 


Ep (kev)? r (key)e œ (mb)e œ (mb)> J, x Ne2* 
224.44 1.04 >0.2 12.083 
340.44 2.94 59 160 ite 13.193 
483.14 2.2 29 >32 13.329 
598 37 4 7.1 Gar 13.438 
669 7.5 25 57 T 13.505 
831 8.3 7 19 13.659 
873.5° 5.2 250 540 2 13.700 
900 4.8 9 23 13.725 
935.3 8.0 88 180 1* 13.759 

1092 <1.2> >24 > 13 13.907 
1137 3.7 15 13.950 
1176 ~130 7 19 13.987 
1290 19.2 19 29 (3*) 14.096 
1355 4.5! 79 89 2y 14.157 
1381 15.0 190 300 ae 14.182 


For aioe 6 Mev, see (Wi 52a). 
Ch 50). 


BEC 
© (Bo 48c). 

Hu 5. 
5 cu rie See 53) find 872.5 41.8 kev. 
t (Ba 55) and (Bo 48 
£ (ch Be Ar 50) and (Ba SOf). 
b tch 50a) and (Wi 538)- 
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TABLE ET Resonances in ia nN”. s 


Ep (Mev) T (kev) Nem 
4.29 45 16.95 
4.46 80 17.11 
4.49 20 17.14 
4.57 20 ’ 17.21 
4.62 60 17.26 
4.71 25 17.35 
4.78 45 17.41 
4.99 20 17.61 > 
5.07 30 17.69 
5.20 70 17.81 
4.84 
5.41 18.01 
5.78 18.36 
6.09 18.65 
6.47 19.02 


a The first ten values are from (Wi 52a), the others are from (BI 51a), 
broad resonances, stacked-foil method. 
> 


XII. Ne (ya)O On=—4.746 


The reaction has been observed with (Li’+ p) y rays. 
The cross section for transitions to the 6- and 7-Mev 
states of O" is one or two orders of magnitude greater 
than the cross section for the reaction to the ground 
state of O" [(Er 53a) and C. A. Barnes, private com- 
munication ]. 


XIII. Ne? (p p’)Ne™* 


Inelastic proton groups corresponding to levels at 
1.44 and 4.36 Mev are reported by (Co 52g). A 1.64- 
Mev y ray is observed for E,=2 to + Mev (Co 54). 
No evidence is found for a level at 2.2 Mev [(Co 52g) 
and (Co 54)]. Study of the inelastic scattering at 
E,=2.725 Mev indicates 2<J <7, even parity, for the 
1.63-Mev state of Ne” [(Ga 53) and A. Galonsky, 
private communication]. See also (Ro53d) and 
(En 54). 


XIV. Ne®(d d’/)Ne™* 


At Ea=7.8 Mev, an inelastic deuteron group is ob- 
served corresponding to a state at 1.660.02 Mev 
(Mi 51d). At that energy the cross section is 98 mb 
for the inelastic group; the angular distribution shows a 
pronounced forward peak and a secondary maximum 
near be.m. = 85° (Mi 520). 


XV. Ne? (a a) Ne” 
See (En 54) : Mg”. 


XVI. Na®(6t)Ne® Qm=15.3 wee 
The decay proceeds to excited states of Ne” between 
6.8 and 10.8 Mev which decay by a-particle emission * 


(Al 50g). The half-life is 0.23-0.08 sec (Sh 51c}, 3 


0.385-£0.01 sec (Ho 53a). See also (En 54). a 
x 65 Si 
XVI. Na”(pa) Ne” Qn=2.372 s T 
The weighted mean of several Q-value determination 


is 2.378+0.003 Mev (Va 54). gare” oia “ae ee 


| 


i 
3 
í 
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F 
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+0.02 (St 54e), 1.6294+0.008 Mev (Ne 54e), has been II. F9(dn)Ne® Qm=10.645  2,=17.400 è 
observed as have alphas corresponding to an excited See (Bu 520). 
state of Ne” at 1.6342-0.004 Mev (Do 53). At a reso- 
nance located at Ep=1.255 Mev, the a—y correlation L Fe p» =4.375 E,=17.400 
permits the unique assignment of J=1* to the Mg™ I. F°(d $) Qm=4.37 v d 
state and J=2* to the 1.63-Mev state of Ne” (Se 53b). See (Sn 50a). 
See also (Ca 54b). 
New IV. F8(d ÀF! Qm=— 4.151 L,=17.400 ' { 
e? 
~ See (Kr 41). t 
(Fig. 39) j 
I. O(a n)Ne% QOn=—0.704 V. F9(da)O" Qm=10.040 £,=17.400 ry 
See (Ro 44a) and (Ho 50b). See (Bu 38). 
5 > 
L 49 
AG A fs 
21.551 
Fest 7 
19.862 
o° +He® be 
a 
17400 
13.025 6 
Fp Fic. 39. Energy levels of Ne” 
for notation, see Fig. 1. l 
q 
y 
=è 
1er | 
3.522 Ng. 
2m,c* 
> sf bt 
0.347 PRAVA 0.499 
[E i] Ne®°+t-d 
D Ne” Je, 2t fi 
nas o™*a-n 
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VI. Ne” (n n)Ne® Ey= 6.756 


The total cross section for thermal neutrons is 2.79 
+0.14 b (Ca41); the scattering cross section is 
2.40.2 b (Ha 50f). 


VII. Ne*®(%a)O"” 0,=—0.603 E,=6.756 

An excitation function from £,=1.8 to 3.3 Mev indi- 
cates resonances at E,%=2.12, 2.45, 2.62, 2.72, 2.87, 
and 3.26 Mev [(Jo5le); see (Aj 52c)]. Numerous 
additional resonances, extending to E,=7 Mev, are 
reported by (Gi54b: continuous neutron spectrum), and 
(F1 53: d—d neutrons). See also (Si 50). 


VILL. Ne%(d p)Ne* On=4.530 


Qo=4.5290.007 [ (Va 52); mag. spectrometer]. 
Qo=4.526+0.009 [(Ah 54) ; mag. spectrometer]. 


Levels derived from proton group observations are 
listed in»Table I(21) [(Ah 54) and (Mi 51d)]. A 
study of the angular distributions of the proton groups, 


TABLE I(21). Proton groups from Ne®(d p)Ne*!. 


Ne** (Mev)* Ne=* (Mev)® J, x° o (mb)¢ 
0 0 3/2, 5/2* 13 
0.349-£0.006 0.33-£0.05 3/2, 5/2* 21 
1.68£0.07 1/2, 3/2 14 
2.788-+0.008 2.79-0.05 1/2* 50 
3.73-£0.06 <5/2 21 
4.71-£0.05 5/2 97 
5.44+0.05 <5/2 38 
5.74-£0.05 <5/2 41 
(6.66-0.09) 
7,300.06 <5/2 54 
8.28-£0.06 
8.91-0.06 
9 


a (Ah 54) and K. Ahnlund, private communication; Ea =0.875 Mev. 
b (Mi 51d); Ea =7.8 Mey. | a 
o (Mi 52c); angular distributions analyzed by stripping theory. 


analyzed by stripping theory, gives J, assignments for 
the levels [see Table 1(21) ]; the ratio of the intensities 
of the ground and first excited state groups suggests 
that these states comprise a 2D term, with J=} and 
5/2, respectively. At Ea=8 Mev, the cross section for 
the reaction is 500 mb (Mi 52c). See also (Zu 50a). 


IX. Na (GHN Qn=3.522 

The positron end point is Es(max) = 2.50-£0.03 Mev; 
no y radiation with Z,>0.51 Mev is observed (Sc 52a). 
The half-life is 22.80.5 sec (Sc52a), 22.90.4 sec 
(Ph 53a), 27 sec (Bo 53d). Based on the first value, 


log fi=3.56. See also (En 54). 


X. Na®¥(da)Ne! Qn=6.902 
Alpha particles are observed corresponding to levels 
of Ne” at 0.343-0.010, 1.735=0.016, and 2.852+0.021 


Mev [[(Sp 51a); see also (Fr 51d) ]. 


Ne?!, 
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Nez 
(Fig. 40) 
I. O(a n) Ne Qm=—0.704 E,= 9.658 
See (Ro 44a) and (Ho 50b). 


I. F8(a p) Ne On=1.703 

Proton groups have been observed corresponding to 
levels of Ne® at 1.27 and 3.35 Mev [ (Fa 54d); see also 
(Fo 54) ]. No evidence is found for an earlier reported 
level below 1.28 Mev; [see, however, (Hj 52) ]. Gamma 
radiation from the 1.28-Mev level has been observed 
by (He 54b), (Sh 54e), and (Fo 54) and from levels 
at 3.3 and 4.9(?) Mev by (Fo 54). The radiation from 
the 3.3-Mev level to the 1.28-Mev, J =2*, level is three 
times as probable as the ground-state transition, sug- 
gesting J=1*, 17, or 2* for the 3.35-Mev level (Fo 54). 


Il. Ne (d p)Ne® Om=8.137 


Qo=8.13740.011 [(Mi 52) ; mag. spectrometer ]. 

A proton group corresponding to the first excited 
state has been observed by [(Am 50); E.=1.39 Mev] 
and [ (Zu 50a); Ez= 1.17 Mev]. 

IV. Ne” (y 1)Ne#! 

See (Fe 54). 


Om= — 10.364 


V. Ne*(p p) Ne”* 

A 1.28-Mev y ray is observed for Ep=1.35 to 4.4 
Mev; there is no sign of an earlier reported level at 
0.4 Mev (Co 54). See also (He 47). 


VI. Na2(6)N& Qm=2.841 


The decay proceeds almost entirely to the 1.28-Mev 
the half-life is 2.60 years (La 49a); 


state of Ne”: 


11007 
Na"™% t-a 


2s; 


Fic. 40. Energy levels of ¿for notation, see Fig. 1. 


> 
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Eg*(max) = 5424-5 kev (Ma 50a), 5405 kev (Wr 53); 
the y-ray energy is 1275.040.5 kev (P. Marmicr, 
private communication: magnetic spectrometer). Elec- 
tron capture occurs in 9.92:0.6 percent of the dis- 
integrations [ (Sh 54b) and (Kr 54c) ]. Log/t for the posi- 
trons=7.39 (Fe 51b); for A-+-L capture, logft= 7.27 
(Ma 54h). 

The internal conversion coefficient indicates that the 
transition is probably £2, and hence that the 1.28- 
Mev state has J=2+ (Hi53b). Since Na” has J=3, 
and presumably even parity, the beta transition should 
be allowed. The spectrum follows the allowed shape 
down to 25 kev (Ma 50a) and the B—y correlation is 
isotropic (St 51h). The capture ratio appears to exclude 
L-forbiddenness as an explanation of the large ft value 
(Sh 54b). The direct ground-state transition has been 
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observed; log ft ~13. The shape is consistent with the 
D» spectrum (Wr 53). See also (Ba 54e), (En 54), and 
(Zw 54). 


Ne” 
(Fig. 41) 

I. Ne” (6-)Na” Q,=4.388 

The decay is complex; the ground-state transition 
with Hg(max)=4.21-£0.015 Mev (compare with Qm) 
comprises 93 percent while a 7 percent branch with 
Eg(max) = 1.18-£0.04 Mev proceeds to an excited state 
of Na” [perhaps the state at 3.01 Mev: see (En 54) 
for the Na” level diagram]. The half-life is 40.20.04 
sec; logft=4.94 and 3.78 for the ground state and 
excited state transitions, respectively [(Br 50c); see 


Fic. 41. Energy levels of Ne”: for 
notation, see Fig. 1. 


23 
ee SEP SOS 


Mg + n-a 


E” 


i 
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also (Ki 52) ]. (Using Q,, for the ground-state transition REFERENCES 
energy, one obtains log ft=5.17.) (Closed November 15, 1954) 


1302 AT 23 te (References are arranged and designated by first two letters of 
II. Ne#(n7)Ne On=5.189 first-mentioned author’s name, and year. Otherwise duplicate de- 
The thermal cross section is 36--15 mb (Hu 526). signations are distinguished by postscript letter.) 


Á Ad 50a Adair, Phys. Rev. 79, 1018 (1950) 


| Jez N e23 ae 7 Ad 50b Adamson, Buechner, Preston, Goodman, and Van 
TI. Ne? (d p)Ne® 0,,=2.964 Patter, Phys. Nee 80, m (1950). 

Oo= 2.964-£0.007 [ (Va 52); fe Ad 52 Adair, Phys. Rev. 86, 155 (1952). 

Fd 2 ers 007 [(Va 52); mag. spectrometer]. ‘Ad 52a Adair, Phys. Rev. 87, 1041 (1952). 

Oo= 2.968 0.008 [ (Ah 54a) ; mag. spectrometer]. ‘Ad 53 Adair, Phys. Rev. 92, 1491 (1953). 


o a Ad54 Adair, Phys. Rev. 96, 709 (1954). 
At Ea=0.875 Mev, no proton groups are observed Ad 54a Adair, Darden, and Fields, Phys. Rev. 96, 503 (1954). 
corresponding to excited states of Ne? below 250 kev Ag52 Agnew, Los Alamos Report 1371 (1952). 


ea Z A Ag C 682 rerzoli, N í 
(Ah 54a). (Zu 50a) reports states at 0.98 and 1.75 Mev, MEDS ET say and! Querzoli Nuovo 
while (Mi 51d) report states at 1.79 and 3.0(?) Mev. Ah54 Ahnlund, Arkiv f. Fysik 7, 155 (1954). 


Ah 54a Ahnlund, Arkiv f. Fysik 7, 459 (1954). 
Ah 54b Ahnlund and Mileikowsky, Arkiv f. Fysik 8, 161 


IV. Na¥(u(p)Ne* Qm=— 3.606 (1954). i 
=. Ah 54d Ahnlund, Thulin, and_Pauli,, Arkiv f. Fysik (to be 
At £,=14.5 Mev, the cross section is 34-15 mb published). 
(Hu 53c). Ah 54e Ahnlund, Phys. Rev. 96, 999 (1954). 
: Aj51 Ajzenberg, Phys. Rev. 82, 43 (1951). 
hg n Aj5ib Ajzenberg, Phys. Rev. 693 Witte 
. Mes Ne Sy is Aj52 Ajzenberg, Phys. Rev. 87, 205A (1952). 
V Meso) New Oa oe ‘Aj 52b  Ajzenberg, Phys. Rev. $8, 298 (1952). 
See (Ho 50b). Aj52c Ajzenberg and Lauritsen, Revs. Modern Phys. 24, 321 
(1952). 
P Aj53 Ajzenberg and Buechner, Phys. Rev. 91, 674 (1953). 
Table of Atomic Mass Defects Aj54  Ajzenberg and Franzen, Phys. Rev. 94, 409 (1954). 
= Aj54b Ajzenberg and Franzen, Phys. Rev. 95, 1581 (19549) 
Tachi 5 5 Niele Al47b Allen, Burcham, and Wilkinson, Proc. Roy. Soc. (Lon- 
Nuendo T AEAN EEO TE AES Ea don) 192A, 114 (1947). 
ni 8.3638+0.0029 Ne 21.2 +0.1> Al49a Alvarez, Phys. Rev. 75, 1815 (1949). 
NB 9.179 +0.013% Al49c_ Alvarez, Phys. Rev. 75, 1127 (1949). 
H: 7.5815-0.0027" Nu 6.998 +0,010% Al 50g Alvarez, Phys. Rev. 80, 519 (1950). f 
H? 13.7203-0.006* Nis 4.528 +0.011° Al 52a ahs? Physica 18, 1034 (1952); Phys. Rev. 88, 1257 
H? 15.8271+0.010° nis 10.40 -0.03> 1952). 
Nu 13.0 £0.20 Al52b Allred, Armstrong, Hudson, Potter, Robinson, Rosen, 
He? 15.8086-+0.010* and Stovall, Phys. Rev. 88, 425 (1952). 
He! 3.6066-0.014" ou 12.168 +0.015> A153 Allison, Phys. Rev. 91, 882 (1953). 
Heš 12.92 0.08? fests 7.233 +.0.0128 Al 53a Almqvist, Allen, Dewan, and Pepper, Phys. Rev. 91, 
Hes 19.40 +0.036? o! 0 (standard) a 1022 (1953). ] 
He? 31.5 +25 ou 4.221 +0.006" Al53b Allen and Jentschke, Phys. Rev. 89, 902A (1953). 
s o! 4.522 +0.022° Al54 Alford and Hamilton, Phys. Rev. 95, 1351 (1954). 
Lië 12.99 0.15? O» 2930 0.024? Al54b Allan, Nature 174, 267 (1954). 
Lis 15.850 +0.021* o2 133 42b Al54c Alston, Crewe, Evans, Green, and Willmott, Proc. 
Li? 16.969 +0.024° j F; Phys. Soc. (London) 67A, 657 (1954). 
Lis 23,296 +0.028* pe 159 -E1> Al 54d Ango Pepper, and Lorrain, Can. J. Phys. 32, 621 
i S28 1b E om EE 1954). 
i beak Ee 6.988 EDS Am50 Ambrosen and Bisgaard, Nature 165, 888 (1950). 
Bet 20.8 1» E 6.193 oA $ An51 Anderson and Libby, Phys. Rev. $1, 64 (1951). 
Be? 17.832 +0.0248 z a EO Ar 50 Eaei Na Rev. 79, 170 (1950); Phys. Rev. 80, 34 
3 ; 0.027* : i : 1950). 
Be ‘a ae 0.028" D Ar52 Argo, Adair, Agnew, Hemmendinger, Leland, and 
Bel? 15.560 0.026" Ne’ 10.4 _ 0.2 Taschek, Phys. Rev. 87, 205A (1952). 
Bet oe E Ne 7405 oua Ar 52d Argo, Thek AE E and Leland, 
A Ne” —1. 019° ys. Rev. 87, 612 (1952). 
BS 25.1 0.3? Nen re Tne Ar 52e Artian ae passat Hausman, and McDole, Phys. 
a "076 40.029: New ate .023° Rev. 88, 1291 (1952). 
Bi 15:015 Eai Nez 1.646 +0.023° Ar 54 sgl to tpe Sawyer, Stovall, and Tuck, Phys. 
E 0228 Rev. 93, 483 (1954). 
Be eae sine Na” 14.2 +0.5>_ Ar54a Argo and Hemmendinger, Phys. Rev. 96, 8514 Oe 
Bu 19.0 2» Nee TP EUDE As 51 EE ae Raffle, Proc. Phys. Soc. (London) 64. s 
; Na 312 +£0.025° 54 (1951). 
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c ae | Bas0g Baldinger, Huber, Ricamo, and Zuati, Helv. Phys. 


Phys. Rev. 83, 512 (1951) Acta 23, ae eal oP n ark BES 
alj 4 Jer, and Lauritsen, Phys. Rev. 83, = y i a Moori Be De ae Ts 
b Din Whalin ys a ia the present article and in (Aj 52c). Ba Sia Bra I Mooring, tree, Phys. 
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New Analysis of the Interferometer Observations 


all epochs during a year. 


of Dayton C. Miller 


R. S. SHANKLAND, S. W. McCusxev, F. C. Leone, AND G. KUERTI 
Case Institute of Technology, Cleveland, Ohio 


R For nearly thirty years the results of the Michelson-Morley experiment obtained by Dayton C. Miller on 
` Mount Wilsonshave stood at variance with all other trials of this experiment. As interest in Miller’s results 
has continued to the present time, and since the original data sheets are available to the present writers, it has 
seemed appropriate that the observations be subjected to a new analysis. It is now shown that the small 
periodic fringe displacements found by Miller are due in part to statistical fluctuations in the readings of the 
fringe positions in a very difficult experiment. The remaining systematic effects are ascribed to local tempera- 

ture conditions. These were much more troublesome at Mount Wilson than those encountered by experi- 
menters elsewhere, including Miller himself in his work done at Case in Cleveland. As interpreted in the 
present study, Miller’s extensive Mount Wilson data contain no effect of the kind predicted by the aether 
theory and, within the limitations imposed by local disturbances, are entirely consistent with a null result at 


INTRODUCTION 


HE null result obtained in the Michelson-Morley 
experiment at Cleveland in 1887! was the culmi- 
nation of the long nineteenth-century search for an 
aether and proved to be a major incentive for the 
theoretical developments made by Lorentz, FitzGerald, 
Larmor, Poincaré, and especially Einstein. The crucial 
significance of the Michelson-Morley result stimulated 
many repetitions of this experiment during the next 
fifty years, especially as the implications of the theory of 
relativity unfolded. All trials of this experiment except 
those carried out at Mount Wilson by Dayton C. Miller 
yielded a null result within the accuracy of the observa- 
tions. Miller’s observed fringe displacements also were 
very small, being on the average only about 1/13 of 
those predicted by the aether theory for the 30 km/sec 
velocity of the earth in its orbit. However, as these small 
residuals have received no adequate explanation, and 
since interest in them has continued, it has seemed 
desirable to make a rather detailed analysis of the 
observational material. 
Michelson and Morley had originally planned to re- 
peat their experiment at intervals of about three months 
throughout a year and thus to allow for any possibility 


that the earth’s orbital velocity at one epoch might 
fortuitously combine with the motion of the entire solar 
system through space and give a small resultant velocity 
to their apparatus. However, after completing the July, 
1887 observations, they did not return to this problem, 
and the completion of the aether-drift experiment for all 
epochs was finally carried out by Miller during the years 
from 1921 through 1926 in Cleveland and at Mount 
Wilson. Miller’s Mount Wilson experiments’ were con- 
ducted at intervals from April, 1921, through February, 
1926, being interspersed with and followed by extensive 
trials at Cleveland. 

In February, 1921, the interferometer was set up at 
Mount Wilson, and Miller made numerous observations 
during the period April 8-21, 1921. These data indicated 
a possible small periodic effect, with an average second- 
harmonic amplitude of about 0.04 fringe, but Miller 
suspected that temperature effects or magnetostriction 
in the steel base of the interferometer as it rotated in the 
earth’s magnetic field might be the cause. Accordingly 
he had designed a concrete base for the interferometer, 
which was cast at Mount Wilson in October of 1921. 
Undoubtedly the greatest incentive to continue the 
experiments came from Professor Albert Einstein who 
visited Miller at Case on May 25, 1921, and urgéd that 

3D. C. Miller: (a) Proc, Natl. Acad. Sci. 11, 306 (1925); 


Sci 61, 617 (1925); (c) 63, 433 (1926); (d) Astr es 
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168 SHANKLAND, McCUSKEY, LEONE, AND KUERTI 
Taste I. Trials of the Michelson-Morley experiment. 

Observer Year Place D 2D/d(2/c)? A Ratio 
Michelson® Tssi Potsdam 120 cm 0.04 fringe 0.01 fringe 2 
Michelson and Morley” 1887 Cleveland 1100 0.40 0.005 40 
Morley and Miller® 1902-04 Cleveland 3220 1,13 0.0073 80 
Miller? 1921 Mt. Wilson 3200 1.12 0.04 7 15 
Millere 1923-24 Cleveland 3200 det 0.015 40 
Miller (sunlight)! 1924 Cleveland 3200 112 0.007 80 
Tomaschek (starlight): 1924 Heidelberg 860 0.3 0.01 15 
Miller® 1925-26 Mt, Wilson 3200 1.12 0.044 13 
Kennedy! 1926 Pasadena and 200 0.07 0.001 35 

Mt. Wilson 

Illingworthi 1927 Pasadena 200 0.07 0.0002 175 
Piccard and Stahel* 1927 Mt. Rigi 280 0.13 0.003 20 
Michelson, ef al.! 1929 Mt. Wilson 2590 0.9 0.005 90 
Joos™ 1930 Jena 2100 0.75 0.001 375 

a A. A. Michelson, Am. J. Sci. 22, 120 (1881); Phil. Mag. 13, 236 (1882). 

t A. A. Michelson and E. W. Morley, Am. J. Sci. 34, 333 (1887); Phil. Mag. 24, 449 (1887). 

eB. W. Morley and D, C. Miller, Phil. Mag. 9, 680 (1905); Proc. Am. Acad. Arts Sci. 41, 321 (1905). 

a D. C. Miller, Data sheets of Observations December 9 to 11, 1921 (unpublished). 
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i further trials be made to remove any possible doubts he concluded that magnetostriction in a steel or Invar 


| concerning the earlier results obtained in this experiment. 

Miller conducted a series of observations with the 
concrete base interferometer from December 4-11, 1921, 
but with rather discouraging results. Much of the time 
the interferometer behaved poorly, the fringes were 
often unsteady, temperature variations near the instru- 
ment were troublesome, and vibrations caused by high 
winds often made observation entirely impossible. How- 


i ever, at last on December 9-11, 1921, thirteen sets of 
i readings comprising 153 complete turns of the inter- 
ferometer were made under favorable conditions, some 


with “excellent seeing.” These data gave an average 
periodic amplitude of only 0.04 fringe, nearly the same 
as the April, 1921 results with the steel base inter- 


i ferometer, and Miller concluded that “all effects are 
yi probably due to the instrument. This is the end!’” So, 
| shortly thereafter he dismounted his apparatus for its 
i return to Cleveland where it was reassembled in a base- 


ment room of the Case Physics Laboratory. 

Since the 1921 experiments with both the steel and 
| the concrete bases for the interferometer gave essentially 
the same results, Miller concluded that magnetostriction 
i was not the cause of the small periodic effects observed. 
3 However, since the mechanical rigidity of the concrete 
{ base instrument was less than that for the steel, a 
| conclusive argument could not be made. The magnitude 
; of the magnetostriction effect is marginal. Nevertheless 
| it cannot be ignored, since it would produce a second 
t harmonic in the fringe displacements. Professor G. Joos, 
| in his repetition of these experiments in 1930 at Jena, 
p - used a fused quartz base for his interferometer because 


4 D. C. Miller, Research Notebook, December 11, 1921. 
5 G. Joos, Ann. phys. 7, 385 (1930) and personal communica- 
tion in a letter of May, 1954. 


z 


base would be excessive for the precision he expected to 
attain. 

After the 1921 trialsat Mount Wilson, Miller probably 
would have abandoned further work on the problem 
except for a visit to Case made by H. A. Lorentz during 
the following spring. During his visit he discussed the 
Mount Wilson observations and encouraged Miller to 
continue the experiments. 

From the time the steel base interferometer was set up 
again at Case in 1922 until it was returned to Mount 
Wilson near the end of the summer of 1924, Miller 
carried out many experiments designed to test and im- 
prove the performance of the apparatus. He established 
the order of magnitude of the fringe shifts produced by 
unequal heating of the air in the optical paths of the 
interferometer, by varying the speed and reversing the 
direction of rotation of the instrument, and by various 
adjustments of the centering pin. He also used different 
light sources, including sunlight. 

In many of the Cleveland trials of 1923 and 1924, 
especially those for which the data sheets record the 
best “seeing” of the interference fringes, the periodic 
effects are very small. These data were not analyzed in 
detail by Miller as he considered them only preliminary 
to the later work at Mount Wilson. However, 19 sets of 
observations made from August 23 to September 4, 
1923, and 42 sets of data taken from June 27 to July 26, 
1924, including those made with sunlight, for which the 
interferometer was in the best adjustment and the 
fringes were often noted as “good” or “excellent,” 
constitute some of the best data obtained with Miller’s 
interferometer. These 1923-1924 Cleveland results are 
included in Table I which summarizes the important 

trials of the Michelson-Morley experiment. 
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After completion of the July, 1924 trials, the appa- 
ratus was returned to Mount Wilson where it was again 
set up in the observation hut, which was moved back 
some distance from the canyon edge to reduce the effects 
of wind. After a considerable number of preliminary 
trials, extensive series of observations were made by 
Miller at four epochs: March 27-April 10, 1925; July 
24-August 8, 1925; September 10-23, 1925: and Febru- 
ary 3-12, 1926. > 

The readings of fringe position were taken at 16 
azimuth positions starting from the north and measured 
clockwise. A data sheet usually contained readings for 
20 turns of the interferometer, although some contained 
a greater or lesser amount of data. For his analyses 
Miller averaged the 20 readings at each azimuth and 
then applied a linear correction for fringe pattern drift 
so that the averages closed as a periodic function in the 
360° rotation. Furthermore, a constant was then sub- 
tracted from these averages so that their mean was zero. 
These adjysted means were plotted in azimuth and con- 
nected by straight lines to give a graph for harmonic 
analysis with the Henrici machine. A sample data sheet 
which presents details of his reduction method is 
published in Miller’s paper in reference 2. 

Miller’s harmonic analyses of these curves yielded 
amplitudes and phases for the first five harmonics. The 
second harmonic provides the parameters needed for 
computing the presumed aether drift. The phases ob- 
tained from these analyses were never capable of being 
fitted into a logical relationship corresponding to an 
oscillation about the north point during the course of a 
sidereal day. As reported by Miller,? the axes were dis- 
placed from the meridian as follows: “for February 10° 
to the west of north; for April the displacement is 40° 
east; for August 10° east; and for September 55° east.” 
This azimuth anomaly has been the greatest obstacle to 
the acceptance of the small periodic amplitudes re- 
ported by Miller as having relevance to an aether-drift 
effect. Synge and Gardner® developed a theory of the 
Michelson-Morley experiment, including the effects of 


acceleration, which was not inconsistent with the oc- 


currence of azimuth anomalies, such as found by Miller, 
but further predictions of this theory have been dis- 
proved in experiments by Ditchburn." 


STATISTICAL ANALYSIS OF MILLER’S DATA 


The data collected by Miller have recently been re- 
examined with a view to establishing the relative im- 
portance of statistical fluctuations and physical causes 
in the small periodic effects which he obtained. 

Consider first the individual data sheets. It is gener- 
ally agreed that an experienced observer with keen 
eyesight, such as Miller had, can estimate the fringe 
Position to 0.05 fringe, which may be regarded as the 
least count of the instrument. This personal factor will 


6 J. L. Synge and G. H. F. Gardner, Nature 170, 243 (1952); 


in Soc. 26, 45 (1952). lye Be 
ar Roa Da S. Heavens, Nature 170, 705 (1952). 


introduce variations in the data in addition to those 
arising from other causes. Prominent among the latter 
possibilities are mechanical vibrations and bending as 
the interferometer rotates, temperature disturbances 
producing both a drift of the fringes and periodic 
changes, and magnetostriction which also would cause a 
periodic effect, 

A typical data sheet contains entries in 16 columns 
for the azimuth points of the instrument, and 20 rows, 
one for each complete turn of the interferometer. Under 
ideal conditions with no aether drift nor other systematic 
or random effects present, all entries in a table should be 
identical. In a few sets of data taken at Cleveland in 
1924, this is nearly the case. Usually, however, the 
readings show considerable variation, having the general 
character of random fluctuations superimposed on an 
irregular drift. A standard analysis of variance technique 
may be used to examine the degree of randomness in the 
entries on a data sheet. 

Let x be a reading of fringe position and let Z;(7=1, 2, 
- - +16) be the average of the ith column of data (azimuth 
position on the interferometer); furthermore, let z be 
the average of all 320 entries, xij, on a data sheet, where 
the xj, and hence the averages, have already been freed 
of the linear drift of the whole fringe system in the same 
manner as Miller did it. Suppose that these 320 values 
be considered as random sample values drawn from a 
normal population. Furthermore, suppose the arrange- 
ment into columns and rows is a random one. Then 
statistical theory shows that the variance of + may be 
resolved into a contribution due to variation of the 
individual entries in the columns about the column 
means and a contribution due to differences between 
column means and the mean of the whole assembly. A 
measure of the relative significance of these is the 
statistic F defined by 


16 
Eaz) 
320x19 5! 
j BS 


15 16 20 
LX L o a 

i=l j=l 
Under the hypotheses of randomness and normality 
of population, the probability that F for a given sample 
will exceed a given value has been tabulated. For the 
number of degrees of freedom involved in the present 
data, theory shows that the probability of obtaining by 
pure sampling fluctuation an F>1.71 is only 0.05; the 
probability of obtaining an F> 2.21 is only 0.01. Tiesė 
probabilities are accepted limits for rejecting the 
hypothesis that the array could have arisen by sampling 
fluctuations in a normally distributed population. When 
a large number of data sheets are analyzed, one would 
expect only one out of twenty to exhibit an F>1.71 if 
the population of which the sheets are samples consists 
only of randomly fluctuating data, If many of the data 
sheets lead to an F-value gréater than the limits quoted, 

3 ae 
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Fic. 1. The distribution of F-values for 216 sets of Mount Wilson 
data. The smooth theoretical curve is normalized so that the area 
under this curve is equal to the area of the histogram. 


it is highly probable that some systematic effects are 
present in the data. 

Figure 1 shows the frequency distribution of the F- 
values computed from the data sheets together with a 
theoretical distribution of F appropriate to the number 
of degrees of freedom involved. It is apparent immedi- 
ately that the observed number of large F-values far 
exceeds the number to be expected on the basis of a 
random sampling from a normal population. In fact, 
36.5 percent of the F-values exceed the critical value 
1.71, and 25.3 percent exceed the value 2.21. The 
analysis indicates that the fluctuations in the column 
means cannot be attributed entirely to random effects, 
but that systematic effects are present to an appreciable 
degree. 

A second method also shows that the periodic effects 
observed by Miller cannot be accounted for entirely by 
random statistical fluctuations in the basic data. Five 
representative sheets with original unprocessed data 
were subjected to an autocorrelation analysis. These 
sheets were selected as typical of the Mount Wilson 
data. Miller had deduced for them second-harmonic 
amplitudes of 0.021, 0.045, 0.059, 0.082, and 0.123 
fringe respectively. Thus they embrace nearly the entire 
range of the second-harmonic effect which is under 
examination. 

In three sets of data (Nos. 42, 75, and 79 in Table TI) 
a strong period of 8’ (second harmonic) was found rela- 
tive to those of periods 4 through 12. However, in the 


8 We are indebted to R. L. Stearns, Dr. E. F. Shrader, and Dr. 
L. L. Foldy for making the autocorrelation analysis; the MS thesis 
of R. L. Stearns, Case Institute of Technology, 1952, gives details 
of the analysis and description of machine used. 
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other two sets of data analyzed by this method, the 
period of 8 was no more prominent than the other 
periodic components. This indicates that the true mag- 
nitude of the systematic effect varies greatly with the 
conditions of observation, as would be the case if caused 
by local disturbances. A summary of the pertinent re- 
sults regarding the period of 8 is given in Table II. The 
units in the table are fringes. 

Column 2 gives the amplituáe of the second harmonic 
as obtained from the Henrici machine by harmonic 
analysis; column 3 gives the amplitude deduced from 
the correlogram and contains whatever random fluctua- 
tions are in the data; column 4 gives the amplitude with 
the random effects removed. It is again apparent that 
random statistical processes contribute considerably to 
the periodic effect when it is small but that the larger 
amplitudes are relatively unaffected and cannot be 
explained in this manner. 

Consider next the actual harmonic analyses of the 
Mount Wilson data. Miller’s harmonic analysis records 
give amplitude distributions for the first five harmonics 
for which the relevant statistical parameters are sum- 
marized in Table III. The values of the mean amplitude 
Ā,(k=1, 2, ---5) and the standard deviation s(A;) have 
been computed directly from his data cards. Altogether, 
for the four seasons, 306 sets of data were used. There is 
no significant seasonal variation in the mean amplitude 
of the second harmonic, the values of A» being 0.042, 
0.049, 0.038, and 0.045 fringes for April, 1925, July, 
1925, September, 1925, and February, 1926 epochs 
respectively. 

Table ITI indicates clearly the dominance of the first 
and second harmonics over the others in the harmonic 
analyses. While the third, fourth, and fifth harmonics 
have an order of magnitude equal to the standard 
deviations, the first and second harmonics are con- 
siderably larger. However, the average Ag here is only 
1/13 of the value to be expected on the basis of the usual 
aether-drift hypothesis. It is interesting to note that 
among the trials which yielded unusually large ampli- 
tudes of the second harmonic (A2>0.08 fringe), 14 out 
of 17 were made either at the beginning or the end of a 
series of observations, or were made under adverse 
conditions noted on the data sheets. 

The question now is, “What part of the observed 
average amplitude for the second harmonic, deduced by 
harmonic analysis, may reasonably be attributed to 
random statistical fluctuations in the data?” Let sets of 


Taste II. Summary of autocorrelation analysis. 


Miller's Correlogram 
observed Uncorrected Corrected for 
Sheet Az Az random effects 
15 0.021 0.032 0.013 
23 0.045 0.054 0.043 
79 0.059 0.062 0.060 
75 0.082 0.079 0.078 
42 0.123 0.129 0.126 
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observations be selected at random from a population 
which is normally distributed and be plotted as ordinates 
with the order numbers of drawing as abscissae. Suppose 
a harmonic analysis is made of each set, Then the 


probability that an amplitude of a Ath harmonic, A+, 
chosen at random among the sets, will lie between A and 


A+dA is ` 
P(A)dA =n}? exp(—h?nA 2/2) AdA, 


where 7 is the number of observations analyzed and h is 
the measure of precision of the population. From this, 
the mean amplitude A+, often designated M(A,), and 
the standard deviation o(A,) are found to be, re- 
spectively, 


os m \3 4—\ 1 
A,.= ( ) and «o(A,)= (=) P 
2Qnh? 2nh? 


independent of k. 

A tabulķtion of the values of @; irrespective of the 
azimuth i from which Miller made his harmonic analyses 
indicates an approximately normal population with a 
standard deviation .0.05 fringe. Thus in this extreme 
case, if all of the variation in the data used for analysis 
were attributable to random fluctuations, one would 
expect 


A,=0.023 fringe and o(A,)=0.012 fringe. 


On the usual assumption that the squares of the random 
and systematic contributions to amplitude add to pro- 
duce the observed (A2)?, it appears that not more than 
15 percent of the second-harmonic amplitude can be due 
to statistical causes. This is in accord with the result of 
the autocorrelation analysis and leaves an average 
residual amplitude of 42=0.038 fringe to be explained 
in other ways. 

Thus there can be little doubt that statistical fluctua- 
tions alone cannot account for the periodic fringe shifts 
observed by Miller. On the other hand, the presence of a 
periodic effect in the column means z; may be clearly 
demonstrated in a way which again exposes the incom- 
patibility between the phase anomalies already noted 
and the usual kinematic accounting in aether-drift 
experiments. 

When the 2; for twenty sheets of the July-August, 
1925 Mount Wilson data are plotted as a function of 
azimuth index 1, the result is as shown in Fig. 2. These 
sheets have been chosen so as to span the 24 hours of the 
sidereal day. There is obviously considerable scatter in 


TABLE IIT. Summary of harmonic analyses—Mount Wilson data. 


Mean amptitude Standard deviation 


nen Ax (fringes) s(Ax) (fringes) 
0.033 
2 004i 0.022 
: 3 o 0.011 0.008 
4 0.007 R o 
5 0.005 i 
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Fic. 2. The individual column means Z; are plotted as a function 
of azimuth pesition for the July, 1925 observational data. Large 
circles and the connecting curve show the second harmonic effect 
exhibited by the averages, (Z,;), due to ordering in azimuth. Units 
for the ordinate are fringes. 


the Z; at each azimuth position, but the average values 
(ži), calculated for each index, show a marked second 
harmonic effect. The large filled circles represent the (Zi. 
Similar results have been found for the other three 
epochs of the Mount Wilson observations. 

Now let the values of (#;) for i=9, 10---16 be aver- 
aged with those for 7=1, 2, 3---8. When this is done and 
a smooth curve is drawn through the points, the result 
is the curve labelled “July” in Fig. 3. Similar curves for 
the other epochs are also shown in the figure. Twenty 
data sheets, representative of all sidereal times, were 
used for each sample. 

Note that all of the resulting periodic functions have 
amplitudes between 0.02 and 0.03 fringe, but that they 
differ in phase. Particularly striking is the difference be- 
tween the February, 1926 result and the others. 

It is, on the other hand, possible to calculate in a 
rather straightforward way the fringe shift pattern, 
averaged over all sidereal times, to be expected on the 
assumption of an absolute motion of the earth through 
an aether on the basis of the usual kinematic considera- 
tions. Let V be the vector velocity of the earth at a 
given epoch toward an apex whose celestial coordinates 
are a, 6; let V, be the projection of V in the plane of the 
interferometer, ¢ be the latitude of the observer, and y 
be the azimuth of the telescope arm of the interferome- 
ter measured from the north point through the east. 
Then calculation shows that, the value (Az). of the 
fringe shift, averaged over all sidereal times, is given by 


(An) w= V2F (6,¢) cos2y. (1) 


Since ¢ is constant and ô is fix 1 at a specified epoch, 
an average fringe shift of secon. harmonic nature re- 
sults. Clearly, the angle y equals 27(t—1)/16 and the 
computed (Am)x corresponds to the observational 
3{(€:)+(€4s)]. Thus there is a direct correspondence 
between the curves of Fig. 3 and Eq. (1), and no phase 
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Fic. 3. Second harmonics in the $[(#,)+-(€i4s)] for the four epochs 
of the Mount Wilson data. 


anomalies of the kind observed in Fig. 3 are permissible. 
The four curves should have a common maximum (or 
minimum) at i=1; only the amplitude may be different 
at different epochs. 

This criticism is, of course, only different in form from 
the objections raised by v. Laue? and Thirring. Both 
authors point out that the Lorentz criterion, which 
requires the mean value of V, over a sidereal day to bea 
vector in the meridian, is not satisfied by the results of 
the Mount Wilson observations of 1921 and 1925, and 
Thirring shows for a single daily pattern of these 
observations that it cannot be made to coincide with 
any one of the daily patterns derivable from the usual 
kinematic theory upon variation of 6. The present form 
of the criterion, if it may be so designated, is perhaps 
more convenient when a large amount of data is to be 
checked for consistency. 

In the interpretation of his observations, Miller 
hypothesized that, for unknown reasons, the daily swing 
of the vector V, might be symmetric about some line 
other than the meridian, the angular deviation from the 
meridian being permitted to vary from epoch to epoch. 
In terms of Fig. 3, this is simply the deviation of the first 
maximum from the north point, but the angles in our 
figure differ considerably from those found by Miller.* 
Actually, Miller drew the curves A vs 0 (A=azimuth of 
V p, max ; 9= sidereal time) for each of the four epochs and 
determined his anomalies from what he called the “own 
axis” of the plot. The meaning of that term is obscure. 
Contrary to expectation, it does not designate the hori- 
zontal axis through the centroid of the observational 
curve, since the ratio of the upper to the lower area is 
found to be 1.3, 1.7, 1.4, and 0.6 respectively for the four 
epochs. 

Under these circumstances, little significance can be 
attached to the remarkable agreement between the 
yalues of a and ô, computed from the V, vs 0 curves on 
the one hand and from the A vs @ curves on the other 
hand, as presented in Tables I and II in reference 2, p. 
230. The second method of computing uses the 0 values 
at which A passes “its own axis” and the quantity 
Amax,-both taken from the curves. Both parameters 


9M, v. Laue, Handbuch d. Experimentalphysik (1926), Vol. 
XVIII, pp. 95, 101. Pae 

wH. Thirring, Z. Physik 35, 723 (1926); Nature 118, 81 
(1926). o 
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depend, of course, on the choice of the “own axis,” but 
the first of them is extremely sensitive to that choice and, 
furthermore, is directly equal to the right ascension of 
the apex, a. Hence, even if one were to accept the 
azimuth anomalies as an unknown effect to be explained 
by a more refined theory, any numerical results based on 
the A vs @ curves should be accepted with reservation. 

Whether a reliable determination of the other pair of 
parameters, Vp, max/ V p, min and OY», min, from the Vp vs 0 
curves is at all possible has not been decided. The 
method of averaging small groups of V -values (each V, 
corresponding to one set of twenty turns) in order to 
obtain twenty V ,-averages roughly equidistant in 0, and 
the use of these twenty points as true representatives of 
the curve sought cannot be considered statistically 
satisfactory because of the enormous scatter of the 
observational results. However, a more refined rede- 
termination of these parameters that enter the first 
method of computation did not seem justified to us for 
the following reasons. A a 

If the azimuth anomalies are accepted, then Eq. (1) 
establishes the simple connection, 


(An) x, max = VF (ô, po) (2) 


between the maximum of the fringe shift average ob- 
served during the epoch and the function F (ô, p0), whose 
derivative with respect to ô can be shown to vanish for 
5=90°. Now, (Az) sv, max changes only slightly from epoch 
to epoch (cf. Fig. 3), which makes it feasible to satisfy 
(2) approximately by the same constant vector V for all 
epochs. However, the vector V must contain the variable 
contribution of the orbital motion of the earth. This 
difficulty may be overcome by choosing a cosmic Vo such 
that Vo>>Vorital and ôo is not far from 90°; but a further 
obstacle arises here, namely, at ô= 90°, Fô, po) becomes 
simply a? cos?go, where a? is a constant completely de- 
termined by the geometry and. wavelength of the 
interferometer. Thus, the magnitude of Vo+tVorbita 
cannot be chosen freely. A reduction factor becomes 
necessary so as to make (An)an mx= k V?F (ô, po) where k 
is about 1/20 in Miller’s cosmic solution. 

It seems to us on the basis of this discussion that the 
internal consistency of the cosmic solution is not so 
great a surprise as it appears at first glance. It certainly 
is not cogent enough to serve as a logical support of the 
claim that the half-period effect observed is a true 
aether-drift effect. We therefore did not embark on a 
statistically sound recomputation of the cosmic solution, 
but rather concentrated our efforts on an interpretation 
of Miller’s observations in terms of systematic local 
disturbances such as may be caused by mechanical 
effects or by nonuniform temperature distributions in 
the observation hut. 


POSSIBLE MECHANICAL EFFECTS 


The question may be raised whether some oi the local 
causes which are responsible for Miller’s results can be 
found in the mechanical performance of his apparatus: 


tb 
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Miller’s papers and notebooks do not offer much perti- 
nent material but it has been possible to extract some 
information from his report,? from the originals of the 
photographs published there, and from interviews with 
former co-workers. These data make it possible to 
investigate the motion and deformation of the steel 
cross carrying the optics of the interferometer. 

The cross consists of four arms bolted together, each 
arm being a box structure held together by rivets. The 
central portion of the cross rests squarely on a wooden 
support which floats on mercury. 

Consider the beams B, and B», each formed by one 
pair of opposite arms. If the supporting float is hori- 
zontal, the ends of the beams will sag under the influence 
of gravity somewhat below the level of the supported 
central portion. Now let the cross turn about the beam 
B» through a small angle a;. This will cause the beam B, 
to unbend slightly, since only the component of gravity 
normal to the beam contributes to the bending moment. 
The mirrbrs rigidly cornected to the ends of B, will 
therefore suffer small additional rotations in opposite 
directions about axes parallel to Bs. A very small change 
in the length of the light path will be the consequence of 
this change in the relative position of the mirrors on By. 

In reference 2, p. 215, Miller supplies direct evidence 
of the effect of bending, when he states that an end load 
of 282 g placed on one end of the four arms produces a 
shift of one fringe.!! We may reasonably assume that 
half this load, 6=141 g, placed simultaneously on two 
opposite arms would have produced the same effect; a 
simple argument now gives the fringe shift due to an 
angular rotation a; about Bz in units of one fringe as 


n= (1—cosa1)q/6, (3) 


where q depends on the weight of the beam and on its 
distribution, and the analogous formula holds for a 
rotation œ about Bı. We obtained! for q the value 
48X10 g. 

From Eq. (3) we can find the angle a; necessary to 
produce a total fringe shift nı=2X0.044, which is the 
mean of the observations on Mount Wilson. The result 
is a= 1.3°, which seems too large to permit an interpre- 
tation of the observed fringe shifts in terms of a wobbling 
motion of the slowly rotating cross (one turn in 50 sec). 
But smaller libratory irregularities, perhaps of the order 
of a few tenths of a degree, must undoubtedly have been 
present in Miller’s experiments. In order to find out 
about their possible contributions to the results, the 
dynamic constants of the apparatus in rotation were 
determined, and a simple analysis of its motion was 
made. 


STNG aga is not strictly a constant; according to a check on 
February 4, 1926, it is 313 g. apse : 

12 Tt will be noticed that the quantity ô is determined when the 
structural properties of the cross, the geometry of the light path, 
and the wavelength are known. The calculated ô came out about 
Miller, but an overestimation of Ee 
stiffn vas to be expected, since no correction for incomplete 
aby os REE and ate was made. We wish to thank E. Krasnoft 


for his assistance in these computations. 


Assuming the apparatus strictly symmetrical about 
the vertical, one obtains for the moments of inertia C 
and A about the vertical symmetry axis and ahout a 
horizontal axis through the tenter of mass, respectively, 


C=1.67%10" g-em?, A=9,05% 10" g-cm?. 


The center of mass is 48.5 cm above the centroid of the 
displaced mercury, and the metacenter is 210.5 cm 
above the center of mass. The restoring moment 
produced by a small angle # between the symmetry axis 
of the apparatus and the vertical is 9D, where 


D=2.48% 10" dyne-cm. 


The period of roll about a horizontal axis through the 
center of mass is therefore only 1.20 sec. 

The motion of the system may be considered as that 
of a (hanging) top. The restoring couple is 6D. Depend- 
ing on the fixity of the centering pin (which was not the 
same in all experiments), the fixed point may be 
identified either with the center of mass itself or with the 
bearing of the pin, which is about 51.5 em below the 
center of mass. The equatorial moment of inertia is in 
this latter case A= 1.2210" g-cm?; C and D are, of 
course, unchanged. 

Let us consider the motion in the neighborhood of 
steady rotation about the vertical axis of symmetry. 
The values of the constants A, C, D, together with the 
very small spin r= 27/50 sec~, are such as to preclude 
regular precession, whichever fixed point is chosen. The 
motion is of the “rosette-type,” the instantaneous 
position of the symmetry axis being given by” 


8=nsinayl, w:=[(C3?+4AD)/4A*)}, 
p= unl, w= (C/2A)r. 


Here 9 and y are the polar coordinates of the axis of 
symmetry, and 7 is an amplitude constant characteristic 
of the asymmetry of the initial impulse. 

It is now possible to find an approximate expression 
for the fringe shift due to angles a and a, subtended 
simultaneously by B, and Bz with the horizontal plane, 
when the symmetry axis moves according to Eq. (4). 
One obtains 


An=n— n= (pW /26)7? sin’ [w] cos[2(r—aw:)t]. (5) 


(4) 


The circular frequency w: is very close to (D/A)? and 
corresponds to the roll period mentioned before. Thus 
An oscillates with twice the roll frequency, the ampli- 
tude being modulated with the circular frequency 
2(r—w2). The value of w is not very sensitive tava 
change of the fixed point; the roll period increases from 
1.20 sec to 1.40 sec if the fixed point is assumed at the 
pin bearing. The period of modulation, however, de- 
creases from 39 sec to 10 sec if, instead of the center of 
mass, the pin bearing is assumed fixed. $ A 

Obviously, these rapid oscillations cannot account for 


3 F, Klein and A. Sommerfel@?, Ueber die Theorie des Kreisels 
(Leipzig, 1898), p. 331. ° 
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the true second-harmonic period (25 sec) which strongly 
dominates such runs as shown by sheets 79, 75, and 42 in 
Table II. Beyond that, one may in general expect the 
contributions of these oscillations to cancel each other in 
the average throughout a set of 20 runs. In order to 
estimate the efficiency of the averaging process, assign 
to the amplitude constant 7 the (improbably large) 
value of 0.5°; compute a table of 20 rows according to 
Eq. (5) by setting ‘= 504/16, where k= 1, 2, - - -320; and 
determine the column averages (Am)q,;. It turns out 
that the maximum value of |(Az)x,,:| stays well below 
0.005 fringe. 

These conclusions would not be strongly affected by a 
slight asymmetry of the apparatus about its vertical 
axis as must undoubtedly have been present. 


EFFECTS OF TEMPERATURE 


Miller’s 1923 laboratory tests of the effects of thermal 
variations on the interference fringes have been studied 
with the view of establishing their relationship to his 
Mount Wilson observations, where temperature effects 
due to intense sunlight during the day and to canyon air 
currents at night were often troublesome. Miller’s own 
suspicions that thermal effects might be important are 
shown by the entry in his laboratory notebook for April 
14, 1921: “Sun shining full on side of house. There was a 
very large drift which seems to be in the direction of the 
sun; indicating possibility that the entire effect is due to 
temperature!” 

In the laboratory tests, electric heaters were placed at 
the level of the mirrors and about three feet from the 
circle travelled by them. The altered refractive index of 
the heated air and the thermal effects on the mirror 
supports change the optical path lengths of the inter- 
ferometer, and when these are affected unequally in the 
two arms, the fringes shift in position. On the assump- 
tion that the four arms of the interferometer all have 
the same thermal insulation, a localized temperature 
anomaly or a temperature gradient across the room will 
produce a second-harmonic term in the fringe positions 
as the interferometer rotates, similar to that anticipated 
for an aether drift. 

Localized heating will also produce a first harmonic in 
the fringe displacements when one of the four inter- 
ferometer arms, for example that containing the ob- 
serving telescope, has different thermal insulation prop- 
erties from the others. The effects of heat on the 3rd, 
4th, 5th, and higher harmonics of the fringe displace- 
ments should be small fox any temperature conditions 
likely to be encountered. 

The laboratory tests of 1923 were conducted with 
various amounts of thermal insulation protecting the 
light paths and mirror supports. In some trials the air 
was directly exposed to the heaters; in many cases the 

‘glass and wood casing that served at Mount Wilson as 
insulation for the light path was employed; and in 
certain experiments additional corrugated paper was 
placed over the vertical glass walls of the casing, the 


= e 
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LEONE, AND KUERTI 
Tapte IV. Laboratory heating trials (unit: fringes), 


Periodic Controls Heat 
amplitudes Set 17 Set 28 Set 18 Set 29 
Ay 0.006 0.006 0.010 0.021 
Ae 0.015 0.010 0.049 0.052 
A; 0.006 0.005 0.009 0.005 
As 0.004 0.003 0.006 0.011 
As 0.001 0.001 0.003" 0.002 


1 


mirror mountings, and in some cases over the steel base 
of the interferometer as well. In the experiments where 
the air in the optical paths was directly exposed to heat, 
large second harmonics (A2=0.35 fringe for one heater, 
and about twice this value for two heaters) were always 
observed in the fringe displacements, and with the 
expected phase. Shifting the heaters to a different 
azimuth produced a correspondIng change in the phase 
of the second harmonic. When the optical paths and 
mirror supports were thermally insulated, the second 
harmonics were greatly reduced; for example, with the 
glass coverings as used at Mount Wilson in place, the 
amplitudes were reduced to about 0.07 fringe. 

Among the laboratory trials made by Miller at Case in 
1923 are four sets of observations which reveal rather 
clearly the character of the temperature effects. In these 
sets the optical paths of the interferometer were en- 
closed with the glass thermal insulation as used at 
Mount Wilson, and, in addition, corrugated paper was 
placed over all arms of the interferometer. In two of 
these trials no artificial heating was used, but in each of 
the two sets immediately following these controls, the 
heater, in the position mentioned above, was in opera- 
tion. The results of these experiments are given in 
Table IV. It is evident that the heater. produced dis- 
turbances which increased the amplitudes of all five 
harmonic components. However, the effect on the second 
harmonic A» is much the largest, as is to be expected on 
physical grounds. Furthermore, the phases of the second 
harmonic in sets 18 and 29 have values consistent with 
the position of the heater. The first harmonic A, is 
somewhat increased by heat, and for this also there is 
some physical justification. The increases in the ampli- 
tudes A3, A4, and A; are much smaller. 

It must be emphasized that the foregoing analysis of 
these tests reveals small but certain temperature effects, 
in contrast to Miller’s statement that he had shown the 
absence of periodic effects caused by artificial heating 
when the light path was thermally insulated as previ- 
ously described.* Similar conclusions regarding the 
danger of spurious second harmonics due to thermal 
conditions were reached by Joos in his elaborate prepa- 
rations for a repetition of the Michelson-Morley experi- 
ment at Jena. In fact, Joos concluded from his labora- 


tory trials that temperature disturbances would be so 


serious that photographic recording of fringe positions 


H See reference 2, p. 220. 
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would be impossible except in a well-insul 
laboratory.” 

Thus Miller’s experiments in 1923 do not rule 
possibility of attributing the remaining systematic 
effects in the Mount Wilson data, which are most 
prominent in the second harmonic Ao, and to a lesser 
degree in the first harmonic 4,, to temperature causes. 
In what follows, we shall interpret the systematic effects 
on this basis, but must admit that a direct and general 
quantitative correlation between amplitude and phase 
of the observed second harmonic on the one hand and 
the thermal conditions in the observation hut on the 
other hand could not be established. The reason for this 
failure lies in the inherent inadequacy, for our purpose, 
of the temperature data available. 

Let us first discuss the physical consequences of the 
weak radiation field maintained across the hut by the 
temperature differences of the walls. Since periodic 
temperature variations of only 0.001°C in the air of the 
optical "sams would produce fringe shifts as large as the 
average effects observetl at Mount Wilson,!® a very 
conservative estimate was made of the wall temperature 
differences necessary to produce temperature oscillations 
of that magnitude. For this purpose it was assumed that 
(1) the ambient air temperature in the hut was es- 
sentially constant, (2) radiation was absorbed only by 
the vertical glass plates of the casing, and (3) heat 
transfer from the glass to the air inside the casing was by 
conduction only.'’ The resulting wall temperature differ- 
ences are about ten times as large as those usually 
recorded by the four thermometers located on the walls 
of the observation hut. There is no doubt, however, that 
this factor ten would be very considerably reduced if 
convection of the air inside the casing were taken into 
account and jf the contribution of the cover of this 
casing, facing the roof of the hut, could be evaluated. 

In reality, however, the effects of temperature on the 
apparatus must have been very complex, being mixed 
contributions of changes in density of the air in the 
optical paths, angular deflection of the mirror supports, 
and thermal expansion of the steel frame, the latter 
effect introducing a long time lag. It is practically im- 
possible to carry through calculations which would 
predict the over-all behavior of the interferometer due 
to temperature anomalies, since hardly any of the 
necessary data for such calculations exist. In fact, the 
readings of the four thermometers constitute all of the 
available information about the temperature (and 
radiation) pattern in the hut. They give essentially the 
air temperature along the wall (but not the wall 
temperature), and say nothing about the temperature 
distribution along the roof, itself a low-gable construc- 


ated basement 


out the 


i icati ssor G. 
18 According to the personal communication from Professo G 


54); so reference 5. | z 
ee ie agreement with similar estimates made by 


. R. J. Kennedy, Proc. Natl. Acad. Sci. 12, 621 (1926); and by G. 


. Rev. 45, 114 (1934). z 
oe Repte to ee colleague Dr. H. G. Elrod for these 


calculations. 
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tion with ridge at 20° to the N-S direction, and with its 
under side only a few feet above the thin wooden cover 
of the casing of the light paths. We conclude from the 
foregoing estimate that an interpretation of the system- 
atic effects in terms of the radiation field established by 
the nonuniform temperatures of the roof, the walls and 
the floor of the observation hut is not in quantitative 
contradiction with the physical conditions of the 
experiment. 

In view of the local factors affecting the temperature 
conditions of the interferometer, we may now ask 
whether the epoch averages shown in Fig. 3 should not 
in some way correlate with what is known about the 
mean temperature conditions at the several epochs. 
Thus it has been noted that in Fig. 3 the curve of the 
February, 1926 observations differs considerably in 
phase from those for the other epochs, We believe this 
behavior is correlated with the fact that throughout the 
February experiments the thermometers on the north 
and west walls of the house consistently registered 
temperatures from 1°C to 2°C lower than the south and 
east wall thermometers. This situation resulted pri- 
marily from a blanket of snow which covered the ground 
on the N-W side of the hut. Furthermore, the west wall 
of the hut was water-soaked throughout the February 
runs. As a result the average temperature gradient 
through the hut was in the general SE—NW direction in 
February, in contrast to that in July and September, 
1925 when the temperature gradient averaged through- 
out a day was more nearly along the N-S line. 

Turning now to single sets of observations, we 
may first try to correlate the smallest fluctuations 
J occurring’? in the Mount Wilson sets with the ther- 
mometer readings. There are eleven sets with f-values 
smaller than 0.015 fringe. If we introduce ‘AT) 
=av.|T;— T| (i=1, 2,3,4) as an indicator of uni- 
formity in temperature conditions, where T; denotes a 
thermometer reading, then (AT),, ranges from 0.02°C to 
0.25°C with a mean value of 0.11°C for these ob- 
servations. 

Thus small f-values usually go with small (AT), 
values, but the correlation is very incomplete, as (AT) 
values of 0.2°C are very often found in the Mount 
Wilson sets. Furthermore, sets with small /-values 
usually do not compose a normal population. For ex- 
ample, if (f:+:,s)/2 is computed for half of one of 
these sets, then usually a much larger f-value results. 
This agrees with the existence of periods other than$ 
found in the autocorrelation analysis of the two samples 
having small A» shown in Table II and indicates that 
many of the small f-values observed are the result of 
chance cancellations. 

If, on the other hand, the largest f-values (>0.08 
fringe) are considered, one is left, after elimination of 

1I Let averages 2; be calculated as on p. 169 and consider 
(4:+4i3)/2 as an approximation to the amplitude of the second 
harmonic. The maximum absolute difference between any of these 


numbers will be denoted by 24, where the fluctuation f now 
corresponds to A2. 
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sets where remarks such as “sun shines on interferome- 
ter” occur, with a selection that appears to consist of 
two different groups. In one group, including sets 103 
and 106 of March 28 around 5:00 v.m.; 128 of April 8, 
5:00 p.m.; 135 of April 9, 3:00 p.m.; 45 and 47 of 
September 17, before noon, 68 and 69 of September 19, 
around 8:00 A.a.s and sets 4, 5, 6 of February 4, around 
3:00 p.at.; and 12 and 13 of February 5, around noon, 
(AT) is large (>0.4°C). In the other group, which 
includes sets 72 to 75 of August 6, around noon; sets 80 
to 82, soon after sunrise on August 7; 88 to 90 near 
midnight of August 7 and 8; and 91 to 94 following 
sunrise on August 8, (AT)w is about 0.2°C or smaller. 

Thus in the April, September, and February epochs 
the largest occurring f-values are associated with large 
(AT), values, but not so in the July epoch. We have no 
certain explanation for the existence of large effects 
when the wall thermometers give reasonably small 
(AT),, values, but would point out that no temperature 
data are available to reveal thermal conditions at the 
roof, which may be responsible for the large fringe 
displacements at the times of highest altitude of the sun. 

Although the thermometer readings cannot be used to 
describe the thermal conditions in the observation hut 
in all cases, they should nevertheless provide an indica- 
tion of the stability of the thermal pattern in the hut 
affecting the fringe positions, particularly during the 
night hours. Accordingly the Mount Wilson data for 
each of the four epochs of observation have been 
searched for sets of readings taken during this part of 
the day and which exhibit “similar” temperature 
patterns. 

Ideal observing conditions could obtain if all four 
thermometers read the same and remained constant 
throughout a series of observations. Departures from 
the ideal always occur, and the extent to which tempera- 
ture disturbances are revealed by the recordings of the 
thermometers may be estimated by applying the 
following criteria: 


1. The magnitude of the average absolute tempera- 
ture difference (AT)n=((T:—1))a(¢=1, 2, 3, 4). 

2. The time rate of change of the temperature pattern 
in the room as indicated by the readings of the four 
thermometers. 

3. The drift in mean temperature, T, with the time. 

4. The linearity of the drift of the fringe pattern and 
the nonreversal of sign of this drift during a series of 
observations. 


Tne last two criteria are particularly important because 
of the complex time lag in the thermal behavior of the 
heavy steel interferometer base and the mirror supports. 

Thus, on the hypothesis that the second harmonics of 
the fringe displacements are due primarily to tempera- 
ture conditions, the observed fringe behavior throughout 
a set of midnight-dawn experiments should be the same 
within the experimental uncertainties on nights when 
the temperature conditions temained rather constant, as 
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Fic. 4. The temperature pattern in the observation hut and the 
observed fringe shift is shown here for three dates: Fig. 4(A), 
August 30, 1927 (Cleveland); 4(B), September 23, 1925 (Mount 
Wilson); 4(C), July 30, 1925 (Mount Wilson). Each group of four 
points in the upper parts of the figures represents the thermometer 
readings on the four walls of the hut at a given time between 
midnight and dawn (a range of about 5 hours in sidereal time). 
Arrows at the left of the temperature scales indicate a 1° range. 
The lower part of each figure shows the fringe shift as a function of 
azimuth index for the times corresponding to the temperature 
patterns. The lower part of each figure shows the fringe shift as a 
function of azimuth index for the times corresponding to the 
temperature patterns. The dashed curve in Fig. 4(C) shows the 
Tinge displacement when direct sunlight fell on the interferometer 
at dawn. 


defined by the criteria above. This, in fact, is the case. 
Figure 4 illustrates the observed influence of several 
temperature conditions on the second harmonics of the 
fringe displacements. Figure 4(A) shows ten sets of 
observations, Nos. 31 to 40 inclusive, made in the hut on 
the Case campus between midnight and 5:00 A.m. on 


August 30, 1927. During this entire interval the readings _ 


of the four thermometers were remarkably constant, the 
average temperature changing by only 0.4°C. Likewise 


to 
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the second harmonics are almost identical in both phase 
and amplitude throughout the entire series. This be- 
havior persists throughout almost five hours of sidereal 
time as the earth makes nearly } of a revolution and 
would be extremely unlikely if the fringe shifts were due 
to any cosmic effect. On the contrary, it strongly sup- 
ports our hypothesis that local temperature conditions 
are the dominant factor producing the observed second 
harmonics. x 

Similar correlations between the observed second 
harmonics and the temperature conditions existing from 
midnight to dawn can also be made in each of the four 
epochs during the Mount Wilson experiments. In all but 
one case where the criteria given above indicated good 
temperature conditions, the second harmonics were 
nearly alike during the time interval encompassed by 
the observations. A typical example of Mount Wilson 
results taken under rather good temperature conditions 
is shown in Fig. 4(B) for runs 75 to 83 inclusive taken 
from #2218 a.m. to 6:00 a.m. on September 23, 1925. 
Figure 4(C) will be dis@ussed later. 

Other sets of Mount Wilson data taken under steady 
temperature conditions and which show constancy of 
both amplitude and phase in the second harmonics of 
the fringe displacements are the following: April 2, 1925, 
sets 113, 114, 115, 116, 117, and 118, taken between 1:52 
A.M. and 4:58 a.m.” On August 8, 1925, sets 88, 89, 90, 
91, 92, and 93 were made between midnight and 6:26 
AM. and with similar room temperature patterns 
throughout, and again the amplitudes and phases of the 
second harmonics are nearly alike. A final example is 
provided by data taken between 2:30 a.m. and 6:38 A.M. 
on February 11, 1926, in sets 84 to 91 inclusive. Here 
again the temperature patterns during all runs are 
similar, and within the experimental errors, all the 
second harmonics are alike. 
` The April 2 and September 23 runs show another im- 
portant property, namely that the maxima of the second 
harmonic curves are definitely removed from the north 
point (the maxima of the other two night runs being 
close to it). This behavior throughout nearly six hours 
of sidereal time conclusively rules out cosmic effects. It 
also removes magnetostriction as the possible cause of 
the fringe displacements, since the interferometer steel 
cross was not dismantled during the four epochs of the 
Mount Wilson observations; hence any magnetostriction 
effect should not change its directional character during 
the four epochs. : 

In addition to the night runs cited above, daytime 
conditions occasionally gave steady temperature condi- 
tions as judged by the aforementioned criteria for three 
or more consecutive sets, which then usually showed 
similar second harmonics. When temperature conditions 
departed markedly from the ideal as defined by our 

9 Thi y: zi > sets 119 and 120 taken after sunrise, 
heal meen m nt the room had changed due to 


heati zall of the hut. These latter sets exhibit con- 
o amplitudes and different phases in the second 


harmonics. 
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criteria, widely varying second harmonics, both as 
regards amplitude and phase, were observed in con- 
secutive sets. 

Particularly remarkable among the daytime observa- 
tions are sets 56, 57, and 58, made in 1924 in a basement 
room of the Case physics laboratory between noon and 
6:00 p.m. on July 8, which give almost identical null 
results. In fact, throughout most of the turns of the 
interferometer the fringes showed no change in position 
whatever. The temperature conditions were ideal; all 
four thermometers gave identical readings throughout 
each set of observations, and the drift in average room 
temperature was only 0.1°C throughout the entire 
afternoon. If our hypothesis is correct, then this group 
of observations is the best ever made with Miller’s 
interferometer and shows the zero effect consistent with 
the results of other experimenters.” 

Among the Mount Wilson night set$ there is, however, 
one unusual series of observations, Nos. 21 to 28 in- 
clusive, made between 1:43 a.m. and 6:04 a.m. on July 
30, 1925. Here the temperature criteria are in part 
violated (but not very strongly), and in part satisfied, 
yet the run shows an extremely erratic behavior 
[Fig. 4(C)]. We have no ready explanation for this 
apparent departure from the four other night runs with 
“steady thermometric conditions.” Perhaps the fact 
that the canvas over the roof was not yet installed is the 
reason; or again canyon winds may have been unusually 
troublesome. It is certainly not possible to blame the 
lack of regularity of this run on the statistical incom- 
pleteness of its single sets. If one reduces half of a set by 
Miller’s method, one obtains essentially the same result 
as that obtained by reduction of the whole set, the 
agreement between the two results becoming pro- 
gressively closer during the course of the night’s 
observations.”! 

The run of Fig. 4(C) differs in the following respect 
from the other night runs. One may define some 
measure of the “noise” of a set of observations by taking 
the average, for the whole set, of the absolute values of 
the first differences of the single readings. In the run 
shown in Fig. 4(A), one so finds 0.079 and 0.063 fringe 
for sets 31 and 32 around midnight, and this figure drops 
steadily to 0.035 for No. 40 at 5:00 a.m. In the Mount 
Wilson night runs, with the exception of that of Fig. 
4(C), the corresponding “noise” drops from about 0.110 
to about 0.060 fringe, while it decreases from 0.190 to 


% Sets 57 and 58 were made with sunlight, rather than the usual 
acetylene source employed in set 36. The null results obtained with 
sunlight are of especial interest in that they disprove the“Kitz 
emission theory of light. Trials of the Michelson-Morley experi- 
ment with sunlight or starlight had been urged by Tolman and 
also by LaRosa to test this point, see R. C. Tolman, Phys. Rev. 35, 
136 (1912); M. LaRosa, Nuovo cimento 3, 345 (1912); Phys. 
Z. 13, 1129 (1912). eat: 

2 This holds for all of these night runs; in particular, the sets 
shown in Fig. 4(A) from No. 35 on have a surprisingly high degree 
of statistical completeness in the above sense. Together with the 
autocorrelation results of Table II for sets 42, 75, and 79, this seems 
to justify Miller’s method of correcting for drift by assuming it to 
be linear. 
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0.100 in the-night run of Fig. 4(C), if the last set, No. 28, 
is omitted.” This indicates an irregularity in the condi- 
tions of the exceptional run which is not present in the 
other night runs and is hardly reflected in the ther- 
mometer readings. ‘ 


CONCLUSIONS 


We believe that this discussion of the effect of temper- 
ature permits the following inference: Under the most 
favorable experimental circumstances the second har- 
monics in the Mount Wilson data remain essentially 
constant in phase and amplitude through periods of 
several hours and are then associated with a constant 
temperature pattern in the observation hut. This, to- 
gether with the statistical and mechanical analyses, 


2 Set 28 (indicated by the dashed line) was taken at sunrise, 
and Miller noted, “Sun shines on interferometer; fringes becoming 
unsteady.” The verf large increase in the second harmonic in 
this set clearly reveals the effect of radiant heat on the inter- 
ferometer, and similar effects occur at dawn in other sets. In 
judging the sensitivity of the interferometer toward radiation, 
it should be kept in mind that it was exposed to the direct sun rays 
only through cracks in the beaverboard wall of the hut or leaks 
around windows and the door. On July 31, a large canvas cover 
was put over the roof (apparently at some distance above it) and 
over most of the walls of the hut in order to protect the inter- 
ferometer from light leaking through and to get rid of the effects of 
the direct irradiation of roof and walls. The canvas cover was at 
least partly in place also in the September and February epochs. 
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forces us to conclude that the observed harmonics in the 
fringe displacements are not due to a cosmic phenome- 
non (aether drift), nor to magnetostriction, nor to 
mechanical causes, but rather to temperature effects on 
the interferometer. These disturbances were much more 
severe at Mount Wilson than those encountered by 
other observers in their repetitions of the Michelson- 
Morley experiment performed in laboratory rooms. 

Table I summarizes the results of the significant trials 
of the Michelson-Morley experiment. The first three 
columns of the table record the experimenter, year, and 
place of observation. The fourth column gives the 
optical path length D of an interferometer arm, as used 
in each of the several experiments. Column 5 gives the 
maximum anticipated fringe displacement correspond- 
ing to a 30-Km/sec aether drift velocity when the 
apparatus is rotated through 90° in its optical plane. 
Column 6 lists the amplitude A of the second harmonic 
of the fringe shifts actually found by each observer. In 
every case the observed double amplitudes 2A are much 
smaller than the expected fringe displacements listed in 
column 5. To provide a basis for comparison of the 
several trials of the experiment, the ratios of the ex- 
pected fringe displacements in column 5 to the values of 
2A actually found in each experiment are listed in the 
last column. 
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Prediction and Retrodiction 


SATOSI WATANABE 
U.S. Naval Postgraduate School Monterey, California 


An attempt is made within the framework of the accepted quantum physics to achieve the maximum 


parallelism between prediction (inference of the future observational data from the present ones) and 
retrodiction (ifference of the past observational data from the present ones). To implement this program, it 
is shown that the “retrodictive state function” (extrapolation of the present data to the past) can be just as 
useful as the ordinary “predictive state function” (extrapolation of the present data to the future). This leads 
to a formalism in which time-reversal becomes a linear transformation and double time-reversal becomes a 
c-number. In spite of this formal symmetry, it can be shown that the actual success of a retrodiction depends 
on the satisfaction of an additional condition which is not required in prediction, and which is not always 
fulfilled. From the same point of view, a logical loophole is pointed out in the indiscriminate application of the 
H-theorem to the past. The so-called irreversibility of observation is interpreted in terms of the decrease of 


“information” in the process of inference. 


1. INTRODUCTION 


N accordance with its expected role in human ac- 
tivities, physical theory is pre-eminently a pre- 
dictive instrument. Man is, however, not immune to 
temptation of the adventure of guessing with the same 
instrument what happened in the past outside the reach 
of his own observation. In the nonstatistical domain of 
classical physics, retrodiction must be in principle just 
as successful as prediction. However, in statistical 
applications of classical physics and in quantum physics, 
a careful study is needed to determine the confirmability 
of an attempted retrodiction. The present paper is 
intended to provide an answer to some of the rudi- 
mentary questions in this rather neglected field of 
intellectual interest. Although some new points of view 
and a new formalism are introduced, the content of this 
paper will remain perfectly faithful to the accepted 
premises of classical and quantum physics. It should be 
noted that retrodiction is a question defined differently 
from the so-called time-reversal, although it is related to 
this in a‘certain way which will become clear in our 
Sections 3 and 5. 

There have been at least three circumstantial in- 
centives which motivated undertaking this work. In the 
first place, it was emphasized by the author ina previous 
paper! that an essential difference between classical 
physics and quantum physics lies in the fact that in the 
latter the result of an observation can be used as the 
initial condition of the “state” immediately after the 
observation, but not as the final condition of the “state” 
immediately before the observation. Although this is in 
agreement with the customary usage of quantum 
physics, the conscious emphasis on this fact led the 
author himself to inquire whether one could not formu- 
late quantum physics in such a way that the result of an 
observation can be used as the “retrodictive state” just 


l i ibuted by the author to the monograph, Louis 
de ETA a Beate (Albin Michel, Paris, 1952), p. 385. 
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before the observation.” This question will be answered 
in Section 5 of this paper. Although the answer is in the 
affirmative, the actual usefulness of such a retrodictive 
theory is extremely limited. 

The second motive stemmed from an enlightening 
illustration that Dr. Keith Symon chose in a conversa- 
tion to explain the reason why the H-theorem cannot be 
used for the past. He imagines that a man discovers on 
a desk two piles of playing cards, one in a perfect order 
and another in disorder. In spite of the fact that every 
permutation of cards has the same a priori probability, 
he would not guess that the well-ordered pile is a result 
of shuffling, but he would justifiably infer a selective 
human intervention in the past of the well-ordered pile. 
Keeping in mind that a “permutation of cards” corre- 
sponds to a quantum state, ‘“‘well-ordered-ness” and 
“disordered-ness” to macroscopic cells, and “shuffling” 
to ergodic process, the reader will find that this pattern 
of inference is given a mathematical expression in our 
formulation of retrodiction in Section 4. 

Thirdly, everyone familiar with the quantum theory 
of time-reversal* is rather disturbed by the fact that the 
operation of time-reversal is not a linear transformation, 
and also by the fact that the operation of double time- 
reversal does not become an identity transformation. 
One could expect that these esthetically unwelcome 
features of the theory may be avoided by a formulation 
which treats prediction and retrodiction on an equal 
footing. It will be shown in Section 5 that this expecta- 
tion is justified. 

The problem of retrodiction may be -formulated in 
brief as follows: An observer B would like to guess from | 
his own experimental data the result of another observer 


2Tt is the pleasure of the author to note with thanks that Dr. 
Adolf Griinbaum in a private communication encouraged under- 
taking clarification of this question. See also A. Griinbaum’s 
article in the monograph, Philosophy of Rudolf Carnap (Tudor 
Publishing Company) (to be published). . 

3S. Watanabe, Phys. Rev. 84, 1008 (1951). See also, S. 
Watanabe, Revs. Modern Phys. 27, 26, 40 (1955). 
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A who observed the system some time before B and who 
has not confided his result to B. The main difficulty for 
retrodictor B arises from the fact that in his retrodictive 
inference he has to assume, apart from his own experi- 
mental finding, an a priori probability to each possible 
initial state (in which A might have found the system). 
There is in genera} no reason to assume an equal a priori 
probability for each quantum state except when the 
initial ensemble given to A can justifiably be assumed to 
be the result of an ergodic process. It will be explained in 
Section 3 by a simple example how easily a retrodictor 
can completely fail while a predictor cannot fail, 
naturally in the statistical sense of the word. 

However, by assuming the uniform a priori initial 
probability, one can obtain an interesting formalism 
which exhibits on one hand a complete symmetry with 
respect to the two directions of “time,” but which on the 
other manifests a definite one-way-ness of the direction 
of human “inference.” In short, the present paper may 
be said to be an elaboration in the light of quantum 
physics of the following pregnant words due to W. 
Gibbs‘: 

“Tt should not be forgotten, when our ensembles are 
chosen to illustrate the probabilities of events in the real 
world, that while the probabilities of subsequent events 
may often be determined from the probabilities of prior 
events, it is rarely the case that probabilities of prior 
events can be determined from those of subsequent 
events, for we are rarely justified in excluding the con- 
sideration of the antecedent probability of the prior 
events.” 


2. MICROSCOPIC RETRODICTION 
Let S be a complete set of eigenstates, 
Sin Day Sey 795 DH 2975 (2.1) 


of a family of mutually commuting observables defined 
with respect to a certain physical system. We shall use 
the same symbol S also to designate this family of 
observables. The completeness of S implies that the 
probability p; of the system being found in state S; 
satisfies 


ipl. (2.2) 


A family T of mutually commuting observables which 
do not commute with S will define another complete set 
T of eigenfunctions. 

The probability that the system which was in state S; 
at the initial instant will ke found in state T; after 7 
seconds will be denoted by 


P(ST;, 7)=PG>)), (2.3) 
SiS, TT, (2.4) 


where S and T may or may not be the same complete 


4J. W. Gibbs, Elementary Principles in Statistical Mechanics 
(Yale University Press, New Haven, Connecticut, 1914), p. 150. 


eo 


set. On account of the assumed completeness, we have 
wi PG7)=1. (2.5) 


From the invariance of dynamical laws for time- 
reversal (reversibility) or from the invariance for space- 
and-time-inversion (inversibility) we can conclude the 
inverse normalization? : ; 


zae i y 


This can also be derived from the unitarity of transition 
matrix. 

Suppose that an observer A observes the system at 
t=0 with the observable-family S, and that observer B 
observes the same system at ‘=7 with the observable- 
family T. “Prediction” consists in the following position 
of problem on the part of A. Knowing that observer B 
will observe with T at /=7, observer A proposes to guess 
the result of B on the basis of his own result. If observer 
A had the result S;, then his prediction will be that the 
probability of B obtaining T; will be P(ivg). This 
means that, if observer A prepares a large number JV of 
the cases where the result at /=0 was S,, then VP(i—7) 
will be the number of cases where observer B will obtain 
T; at t=7. Observer A will be called predictor and ob- 
server B monitor. 

“Retrodiction” is now to be defined in a close analogy 
to the previous problem, only interchanging the roles of 
A and B. Knowing that observer A observed the system 
with S at ‘=0, but not knowing what his result was, 
observer B proposes to infer the result of A from his own 
result that the system is found in 7; at ‘=r. B will be 
called retrodictor and A monitor. 

This question does not have a unique answer unless 
retrodictor B assumes a certain statistical behavior of 
monitor A regarding selection of the ¿nitial states. 
Independently of his own result at ‘=r, retrodictor B 
may have some general information about A, on the 
basis of which he may assume that monitor A has the 
general habit of selecting (and handing over to B) states 
S: with weight w:()[;w:=1). If A prepared a large 
number NV of cases at =0, then Vw; among them must 
have been in S;, according to the assumption. At the 
receiving end, Vw;P(i—j) among these Vw; will turn 
out to be in 7;. The total number of cases which will 
land in T; will then be 


Z: Nw.P Gj), < (2.7) 
among which Vw,P(i—j) have originated from Si. 
Then retrodictor B will say that the probability Q (iJ) 


that a system which was found at ‘=7 to be in T; had 
been found in S; at ¿=0 is 


Nw;PG}) w:P (i) 
LiNwPG>j) Dew PG) 
We use the “left-to-right” order to indicate the chrono- 


logical direction, and an arrow to indicate the direction 
of inference. 


OG) = (2.8) 
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It should be clearly understood that the foregoing 
result does not mean at all that if A prepared an en- 
semble with the weight given by Eq. (2.8) for each S; 
then retrodictor B would obtain the result T}. Indeed, 
if A started with the weight distribution given in Eq. 
(2.8), then B would obtain T, with weight 


Li w:P(i—>j)P(i—k 


LawiP (ij) 


In other words, Eq. (2.8) represents the weight of S; in 
the subset of systems ending in T; when the entire 
ensemble has the weight distribution w,. Insofar as the 
estimation of w; is correct, observer B’s retrodiction 
based on Eq. (2.8) must be statistically successful in 
this subensemble. If the estimation of w; is unreliable, 
all retrodiction is meaningless. 

It is true that retrodictor B could in principle find 
out the w’s from the observed values of the quantity 
(2.7) for all the T’s, if the number of the S’s equals 
(or is less than) the number of the 7”s, and if the deter- 
minant of P(i—j) does not vanish. This, however, 
implies that before any attempt at retrodiction starting 
from a particular final state T; can be made, a very 
large number of observations has to be made for all 
T’s. Furthermore, it must be guaranteed that the values 
of the w’s are kept fixed during all the observations. 
Thus, determination of the w’s by retrodictor B is 
seldom feasible in practice. 

If retrodictor B does not have any preliminary 
knowledge about the habit of A, the only thing he can 
do is to resort to the principle of ignorance and to assume 
that thea priori probability w;is equal for each quantum 
state S;. This attitude of B will be successful (verifiable 
by repetition) only if A prepares an ensemble with equal 
weight for all quantum states (similar to the micro- 
canonical ensemble on an energy shell), and if B picks up 
only those cases which have landed in 7; and then 
classifies them according to various possible initial 
states. According to this simplifying assumption, Eq. 
(2.8) will become 

Pls) 
i—j)=———— (2.9) 
GEIS lap 


and, further with the help of the inverse normalization 


Qoli-j) =P 9). (2.10) 

It should be well noted that Eqs. (2.9) and (2.10) are 
based on a specific assumption that w; is uniform. In 
fact, retrodictor B can very easily Fa oe by 
monitor A. Suppose for instance that there are only two 
possible states (1) and (2) and P(1>1)=P(1—2) 
= P(2>1)= P(22)=2- No matter what ratio wi/w2 
monitor A may choose, retrodictor B will find one half of 
the cases in state (1) and the other half in state (2). 
Conversely, observer B’s retrodiction based on the equal 
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distribution that one half of the cases must have origi- 
nated from (1) and the other half from (2) may be com- 
pletely wrong; monitor A may have handed over to 
retrodictor B only those systems which were found by A 
to be in (1) at £=0. The beSt way to avoid this deception 
on the part of B would be to impose on A, as a rule of the 
game, that he should pick up cases at random from the 
“microcanonical ensemble.” Then Eq. (2.9) or Eq. (2.10) 
will have a meaning in a subensemble which lands in 
T ;. We shall hereinafter refer to the retrodiction based 
on the uniform w’s as a “blind retrodiction.” 
Prediction, in contrast to retrodiction, has a simpler 
rule of game: the monitor (posterior observer here) is 
required to show all his results. Then, the prediction 
based on Eq. (2.3) will always be statistically successful. 
It should be emphasized that this asymmetry between 
prediction and retrodiction originates from the asym- 
metry of the “rules of game.” In prediction, the 
predictor has the right to prepare the ensemble, while in 
retrodiction, the monitor has the right. We can easily 
change the rules to make prediction just as unreliable as 
retrodiction. Suppose monitor B referring to a prediction 
has the tendency to forget to record some of the cases in 
such a way that the chance of state T; being recorded by 
him is proportional to wj. Then, the prediction by A 
will be that the systems registered by him as .$; will be 
recorded by monitor B with the distribution given by 


Pi j)w; 
Dj Pim jw; 


which offers a nice parallelism to Eq. (2.8). We shall 
however seldom have to deal with such a “forgetful” 
observer. The “rules of game” must be chosen in each 
case in such a way that they correspond faithfully to the 
nature of the actual description of physical phenomena 
under consideration. In this sense, P(i—j) given in 
Eq. (2.3) may be used for prediction, but for retrodiction 
we have to use Q(i—j) given in Eq. (2.8) with inde- 
terminate w; in general cases. 

Exception has to be made to the entire consideration 
of this section either if (1) S and T are the same set and 
it commutes with the Hamiltonian of the system or if 
(2) Sand T are the same and the time duration 7 is zero. 
In this case, P(i—j) =6;; and Eq. (2.10) follows auto- 
matically from Eq. (2.8) irrespective of the w’s: 

Qli— j) = Pip) = ôi; 
Retrodiction is perfectly successful in this special case. 

The situation in classical physics may’ be included in 

Eq. (2.10), if the “state” is determined as precisely as 


possible in principle, i.e., if the system is located at a 
point in the phase space. 


3. MACROSCOPIC RETRODICTION 


We shall now introduce the concept of macroscopic 
cells in our consideration. The macroscopic observations 
are known to be compatible with one another, therefore 


(2.11) 


(2.12) 
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we can think of a family S of microscopic observables 
which are compatible with all these macroscopic obser- 
vations. In general, this S will not be commutable with 
the exact Hamiltonian of the system.* Suppose further 
that the eigenstates, Sı, S2, etc., are grouped into 
macroscopic cells which are labeled by p=1, 2, ---, in 
such a way that cell »=1 contains nı eigenstates of S, 
cell u=2 contains 72 eigenstates of S, etc. 

A macroscopic prediction consists in inferring the 
probability of finding the system in cell » at =7 when 
it is known that the system was found in cell » at ¿=0. 
The answer will be, in terms of the microscopic transi- 
tion probabilities, * 


LOG 
P(u>r)=— DU PH), 
t 2 


Ny 


(3.1) 


where >>,“ means that ¿i should run over all the 
eigenstates contained in cell u. It should be noted that 
this answer is based on the equal weight of S; within cell 
u, and complete disorder of phase among these states S;. 
In other words, we are taking as the initial state a 
density matrix (statistical ensemble) which corresponds 
to the Hilbert subspace u. Writing @[¢ ] for the projec- 
tion operator for quantum state ¢, we can express our 
initial ensemble by 


1 w 
G=— OL ei]. 


Ny 


(3.2) 


This is the best we can do under the given information 
that the system was found in p at =0. The quantity 
given in Eq. (3.1) is of course normalized with regard to 
v, i.e., >>» P(u—-v) = 1. 

Now the retrodiction consists in inferring the proba- 
bility that the system had been found in macroscopic 
state u at t=0, when it is known that the system was 
found to be in macroscopic state v at ‘=7. Again 
introducing the a priori probability w, for each quantum 
state in cell 4, one will answer that the probability in 
question is given by 

WNP (uy 
Q(uer) ese eS 
Lin WP (uy) 


(a) (») 


Wy 2 23 RGI) 


(3.3) 
(x) () 


DORDE DD Pj) 


If we can assume that the background ensemble of A 
was a microcanonical ensemble, i.e., if all the w’s are 
equal, then we can simplify Eq. (3.3) to the form: 


Qolu») = (n,/1%»)P (uv), (3.4) 


5 The exact Hamiltonian may commute with the “macroscopic 
energy” but not with the other macroscopic quantities. J. v. 
Neumann, Z. Physik 57, 30 (1929). A 

* Note added in proof.—The P on the left hand side of Eq. (3.1) 
is what was denoted by W in Parf I. (See reference 6.) 


o 
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where the inverse normalization given in Eq. (2.4) has 
been utilized. The probability given in Eq. (3.3) or 
(3.4) satisfies the normalization condition with regard 
to u: Din O(u—v) =0. 

It might appear as if prediction in the macroscopic 
case were equally unreliable as retrodiction since we 
have to use the assumption of equal probability (within 
cell x) also for prediction here. However, it should not 
be forgotten that it is observer-predictor A himself who 
prepares the initial ensemble, therefore unless he puts 
uneven, selective weights to various states within cell y 
he can succeed. On the other hand, the supposed even 
weight all over the energy shell assumed in Eq. (3.4) is 
not in the control of observer-retrodictor B, who there- 
fore can very easily fail in his retrodiction. 

Let us next examine the consequences of reversibility 
(invariance for time-reversal) and inversibility (invari- 
ance for space-and-time inversion) on our problem. The 
reversed state S’ of a state S means the one in which all 
the particles have the same positions as in Şut the 
equal and opposite velocities to those in S. The inverted 
state S” of a state S means the one in which all the 
particles have the same velocities as in S and the 
space-inverted positions as compared with S.° Reversi- 
bility and inversibility, which hold in the basic processes 
in quantum mechanics, then mean: 


P(S>T)=P(T'S’). (3.5) 


The macroscopic observations usually’ cannot dis- 
tinguish a state from its reversed or inverted state. In 
other words, a cell u contains the reversed, as well as 
inverted, state S’ of a state S if it contains S. Then from 
Eq. (3.1), we obtain in virtue of Eq. (3.5), 


1 w © Ny, 
P(u>)=— Ye Dy Pl’) =—P On). (3.6) 


Np Ny 


Using this relation, we can write Eq. (3.3) in the form: 


WP (yp) 
Our) =— (3.7) 
Liu MP (vn) 
and with the assumption of uniform weight, 
Qo(u—v) = P (vn) (3.8) 


which has a striking simplicity. Equations (3:4) and 
(3.8) are applicable only to the “blind” macroscopic 
retrodiction. 

What has been developed in this section also applies 
to classical, statistical considerations if we replace the 


number of quantum states by the volume in the phase 
space. 


__° For the precise definition of time-reversal and space-and-time 
inversion, see Sections 3, 4, Part I, and Section 3, Part II, S. 
Watanabe, Revs. Modern Phys. 27, 26-76 (1955). 

This is certainly the case if we limit the macroscopic quantities 


to a certain category, for instance to the thermodynamical 
variables, 


ie 


tè 


FONS 
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4, APPLICATION OF THE H-THEOREM TO THE PAST 


The probability P(v—y) depends naturally on the 
length of the interval r between the two observations 
The ergodic H-theorem, in essence, states that if the 
exact Hamiltonian does not commute with the observa- 
ble-family S which is compatible with the macroscopic 
observations, then the probability P(v—y), averaged 
over possible values of 7,’ is proportional to the size of 
the final cell p: ` 


(P(»u))=n,/N, (4.1) 


where JN is the total number of quantum states on the 
energy shell. 

Equation (4.1) shows that if we take a value of 7 
arbitrarily from its possible domain, 0<7<%, the 
probability P(v—ų) of finding the system in a large cell 
p is large. Invoking now the relation (3.8), we can say 
that on the assumption of blind retrodiction, the proba- 
bility .Qo(u-v) that the system had been found at 
t= -—7 in a large cell pais also large if —r is arbitrarily 
taken from its possible domain — œ < — r <0. If we use 
the Boltzmann entropy $s, 


Sg=logn,. (4.2) 


We can say on the basis of blind retrodiction that if Sz 
at ¿=0 has a certain nonmaximum value, then it is just 
as probable to have a larger entropy value in the future 
as in the past. This is the well-known conclusion of a 
formal application of the H-theorem to the past. We 
could also use the Gibbs entropy Sg, 


So(7)=— Lia P(vu) log P0—>y)/na], (4-3) 
Se(—7)= Ln Qu») loglO(u—v)/mu], (4-4) 


with 7>0, but it may be easier to visualize the situation 
with the help of the Boltzmannian entropy. 

The foregoing argument is based on the premise of 
blind retrodiction which may be the only possible basis 
of inference if it is perfectly certain that the system had 
been isolated from the exterior system (except a possible 
prior observer who does not perform any kind of 
selection) and if we have absolutely no other informa- 
tion about the system than that it was found in v at /=0. 
However, such conditions are seldom satisfied in the 
actual circumstances. A sounder inference than the 
mere blind retrodiction, in line with Symon’s idea ex- 
plained in our Section 1, would be somewhat as follows: 

Consider two cells y and y such that 2,>>n,. Then 


8 Although the noncommutability of the exact Hamiltonian with 
S is the main hypothesis, we need some more auxiliary conditions 
to derive this result. For the two versions of these conditions, see 
J. von Neumann, Z. Poeh o o (1929); and W. Pauli and M. 

i ik 572 (1937). 
ei Uist 1 pA A Neumann’s proof can be only for the 
positive values of 7. It should be noted also that the H-theorem 
Considered here refers to one initial observation ({=0) and one 
final observation (t=7) and is different from the cnet ina 
based on repeated observations. See Section 7, Part I, S. 
Watanabe, Revs. Modern Phys. 27, 26 (1955). 
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according to Eq. (3.6), we have 
—=—<1, (4.5) 


and according to Eq. (4.1) we have also 
(P(k—v)) m, i 
——=— K] (x, \ arbitrary). (4.6) 
(P(Q\n)) Mu 


Now let us assume that we find a system at /=0 in cell v. 
Seeing from Eq. (4.5) that it is extremely improbable 
for a system starting from a large cell » to reach a small 
cell y, we suspect that such was not the actual history 
behind the system we have just found in v. This infer- 
ence is a direct contradiction to the aforementioned re- 
sult based on the uniform w. Thus we are led to modify 
the assumption of uniform w in such a manner as to give 
less weight w, to larger cells y and larger weight to 
smaller cells. Such an assumption of nonuniform w is 
perfectly allowable according to our theory. In fact, if 
there is any possible doubt about the isolation of the 
system in the past, for instance, there is no reason to 
adopt the hypothesis of blind retrodiction. Then the 
result of observation that the system was found in a 
small cell at present can very well reflect itself in our 
estimation of the w’s. 

Once we have abandoned the assumption of uniform 
w, we cannot use Eq. (3.8) any longer and have to go 
back to Eq. (3.7). In spite of the fact that P(v—p) may 
be large, Q (uv) can be small if w, is small in Eq. (3.7). 
And the probability of the system having originated 
from a small cell can become quite large. Thus the 
entropy value Sg at t=—r may probably have been 
smaller. Our formalism is flexible enough to incorporate 
this very reasonable inference. 

The foregoing argument can be applied also to the 
classical, statistical mechanics. It is interesting to note 
how our argument can stand the famous objection due 
to Loschmidt. It is true that in cell y there are just as 
many microstates headed for larger values of entropy 
Ss in the future as those which have originated from 
larger values of entropy in the past. If w is uniform, then 
each microstate inside cell » will be occupied by the 
same weight on account of the permanence of the 
microcanonical ensemble.” Then, Loschmidt’s argu- 
ment becomes valid, and we have to conclude larger 
values of entropy for the future as well as for the past. 
But if w is not necessarily uniform, then we need not 
assume equal weight for eath microstate*inside the cell 
for the purpose of extrapolation towards the past. Then 
Loschmidt’s objection does not hold any longer. For a 
realistic macroscopic retrodiction, we should not use the 
uniform weight within the macroscopic cell y, while it 
may be assumed for prediction. T 

It is interesting to note that the blind application of 
the ergodic H-theorem to the past does not actually 


2 This is a consequence of tie inverse normalization, Eq. (2.6). 


~ 
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yield any newer information than what one has put in as 
the assumption. The combination of Eqs. (4.1) and 
(3.8) gives 


(Qo(ucv))=n,/N, (4.7) 


which is nothing but an expression of the uniform 
probability, an assumption which has been used in 
deriving Eq. (3.8). 


5. RETRODICTIVE QUANTUM MECHANICS 


Our basic equation (2.8) for retrodiction can be 
written as 


Qj) =w Qoli) wi), (5.1) 
with the help of Eq. (2.10): 

Qli—j)=P(i>)), (5.2) 
where the w; depends on the over-all judgment of the 
retrodictor. Only when the system has been isolated in 
the past and there is no other clue to its past history 
than the observational fact that the system is found at 
present in state j, will the retrodictor use the uniform 
value of w; for various 7’s and Q(i—j) will reduce to 
Qoli—j). What follows mainly concerns the blind 
retrodiction represented by Qo, but Q can be derived 
from Qo by the use of Eq. (5.1) if there is any way of 
estimating w; 

In this section, we shall first show that the quantity 
given in Eq. (5.2) can be calculated in two ways: either 
solving the Schrédinger equation with the initial condi- 
tion S; or solving the same equation with the final 
condition T;. Although the resulting values of proba- 
bility are the same, the first method agrees better with 
the idea suggested by the right hand side of Eq. (5.2), 
while the second method reflects more faithfully the idea 
suggested by the left hand side. Since the first method is 
the customary one, we shall only show how the second 
method can be used to evaluate the same probability. 

Let the eigenfunctions of S be called ¢1, p2, ---, pi 7t 
and those,of T yı, Y2, ---, Yj, ---. Further, let the solu- 
tion of the Schrédinger equation, 


ðY (i) /di= —1H()V (d), (5.3) 
satisfying the final condition, 
U(r)=¥3, (5.4) 


be denoted by W,(#). Expanding Y,(0) according to 
the g;: 


a 


~ ¥ (0) D: aig; (5.5) 
we can easily show that @;*a; represents the probability 
(5.2). 

Consider the transition matrix U (f,t2) defined by: 
e ðU (tı,t2)/ðl1= —iH (hh) U (ty,t2), 
p OU (b,t2)/ ð= +-1U (t,t2)H (tə), 


U (4,4) =1, U (tat) =U (hi, te) = Ü (t,t). (5.7) 


(5.6) 


e 


Then according to the customary theory, the proba- 
bility (5.2) is given by 


P(i>j)= | WU (7,0) p:)|?. (5.8) 
On the other hand, W,(é) considered in the foregoing is 
Y,(4) = U (lT); (5.9) 

and the coefficients a; are A 


a= (gU (0,7)W). (5.10) 


On account of the unitarity of U and of the relation 
U(0,r) = U— (7,0), as in Eq. (5.7), we obtain 


a= (U (0,7) eis) 


1G 
5 GEO OCOD eae” 
Hence, in view of Eq. (5.8), 
a;*a;= P (ij) = Qo(ie—J). (5.12) 


This situation suggests a new picture of the “state” 
of a system between two observations, one at‘?=0 and 
the other at /=7: There exist simultaneously two states, 
one being a predictive state Y,(¢) which complies with 
the initial condition at =0, and the other a retrodictive 
state Y,(/) which complies with the final condition at 
t=r. Both V,(é) and W,(#) obey the same Schrödinger 
equation. This picture, though redundant in practical 
applications, offers certain intellectual interest, for it 
provides a complete symmetry between two consecutive 
observations. Y, can be given just as much, or just as 
little, “reality” as ¥,. 

Now, we should like to look upon the same situation 
from a slightly different point of view, namely we 
attempt to establish a time-symmetry, not with regard 
to two observations, but with regard to the future and 
past referring to a single observation at Hand. Suppose 
we make an observation at =0 and obtain a result i. 
Then our inference will develop towards the past just 
as well as towards the future. Let us introduce a new 
variable s, called inference parameter, which coincides 
with ¢ when it refers to prediction, and which is equal to 
minus ¿ when it refers to retrodiction. s is then always 
positive. 

The development of a retrodictive state V,(#) starting 
backward from ¢; at =0 is nothing but the extrapola- 
tion of the predictive state and obeys 


a¥,()/a=—iH (OY, (), V-(0)=es (5-13) 
or in terms of s, 
oW,(—s)/ds=+iH (—s)¥,(—s). (5.14) 


The Hamiltonian being Hermitian, the complex conju- 
gate of Eq. (5.14) becomes : 


ov ,*(—s)/ds=—iv,*(—s)H(—s). (5.15) 


Introducing a time-independent unitary operator R, 
called reversion operator,® such that 


(RH (—s)R)T=H (s), (5.16) 
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we can rewrite Eq. (5.15) in the form: 
ARTY *(—s)/ðs= —iH (s)R™|A(—5). (5.17) 
This means that ¢(s) defined by 
&(s)=RIWV,*(—s) or V,(—s)=Rb*(s) (5.18) 
satisfies the same equation as the predictive state, 
°” 8(s)/As= —iH1(3)(5). (5.19) 


The only difference is that &(s) satisfies the initial 
condition : 


&(0)=R7 0 **, (5.20) 
while the predictive state W,(s) satisfies 
V,(0)= p; (5.21) 


The probability of finding this system at =s>0 in 
state Y; will be 


PU>P= | Wah o(s)) [?, 


while the blind probability that the system had been 
found at = —s<0 in state Y; will be 


Qo( jt) = | (Wi ¥-(—s)) |? 
= | (Hj RO*(s)) |?= | (RY *,B(5)) |*. 


In brief, the two inferential states V,(s) and &(s) can 
be treated in a parallel fashion, only using R7* for &(s) 
wherever we would use ¢ for W,(s). Compare Eqs. 
(5.20) and (5.23), respectively, with Eqs. (5.21) and 
(5.22). It would then be a tempting idea to introduce a 
quantity which comprises both Y, and % on the same 
footing. A “double inferential state” composed of two 
components 4s defined by 


W,(s) V,(/) 
Ww(i)= = : 
P(s) &(—#) 
where s is now freed from the condition s>0. This 
function will obey 


(5.22) 


(5.23) 


(5.24) 
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will then have the form: 


W(t) 
wiy=( J . (5.28) 
ERTY* (t) 


‘\ 


We can however liberalize the relationship between the 
two components of Eq. (5.28) without affecting their 
physical meaning. Namely, taking any unitary operator 
W! which commutes with all the known physical 
quantities, we can write, instead of Eq. (5.28), 


W(t) 
wio=( J 
RTW*¥*(t) 


This amounts to replacing W(t) by W(t), which of 
course does not change the meaning of a state function. 
The initial condition of Eq. (5.29) is then 


Pi 
ZM EE .) 
Gi 


To make our discussion more concrete let us take as W 
(5.31) 


where n» in any arbitrary integer and A is given by" 


all 
A=A7=AT=A*=RTR=[[(—1)*%', (5.32) 


(5.29) 


(5.30) 


W=ar, 


in which N; is the occupation number operator for the 
spinor eigenstate labeled 7. A is known to commute with 
the reversion operator, R. Then the general pattern of a 
double inferential function is 


v(t) 
w(i)= ( ) 
A*R™W*(1) 


with arbitrary n. A" is unity when z is even. 
We can now introduce the “reversed” inferential 
function W of Eq. (5.33) by 


po- (2 RUO 
Ww,e()= ; 
; ( v(i) ) 


which certainly falls in the supposed general pattern of 
an inferential function (5.33), only the arbitrary number 
n being replaced by n+ 1, for 


Y (i) =AHRT(A"RTY*(4))*. 


(5.33) 


(5.34) 


(5.35) 


Furthermore, at each value of £, the first and the second 
components of Eq. (5.34) represent respectively the so- 
called “reversed. states” of the first and the second 
components of Eq. (5.33).5 Indeed, for a given state 
@(é), its reversed state can be expressed by WR*(t). 
The transformation from Eq. (5.33) to Eq. (3.34) can be 


written 
wr =RY (i), WO=Rw¥a(}, (5.36) 


11 See Section 12, Part II, S. Watanabe, Revs. Modern Phys. 27, 
40 (1955). 
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aw ()/di=—iH()w(), (5.25) 
18 ; 
with 
H(t 0 
HW) = ( O) ) 
0 —A(—)) 
0 
k (@ ). (5.26) 
0 —(RA()R)? 
The initial condition of ¥ (8) is 
Pi = 
vO- (pr)! (5.27) 
and the solution of Eq. (5.25) at an arbitrary value of i 


Digitized by Arya Samaj Foundation Chennai and eGangotri — 


186 SATOSI 


with 


(5.37) 


The formalism presented here has no practical ad- 
vantage over the current quantum theory, but it has a 
formal advantage’ in that time-reversal is represented 
here by a linear transformation (5.37), and double time- 
reversal® becomes an identity transformation: 


R=1. (5.38) 


6. IRREVERSIBILITY OF INFERENCE AND 
INFORMATION 


Suppose observer 4 prepares a large number NV of 
systems which were found at ‘=0 in state g;. After 7 
seconds, each system will become U (7,0) ¢;. If observer 
B performs at t= 7 an observation with the complete set 
T(Wivs,---), then N| @;,U (7,0) ¢:) |? systems will turn 
out to be in state y;. With the help of projection 
operators @L¢ ] we can write this process in the following 
schema: 


G:=CL¢:}°$2=CLU (7,0) ¢: |}? 
S3s= Dos] VU (7,0) ¢:) | CL]. 


The amount of “information”? carried by the knowl- 
edge about the system represented by G is 


I=Spur(G log§)+ const. (6.2) 


This quantity does not change in the first step of 
transition in Eq. (6.1), but does decrease in the second 
step. This is the famous irreversibility of observation 
pointed out by von Neumann.” It should be noted that 
Gs in Eq. (6.1) does not represent the knowledge ob- 
tained by observer B in individual cases, for in each case 
observer B knows perfectly well in which one of the 
y’s the system is found. G; can be considered as a global 
description of the entire ensemble after the observation, 
or as the prediction of the result of B in each case. 
Next, suppose that observer A prepares at /=0 a large 
number of systems with equal weight in all possible 
gs (eS). Observer B performs at t=r an observation 
with T, and a certain large number NV of systems is 


(6.1) 


12 C. Shannon and W. Weaver, Mathematical Theory of Com- 
munication (University of Illinois Press, Urbana, Illinois, 1949). 
We do not indulge here in the discussion regarding the sign before 
the Spur and regarding the constant in Eq. (6.2). The quantity 
(6.2) was first used by von Neumann, Mathemalische Grundlagen 
der Quantenmechanik (Verlag Julius Springer, Berlin, Germany, 
1932). See also L. Szilard, Z. Physik 53, 840 (1929). For an early 
application of the quantity (6.2) to a concrete physical problem, 
see S. Watanabe, Z. Physik 113, 482 (1939). 
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found to be in state y;. He considers now only those 
systems ending in y;, and ask what percent of them had 
been registered as g; by the previous observer A. Then, 
he extrapolates Y; backward by the Schrödinger equa- 
tion from ż=r to /=0, and calculates | (¢:,U (0,7)¥,) |2. 
His inference will then be that, among N systems that 
he found in y; V| (¢:,U (0,7)¥;)?| systems must have 
been found by A to be in ¢;. Schematically, this infer- 
ence can be denoted by 


Sr =D | (oU 0,7) |?OL Gi ]— 
Go’ =0LU (0,7) ¥i l-9 =O Ls], 
which exhibits a parallelism to (6.1). Gs’ in (6.3) 


represents a partial ensemble immersed in the uniform 
ensemble: 


(6.3) 


Go=r\ Li PLeil (6.4) 


prepared by A. If A would have started with Gs’, then B 
would not obtain G1’. Nonetheless, G3’ represents the 
legitimate inference made by B based on the. blind 
retrodiction hypothesis with regard to the results that A 
had obtained in the systems which were later found by 
Bin Wj. 

If B has any further source of judgment about the 
initial ensemble, he will modify the assumption of 
uniform weight, and attach a reappraised a priori 
probability w; to each ¢;. In this case, G3’ will become 


G ={ w| ConU 0,7)¥s) |?CLe:]}/ 
{3 : w;| (9U (0,7)W;) [3 


in accordance with Eqs. (2.8) or (5.1). 

Tt is evident that, no matter whether one uses G3’ of 
Eqs. (6.3) or that of Eq. (6.5), the amount of informa- 
tion carried by G3’ is smaller than that carried by Gy’, 
i.e., the decrease of information here takes place in the 
backward direction of time. Both the case of prediction, 
Eq. (6.1), and the case of retrodiction, Eqs. (6.3) or 
(6.5), can, however, be included in’ the statement that 
the amount of information decreases in the direction of 
inference, i.e., in the positive direction of the inference 
parameter of the last section. This last result is in a good 
agreement with the common sense, for an inference 
cannot contain more information than the fact from 
which the inference is drawn. 

In the statements in the foregoing, the phrase “‘infor- 
mation decreases” must be replaced by “information 
remains constant” in the following two cases: When 
(1) S and T are the same set and the elapse of time 7 is 
zero, or (2) S and T are the same set and commute with 
the Hamiltonian of the system. 


(6.5) 
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I. INTRODUCTION 


HE Dirac matrices arise in Dirac’s theory of 
electrons and positrons.! The elementary prop- 
erties of these matrices are very well known. However, 
in order to understand some of the features of Dirac’s 
theory, a person needs to be fluent with more than these 
elementary properties. Also some of the current applica- 
tions of Dirac’s theory are based on some of the less 
well-known theorems. The purpose of this paper is to 
give a consistent and reasonably complete account of 
the properties of these matrices. 

The discussion below is restricted to four-by-four 
matrices since only these are needed in Dirac’s theory. 
Most of thesalgebraic properties of the matrices were 
originally developed by Pauli’; the treatment given here 
follows his work closely, especially in the sections on the 
fundamental theorem and on the identities between 
scalars formed from four wave functions. 

In the next section an outline of the derivation of the 
Dirac equation is given in order to introduce the 
matrices. Section III contains Pauli’s proof of the 
fundamental theorem—that any two sets of four-by- 
four Dirac matrices are connected by a similarity trans- 
formation. Sections IV to VI contain discussions of the 
Lorentz transformation properties of the Dirac wave 
functions, the charge-conjugation operation, and the 
formation of covariants. Some special examples of sets 
of Dirac matrices are given in Sec. VII, especially the 
set which is used to show the nonrelativistic limit of 
Dirac’s theory, the set used in developing the spinor 
point of vtew, and a set which makes charge Con JURAN 
identical with complex conjugation. Finally in Sec. VIII 
the quadratic identities connecting scalars formed from 


PALM. inci ics (Oxford 
3 í. Di The Principles of Quantum Mechanics (Oxf. 
uake DER London, England, 1947), third edition, 


Chap. XT. Boy: 
$ W. Pauli, Ann. inst. Henri Poincaré 6, 109 (1936). 
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four wave functions are derived and some of the 
consequences of the identities are discussed. 

The paper is written entirely in terms of ordinary 
matrix theory even though some of the proofs could be 
shortened by using theorems from abstract algebra. 
(The matrices form a set of hypercomplex numbers of 
the type first studied by Clifford.) Especially the funda- 
mental theorem can be proven very neatly by using 
group theoretical ideas; this was done by van der 
Waerden.' Matrix theory is used exclusively and Pauli’s 
proof of the fundamental theorem is, given rather than 
van der Waerden’s in order to emphasize the matrices 
themselves rather than their abstract algebraic prop- 
erties, This point of view makes the arguments closer to 
the applications of the Dirac theory and also makes 
them readable to people not well versed in modern 
algebra. 


Il. THE DIRAC EQUATION 


The purpose of this section is to review the derivation 
of the Dirac equation for a charged particle in an electro- 
magnetic field. The starting point is the Lagrangian for 
the classical relativistic motion of the particle: 


(1) 


where m and e are the mass and charge of the particle, 
x; and £; are its position and velocity,’ and c is the speed 
of light. For an electron, e is negative. The functions 
(x,t) and A ;(x,t) are the scalar and vector potentials of 
the external field. The actual classical equations of 
motion are not needed below but, as a matter of 
completeness, are 


Lli) = mer tij) eb + ec"2,A j 


d mii 
dt (1—c*2,4;)3 


ðb 1əðA:\ e /ðA; ðA; 
E 
Ox; c oat G Ox, GES 


(2) 


=eE;t ec eirt; B i 
where 
E;= — (ô%/ 3x) — (3A i/ 4), 
B= €xim(OA m/ 0x1) 
are the electromagnetic-field vectors and eij is the usual 
Levi-Civita three-index symbol, zero if any two of the 


2W.K. Clifford, Am. J. Math. 1, 350 (1878). ae 
4B. L. van der Waerden, Die Gruppentheorelische Methode in der — 


Quantenmechanik (Verlag Julius Springer, Berlin, Germany, 1932),  - ‘ 


as 
p 5 Lower case Latin indices range from 1 to 3; lower case Greek 
from 1 to 4. Indices repeated im a product are to be summed. 
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indices are equal and plus or minus one, according to 
whether ijk is an even or an odd permutation of 123. 
These equations can be cast into a Hamiltonian form 
in the usual way. The mementa conjugate to the 
coordinates are 


p= aL/ Ox; 
= (1-0-8 pt) “tnt + eTA (3) 


and the Hamiltonian is 


H(x,p)=pai—L 
= cle (pij ec j) (pj;—ec 14 j) +e. (4) 


The nonrelativistic approximation is valid when the 
second term in the brackets is small compared to the 
first. If an expansion of the square root is made, and 
only the first two terms are retained the result is 


H(x,p)=me?-+ (2m) (pj—ecA 3) (pj—ec 1A) +e®, (5) 


as expected. 

The usual procedure for setting up the quantum me- 
chanics of a particle is to introduce a wave function 
y(x,t) satisfying the equation, 


H (x, —ihd/dx)p=ihdy/ dt, (6) 


where 4 is Planck’s constant divided by 2r. This 
procedure leads to 


ho e ho e i 
[C2003 
1 Ox; ¢ 1 Ox; c 


zo @ 
+ H aly=o. (7) 


1¢ OL c 


Dirac! regards this equation as unsatisfactory from a 
relativistic point of view because of the dissymmetry 
between the time and space coordinates. However, he 
points out that in the field-free case 


ae 7} no 
e {| meet | +>- ly=o, (8) 
ðx;ðx;l ic dt 


and solutions of this equation also satisfy the equation, 


e OP 


(9) 


G i? o? 
[nete | =0, 


Ox ;0% j 
as is seen by operating on Eq. (8) with 


Z P nha 
[2 
GERGEF ic Ot 


© 


— |ne- h? 


Equation (9) is of .a relativistically invariant form 
although Eq. (8) is not. The solutions of the equation, 


[omc+a;(—iha/ 3x) + (iho / a) W =0, (10) 


also satisfy Eq. (9) if 8 and the a; are operators which 
are independent of the coordinates and the time and 


e 


GOOD, 


JR. 


which satisfy 


aja. ,0;= 2652, (11) 
a,8-+Ba;=0, (12) 
w=. (13) 


(Here 5;, is the usual Kronecker delta, one when the 
indices are the same, otherwise zero.) This statement is 
easily verified by operating on Eq. (10) with 


—Bmc—a;(—ihd/dx;)+c7 (ihd/ dl). 


The end result of these considerations is Eq. (10), again 
of first order in the time-differentiation but now with 
the space and time coordinates on an equal footing. 
However the wave function must be enlarged to depend 
on more than space and time in order to accommodate 
the operators a;, 8. In Eq. (10) and in the rest of the 
paper y is given the meaning of a column matrix, each 
element of which is a function of space and time, and 
aj, 8 are square matrices which operate on y to produce a 
new column matrix. In the next*section it is shown that 
four rows in the column are needed to make the above 
equations sensible. Dirac thus proposes Eq. (10) to 
govern the motion of a free particle. Written in the form 
of Eq. (6) it is 


[—Bmce—ca;(—1hd/0x;) W=ihdy/dt. (14) 


The generalization to the case when fields are present is 
made, as suggested by Eqs. (3) and (4), by decreasing 
the momenta by ec ™!A ; and the Hamiltonian by e$. The 
result is that 


Hpw=ihdy/ dt, (15) 
where the Dirac Hamiltonian is given by 
Hp=—Bmce—caj(—ihd/dx;)—ecAc|teb. (16) 


The Dirac equation can be written very neatly by 
using a relativistic notation. Combining Eqs. (15) and 
(16) and introducing 


x4=1C¢l, 


A 4= 1®, 
one finds that 


|; ð i )+( ð ite mc (17) 
ia; —-——A; ———A, }—s6— =o. 
á ðx; he ; 0x4 Åc :) h p 


This is evidently to be simplified by multiplying on the 
left by —6 and by introducing 


i= —1Ba;, (18) 
y= —B * (19) 
so that the equation becomes 
( ð i ) pE a (20) 
On Te u YW oR Ad - 


As a consequence of Eqs. (11)-(13), the y, satisfy the 
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equations 

YY rt VAN = 28 yy, (21) 
Matrices which satisfy these relations are called Dirac 
matrices. 

The matrices y, in Eq. (20) will always be chosen 
Hermitian, in the sense that the complex conjugate is 
equal to the transpose. It is seen from Eqs. (18) and (19) 
that if a; 6 are Hermitian, then the Yu are also, for® 


y= (— Ba)" asp! = iaB= ipay, (22) 
y” =(—p)/¥=—Ba=y,, (23) 


The converse of this statement can be verified easily in 
the same way. As a consequence of choosing the y, 
Hermitian, one can set up a row-matrix equation 
complementary to the column matrix Eq. (20). The 
Hermitian conjugate of Eq. (20) is 


ð ie ð te 
ia (La tadve 
Ox; “hc Ox, he 


ə 


mce 


+—y"=0, (24) 
h 


where the nonuniformity in the space and time parts 
comes about because x, and A, are pure imaginary. 
From Eq. (21) it is seen that y4 anticommutes with each 
of the y;. Accordingly, multiplying on the right with 
— y4 and introducing what is called the adjoint wave 
function by 


payin, (25) 
one obtains 
ð te mc 
(=a „Juin v=o, (26) 
DAGE fic h 


ordinarily called the adjoint equation. The conservation 
of probability equation follows immediately by multi- 
plying Eq. (20) on the left by icy, multiplying Eq. (26) 
on the right by ia, and adding: 


A (ily) /Ox,=0. (27) 
This equation, together with the fact (discussed in 


Sec. VI) that ic), is a vector with respect to Lorentz 
transformations, permits 


payday 
to be interpreted as the probability density and 
inbty = — Vian 


to be interpreted as the probability current. 
As an example of Dirac matrices, one set can be built 


é'Superscripts 7, €, # are used to designate the tenpe oa 
plex conjugated, and Hermitian conjugated matr ees ee ees 
wave functions y are treated strictly as nels Ser 2i T R 
one column. For example, if the gon ano me the BAG ee 
reading from the top down, then Leu 
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up from the two-by-two Pauli spin matrices, 


0 f 0 —i 1 0 
a -( ) ”=( } ”n=( J (28) 
1 0 t a0 0 -=i 


which are Hermitian and satisfy the equations 


ojo p= iejo r} sj (29) 
so that 
Oj eto po j= 26 jx. (30) 


As a consequence of Eq. (30) it is seen that the four-by- 
four matrices, 


0 a; 1 0 
«=( ). s-( ) (31) 
a; 0 0 -1 


where here the 0’s and 1’s represent the zero and unit 
two-by-two matrices, fulfil Eqs. (11) to (13). Also they 
are evidently Hermitian since the o; are Hermitian. 

For some of the arguments in the following sections 
it is useful to keep the nonrelativistic limit of the Dirac 
equation in mind and so an outline of how the limit may 
be obtained is given next. In discussions involving this 
limit the matrices in Eq. (31) are especially appropriate. 
As suggested by the form of the matrices, one may 
write the four-rowed column matrix y in terms of two 
two-rowed column matrices Ys, wr so that 


Vs 3 
y= ( ) exp(mct/th). (32) 


YL 


The exponential factor is added also to remove the rest- 
mass contribution to the wave function. Then Eq. (15) 
may be rewritten as two equations: 
—2mc4ps— co; (—iħhð/ðx)— eA; Wr 

+ebys—ihdys/dl=0, (33) 
—co[(—ihð/ðx)— eA; Ws 

t+eby,—hop,/dt=0. (34) 
The nonrelativistic approximation applies when the last 


two terms in Eq. (33) can be neglected compared to the 
—2mcrs term. Then, introducing the abbreviation 


TJE (—ihð/ðx;)— eA j (35) 
one can solve Eq. (33) for Ys: 
Ws= — (2me)o gr pr. (36) 


Since the ys are of order o~! compared to the wz, the Ys 
are called the small components of y and the Wz, the 
large. This value of ys can now be inserted into Eq. (34) 
to obtain an equation involving the large components 
only: 


(2m) o px jo em i eb = thd 1/ ote (37) 
The properties of the Pauli matrices, Eqs. (29), lead to 


the following simplification of the operator in the first 


~ 
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term: 


OjO KM AT K= 1 ;K 10 MT EA OT Te 
= piej, jm port hierin jort rjrj 
ie 
= hielt lort TT, 
where [j,7;.] indicates the commutator of r; and 7x, 
13 .— 7 4.7 ;. From the definition, Eq. (35), this is readily 
found to be 


Lrj,7.J= iehc (0A x/0x;)— (0A ;/Oxx) ]- (38) 
The result for the operator is 


O50 KT STK — Zeho (0A r/ 9%) — (dA 3/ 9x) +r 
=— ehe orei ðA x/ 9x) HTT 
= —eheo BiH TT. (39) 
The final equation for the large component is found by 
substituting Eq. (39) into Eq. (37) and by using Eq. 
(35) to remove the 7; abbreviations: 


1/nh od e ho e 
aCe C4) 
2m\i dx; c 0 Oh G 


eh OWL 
— (= ) oiBr+et ys =in—. (40) 
2mc ðt 


This is of course the Schrödinger equation increased by 
the Pauli spin contribution. The nonrelativistic expres- 
sion for the probability density results when the 
products of the small components are discarded com- 
pared to the products of the large components: 


Vb = 0s bstyL L 
= Vip. 
The first step in finding the corresponding expression for 


the probability current is to use Eq. (36) to eliminate 
the small components, 


=Y ap = — cops oprop 
= (Qm)7 (er orc biti o jo. Wr |, 
and Eg. (35) to remove the r+ abbreviations, 
— oy ap = (2m) [ih (3p /ðxr)oro pr 
—ec A pioro pr ihht ðY Axx) 
—er A projo]. 
The terms involving Á+ are easily simplified by Eq. 


(30); the other terms are reduced with the help of 
Eq. (29). The zesult is evidently 


(41) 


h ð 
H—ejrr (Yr opr). (42) 
2m Ox, 


The first three are the usual nonrelativistic current 
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terms, and the last is a contribution from the spin. 
A more complete discussion of this transition to 
the nonrelativistic limit is given by Pauli.” A differ- 
ent way of studying the limiting process, which has 
certain definite advantages, has been developed by 
Foldy and Wouthuysen.® 


Ill. THE FUNDAMENTAL THEOREM 


As outlined in the previous section, Dirac’s theory of 
electrons and positrons leads to the consideration of sets 
of four Hermitian matrices y, satisfying Eqs. (21). The 
Hermitian property provides the adjoint equation and 
the conservation of probability; also it assures the 
reality of the probability density and current. However, 
in some of the subsequent arguments, it is convenient to 
consider non-Hermitian sets of matrices satisfying Eqs. 
(21) ; accordingly in this section the matrices will not be 
assumed Hermitian. A basic property of these matrices 
is that any two sets of four-by-four matrices, each of 
which satisfy Eqs. (21), are connected by a similarity 
transformation. In other words, if y, and y,’ are two 
sets of four four-by-four matrices such that 


WHY n= 28» (43) 
and 


Yn A tY Y ia a 25, » (44) 


then there exists a nonsingular four-by-four matrix S, 
such that 


Yr =SY S. (45) 


Pauli? has called this the fundamental theorem of the 
Dirac matrices and has given the simple algebraic proof 
which is reviewed below. 

In making the proof the number of matrices considered 
is increased from the four y, to the following sixteen: 


1 

Ya Y2 V3 Ya 

iyoy3 iysyı tyıyz iyya iya iyya 
iY1Y2Y3 
YYY 3Y 4 


iyya iy3Y 1Y4 TY 2Y3Y4 


where 1 refers to the unit four-by-four matrix. The 
factors of 7 in the third and fourth rows are chosen so 
that the square of each of these matrices is the unit 
matrix’: 


me= ilk (46) 


This is easily verified in each case since, according to 
Eq. (43), the square of each y, is one, and the yu 
anticommute with each other. If the y, happen to be 
Hermitian, it is seen that the y4 are Hermitian also. 
Occasionally the matrices in the third and fourth rows 


7W. Pauli, Handbuch der Physik, H. Geiger and K. Scheel, 
editors (Verlag Julius Springer, Berlin, Germany, 1933), second 
edition, Vol. 24, Part 1, p. 236. 
s L. L. Foldy and S. A. Wouthuysen, Phys. Rev. 78, 29 (1950). 
Capital Latin indices are used to range from 1 to 16 so that y4 
represents any of the sixteen matrices displayed above. 
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are represented by the symbols 
where the brackets indicate that the i 
and that their order is immaterial, 
often used is 


Y {uy and Vihar) 
ndices are unequal 

An abbreviation 
V= 11121314. (47) 
This is appropriate because ys has the same properties 
as the four 74: 


> s=1, (48) 


(49) 

As a matter of convenience the proof of the theorem 
will be broken up into a series of eight steps. The first 
five steps deal with properties of just one set of matrices 


y, and the last three with connections between two sets 
of Dirac matrices y, and y’. 


Veet YuYs= 0. 


1. For each y4 except the identity, one can choose a 
yp such that!’ 


(50) 
: 2 NSR 
A proof of this statement can be made by considering 
each type of y separately. It is easily seen that Eq. (50) 
is fulfilled when ya =y, by yg=ys, when ya = Yin by 
YyYB=Y%Ym When ya=Ypuy by YgB=ys, and when ya= ys 


by yB =Ys. k 
2. The spurs of all the y4 except the identity are zero: 


Sp(ya)=0 if yal. (51) 


This is easily seen by taking the spur of Eq. (50). Since 
the spur of the product of two matrices is independent 
of the order of the factors, and since yz’ is 1 the left side 
becomes simply Sp(y.) while the right is —Sp(ya). 

3. The y4 are linearly independent. In other words 


the equation, 
A 


ag YBYAYB>= —YA- 


16 
>, aaya =Ù, (52) 


A=l 


where the aa are complex numbers, only holds when the 
aa are all zero. To show that, for example, ap is zero, one 
multiplies Eq. (52) through by ys to obtain 


apt aes daysya=0. (53) 


With the help of Eqs. (43) a product yay can always be 
reduced to the form byc where b is some complex 
number. By considering the various types of products it 
is easily seen that if A%B, Yc will not be the identity. 
Accordingly, all the matrices in the sum in Eq. (53) 
have zero spur and, by taking the spur of the equation, 
one concludes that ag is zero. A consequence of this 
proof is that the sixteen ya are all distinct. In this re- 
spect the properties of the Dirac matrices, are different 
from those of the Pauli matrices oj even though they 
satisfy similar sets of equations, Eqs. (43) and Eqs. (30), 
for there are connections like 
o12=103 


10 The convention of summing on eas indices is not used 
when capital Latin indices are Involves: 
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between the Pauli matrices. Also at this point it is seen 
that a set of matrices satisfying Eqs. (43) must be at 
least four-by-four in order to generate a set of sixteen 
linearly independent matrices, for the above arguments 
hold regardless of the size of the matrices. 
4. An arbitrary four-by-four matrix X can be written 
as a linear combination of the sixteen ya matrices: 
15 
X= > TAYA. (54) 
A=l 
This statement is obviously true since a set of sixteen 
linearly independent vectors form a basis for a space of 
sixteen dimensions. In any special case the expansion 
coefficients xg will be given by the formula 


«a= Sp(yeX), (55) 


as is seen by multiplying Eq. (54) by yz and taking the 
spur. 

5. A four-by-four matrix which commutes with each 
of the y, is a multiple of the unit matrix. Let X be the 
matrix in question; as argued in step 4, one may write it 
as a linear combination of the y4. Equation (54) may be 
written in the form 


(56) 
where vz is one of the sixteen matrices and is chosen 
arbitrarily except that it is not the identity. The asser- 


tion will be proven if it can be demonstrated that xg is 0. 
According to step 1, a matrix yc can be chosen so that 


(57) 
By hypothesis X commutes with all of the 7, and 
therefore with yc so that 

X=cXye. (58) 


In terms of the expansion coefficients of Eq. (56), Eq. 
(58) reads 


X=xryr HIL ABAYA 


YOY BY O= = YB 


XBY BE QOAMAB)LAY A= XL BY CY BY CH DL ABAY CV AVE, 
and this can be written in the form 
XL BY BT LAB) LAV A= — XB B+ Lae (+1) Kaya, 


as a consequence of Eq. (57) and the fact that yc and ya 
either commute or anticommute. If finally this equation 
is multiplied by yg and the spur is taken, the result is 


xrB=— Xg 


so that xg is zero as required. In summary, if 


Xy.= Vax 
then 
X=k, 


where k isa complex number times the unit matrix. This 
result, a special consequence of Schur’s lemma," will 
often be used below. 

u See, for example, Hermann Wevl, The Theory of Groups and 
Quantum M Skane (Dover Publications, New York, 1931), 
p. 153. 


= 
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6. If y, and y,’ are two sets of matrices satisfying 
Eqs. (43) and (44) and if ya and y4’ are sets of sixteen 
matrices formed in parallel from them according to the 
definitions at the beginning of this section, then 


ya S=Sya, (59) 
where . 
16 
S=} ys'Fyn, (60) 
B=1 


and F is any four-by-four matrix. (The last two steps in 
the proof of the theorem consist of showing that F can 
be chosen so that S is nonsingular. Pauli attributes this 
method of proof to Schur.) This assertion is easily 
proven from a consideration of the matrix 


16 
ya SYa= D yaya Fy pya- (61) 
B=) 


The product yaya can always be simplified so that 


YBYA= ECYC, (62) 


where eç has one of the four values +1, +t. For arbi- 
trary ya, as B ranges from 1 to 16, C must range also 
from 1 to 16 since if both 


YBYA=€cYc 
and 


yoya=dceye 


for two different matrices yg and yp, then 


ye=ecycya= (€c/6c)yp, 


and this would contradict the linear independence 
demonstrated in step 3. Furthermore, the ec in Eq. (62) 
are determined by the definitions of the y4 in terms of 
the y, and by the anticommutation rules, Eqs. (43). 
Since these definitions are made in parallel for the primed 
matrices and since the primed matrices satisfy the same 
anticommutation rules, Eqs. (44), the same numbers 
will arise in the primed system: 


ye YA =€cyc'. 
The product y4’7z’ is conveniently found by taking the 
inverse of this last equation: 
yaya = (1/ec)yc’. (63) 


Introducing Eqs. (62) and (63) into Eq. (61) and 
changing the sum-index from B to C one finds 
a a 
16 


ya'Sya= 2 (1/ec)ve'Fecye 


c=1 
- 16 
=) yc Fyre 
- c=1 
=, 
12 Reference 2, pp. 110, 115; I.*Schur, Berliner Sitzber., 406 
- (1905). 3 
rae oe 
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and obtains Eq. (59) by multiplying from the right 
with ya. 

7. The matrix F can be chosen so that S is not zero, 
If S were zero for all F, the equations 


B=1 


Xr wlrae=0 (64) 


could be constructed from Eq. (60) by choosing F 
successively to be the various matrices which have one 
element unity and the other fifteen zero. Here, (yp’),, 
are the elements of the matrix yz’. Equations (64) imply 
the matrix equation 


16 
D (vs) wye=0. 
B=1 


The (ya’)y» are not all zero since yg”=1, so this equa- 
tion is incompatible with the linear independence of the 
ypg. It must be concluded that some matrices F exist for 
which S, defined by Eq. (60), is not zero. 

8. The matrix F can be chosen so that S is non- 
singular. A proof of this statement can be made easily 
with the help of a matrix T defined by 


16 
T= vyaGys’, (65) 
Bal 


where G is a four-by-four matrix to be chosen below. 
The matrix F in the definition of S, 


16 
S=} ys FYB, (66) 
Bal 
will also be left arbitrary for the moment. The argument 


of step 6 with the primed and unprimed matrices 
interchanged can be applied to the T matrix so that 


yal =T ya", (67) 
and step 6 applied directly to S gives 
ya S=Sya. (68) 


However, from Eqs. (67) and (68) together it may be 
concluded that 


yaTS=Tya'S=TSya 
so that, as a consequence of step 5, 
TS=k, (69) 


where k is a complex number times the unit matrix. At 
this point the matrix G will be chosen so that T is not 
zero; this was demonstrated to be possible in step 7. 
Then the matrix F will be,chosen so that k is not zero. 
To see that this also is possible one notices that, if k is 
zero for every choice of F, Eqs. (69) and (66) imply that 


16 
2 Tys'Fyg=0. 


B=) 


Therefore, if F is chosen successively to be the matrices 
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with one element unity and the rest zero, one finds 


16 
(Dre!) ¢=0. 


Written as a matrix equation in the yz this becomes 


16 


pa (Typ) uy p=0. 
B= * 


Not all of the (Tyz’),» are zero since T was chosen 
previously not zero and yp’ includes the identity in its 
range. The assumption that & is zero for all F is then in 
contradiction with the proven linear independence of 
the yz. It must then be possible to choose F and G so 
that Eqs. (68) and (69) hold, where & is not zero. 
Accordingly, there must exist a matrix S, nonsingular, 
such that 


a= SYS. (70) 


This completes the prapf of the theorem. 

The matrix S which connects two sets of Dirac 
matrices according to Eq. (70) is uniquely determined 
by the two sets except for an arbitrary numerical factor. 
In order to prove this suppose that y, and y,’ are two 
definite sets of Dirac matrices and suppose also that 
there are two matrices S; and Sz such that 


Ye = SIY, 
Ye — SYST". 


If y,’ is eliminated from these two equations, it is seen 
that 

SYST i SYST, 

SS yy p= YSS. 


° . . 
Here again, step 5 can be applied; since StS, com- 
mutes with each of the y,, it must be a number, say 
k, times the unit matrix so that 


Si=kS2, 


and this proves the assertion. Occasionally it is con- 
venient to impose the extra condition 


detS=1. 


Evidently this condition can always be fulfilled by an 
appropriate choice of the numerical factor k. The effect 
of this condition is that S is determined except for an 
arbitrary factor of +1, +1. j 

As an example, the operation of taking the transpose 
direct application of the funda- 


of a matrix provides a : i 
bea set of Dirac matrices so that 


mental theorem. Let yu 


nonsingular matrix B such that 

Yu = Bye A (71) 
This matrix is of special interest in some of the sections 
below. An interesting property of it is that it is anti- 
symmetric. Still following Pauli,? one can demonstrate 


this by first of all taking the transpose of Eq. (71) and 
then substituting Eq. (71) into the result; 


a= (B)Ty,7 BP 
= (BV By, BOB 
= (BBT) 'y,(B-1B7). 
This equation can then be multiplied from the left by 
(B-1BT); the result is that (B-1BT) commutes with the 


7, and so, according to step 5 of the proof of the funda- 
mental theorem, is a constant times the identity: 


BoBT =k, 
However, the equation 
BT=kB (72) 
implies that 
B=kB?=hB 


so that & is +1; this establishes that B is either sym- 
metric or antisymmetric. The possibility that B is 
symmetric can be ruled out by means of an argument 
which Pauli credits to Haantjes. In this argument one 
counts which of the By. are symmetric and which are 
antisymmetric. On rewriting Eq. (71) in the form 


Ya B= Bry, (73) 

it is easily seen that 
(B1)7=k(B1), (74) 
(By)? =kyy™B=k( By), (75) 
(iBrywy»)? = kiy Ya B= —RUBy wy»), (76) 


(Byxy err)? = kiy yu yx? B= —k(i Byrta) (17) 
(Byiyeyaya) = kysT ys ye ys B= k(Byvyersrs), (78) 


where Eq. (72) has been used to eliminate B7, and 
à, #, v are all unequal. If B is symmetric so that k is +1 
it is seen that there are ten antisymmetric matrices 
Bytu and Byjyuy. These ten matrices must be linearly 
independent since the yta») and Ypa» are linearly inde- 
pendent and B has an inverse. This is an impossibility ; 
there can only be six linearly-independent antisym- 
metric four-by-four matrices. It must be concluded that 
k is —1 and the B matrix is antisymmetric. It is also 
pertinent to consider the connection between the B 
matrices defined for different sets of y,. That is, if 


YY FY Yu= 28x Ya” = By, B, (79) 
BY x ze: 
k 4 IT = Bly B^! 80) 
The transpose of this equation 1s ) Yu e : (80) 
TT TIS define the matrices B, B’ while 
Yr Yu +p Yr = Pn aes dis 
so that the y,7 also form a set of Dirac matrices. Ac- WOT 
cording to ane fundamental theorem there must exist a ' Reference 2, p. 121. 3 
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gives the connection between the y,’ and the yp, one 
may ask for the corresponding connection between B’ 
and B. This connection can be found by substituting 
Eq. (45) into Eq. (80) to eliminate the y,’, 


(S=) Ya S T= B'Sy SB’, 


then substituting Eq. (79) on the left to eliminate 
the y,7, 

(S YT BY B STB SYS BT, 
and then multiplying from the left and right by the 
proper.quantities to cast the equation into the form 


SB’ (S=)TByu= yS 7B- (S7)TB. 


The matrix which here commutes with the y, must be a 
number, say &, times the unit matrix so that 


So Bo (S=) TR= k, 
or 


B=kS™B'S, (81) 


which is the required connection. A final property of the 
B matrix is that, if the y, are Hermitian, B can be 
chosen unitary. This is easily seen by taking the 
Hermitian conjugate of the defining equation, Eq. (71), 
to obtain, when y,”7=7,, 


Ya = (BO) yB", 
and by recombining this equation with Eq. (71) so that 


BUBy,=7,B"B, 
which implies that 
BEB=hk, 


where & is some number. The number 2 is easily shown 
to be real and positive by writing in detail one of the 
diagonal elements of B¥#B. Accordingly, by proper 
choice of the arbitrary factor in B, B¥B can be adjusted 
until it is the unit matrix and B is unitary. In the 
applications below it is always assumed that this adjust- 
ment has been made so that B is unitary; it is clear that 
then B is determined uniquely except for an arbitrary 
factor whose absolute value is one. As a concrete ex- 
ample, for the set of Dirac matrices y, defined by Eas. 
(18), (19), and (31), the antisymmetric unitary matrix 


02 0 
eae) 
0 oe 
satisfies the defining equation, Eq. (71). This can be 
seen by expressing the y, explicitly in terms of the c;, 


) =o, = @ 
v= (_ ), v=( ), 
az 0 G a 


and noticing that, as a consequence of the definitions of 
the Pauli matrices Eq. (28), yı and yz are antisymmetric 
while y2 and y4 are symmetric. The matrix of Eq. (82) 
evidently anticommutes with yı and ys but commutes 


(82) 


with y2 and y4 and so it does give a solution of Eq. (71). 


GOOD, 
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In Sec. VII examples of the form of the B matrix for 
other special sets of Dirac matrices are given. 


IV. LORENTZ TRANSFORMATIONS OF DIRAC 
WAVE FUNCTIONS 


In this section the Lorentz transformation properties 
of the four-component Dirac wave functions introduced 
in Sec. II are reviewed. A Lorentz transfofmation is 
defined as a coordinate transformation of the type 


1 . 
Vp = lpr 


(83) 


where the transformation coefficients satisfy the orthogo- 
nality relations 

Aprl 5 Ove (84) 
or, equivalently, 

yp Ay = Ove, 


(85) 


and also satisfy the conditions of reality: ajz, i@4;, iajs, 
as, are real. It is well known that the group of Lorentz 
transformations consists of two_types of reflections as 
well as the transformations which can be formed con- 
tinuously from the identity. The first type arises in 
considering the determinant of the transformation 
coefficients @,,. If A is the square four-by-four matrix 
formed from these coefficients, Eq. (84) can be written 
in the form 
ATA=1, 


and the determinant of this equation gives 


det(A7A) = (detA?) (det A) 
= (det A)? 
=1, 
so that 


detA = 41. (86) 


Evidently, the plus sign will apply for transformations 
continuous with the identity. However, as well as these, 
transformations in which the determinant is —1 also 
must be taken into account. The other type of reflection 
arises in considering the sign of the transformation 
coefficient a44. When both v and ¢ are 4, Eq. (84) reads 


au taxta Hagl. 
Since, according to the conditions of reality, the aj4 are 
pure imaginary, 
a;2=—|aj4|?, 
where the vertical bars indicate the absolute value, and 
therefore 


as? =1+ |ar|? |a|? |as]? 
In consequence there are two possibilities for 444: 


either au> 1 or UNOS —1. (87) 


The first alternative applies to transformations continu- 
ous with the identity, but transformations in which a4 
is negative also must be taken into account. It is not 
convenient below to treat the Lorentz transformations 
entirely uniformly; the Lorentz group will on occasion 


ee 


a 


| Satosi Watanabe, Phys. Rev- 
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be regarded as consisting of the transformations continu- 
ous with the identity, the space reflection 


afm, xd = 24, (88) 
which has detA = —1, ay,=1, and the time reflection 

pI 7 

Gen Le =— 2s, (89) 


which has’detA = —1, a4,= —1, 

There are three considerations involved in assigning 
the transformation properties of the Dirac wave func. 
tion y. First of all the Dirac equation, Eq. (20), should 
be covariant with respect to Lorentz transformations; 
this requirement is discussed in the present section. The 
other considerations arise when the subjects of charge 
conjugation and the formation of covariants quad- 
ratically from wave functions are treated; they are 
discussed in Secs. V and VI. The Dirac equation is to be 
covariant in the sense that, when the Lorentz trans- 
formation of Eq. (83) is applied, the equation 


0 ié mc 
(>a) (90) 
Oxa 1G h 
leads to the equation 
0 te mC 
(S -Zar jwt v=o (91) 
Ox, he h 
The operator (0/dx,) is a vector, 
(0/dx,4') =a,,(0/dx,), (92) 
but, following Watanabe’s assignments, the four- 


vector potential transformation rule has an extra sign 
change whenever there is a time-reflection: 


“ u(x’) = (@44/ |as |)ayrA »(x). 


Because of this nonuniformity it is convenient to post- 
pone the discussion of time-reflections and give them 
separate consideration later. For transformations in 
which a44 is positive the covariance can be obtained by 
the transformation 

(94) 


Vi e)=A4(2), 


provided the matrix A is appropriately chosen. It is 
assumed that A has an inverse. In view of the arbi- 
trariness of the Lorentz transformation under discus- 
sion, the covariance can be demonstrated by showing 
that Eq. (90) follows from Eq. (91) rather than vice 
versa. Substituting Eqs. (92), (93), (94) into Eq. (91) 
and operating from the left with A“, one finds 


. me 
(a ,) apiy Ayt Y=, 


Ox, he 


(93) 


d if A can be chosen so that 


Gp AYA = Y7- C5 


i 26 (1955). See also 
M Satosi Watanabe, Revs. Modern Sst) 


so the covariance is establishe 
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The existence of a matrix A satisfying this condition is 
easily shown in the following way. As a consequence of 
the orthogonality relations, Eq. (85), the matrices +,’ 
defined by 

Yu = lap p (96) 
satisfy the anticommutation rules 


Yu Yr FY Ya = apport Lye alu p 
= Oy plo (YoYo FYT) 
= 2l ppl vað po 
= 2a ppűvo 
= 2b 
and so, according to the fundamental theorem, a matrix 
A such that 


(97) 


can be found. This is just the matrix required to satisfy 
Eq. (95) since Eqs. (96) and (97), together with the 
orthogonality conditions Eqs. (84), yield 


Yp =A YA 


ap ATY A= ly We 
= laly po 
=ô, Yp 
= Yy. 


They,’ defined in Eq. (96) are not necessarily Hermitian 
even though the y, are Hermitian because the a,, are 
not necessarily real; it is important here that the 
fundamental theorem holds between sets of matrices 
which need not be Hermitian. In summary, the covari- 
ance of Eq. (90) with respect to Lorentz transformations 
which preserve the direction of the time is obtained if 
the wave function transforms according to 


W(x’) =Ayp(x), (98) 


where the matrix A is connected with the Lorentz 
transformation coefficients by 


(99) 


According to the discussion in the paragraph following 
Eq. (70), this defines A except for an arbitrary numerical 
factor. 

When there is a time-reflection, as,< — 1, the covari- 
ance requires a different treatment than in the previous 
paragraph because of the extra sign change of the four- 
potential in Eq. (93). As a first step in compensating for 
the extra sign change, one takes the complex conjugate 
of Eq. (91) because this operation also changes the sign 
of the four-vector potential relative to the gradient 
operator: 


pY FA yA. 


ð ie 
Sy, /) yoye 


ox; he 


> 


ð ie me prr 
- (Star )reve+ yeo. (00) 


Ox. “if 
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Here the difference in sign between the first two terms 
arises because x,’ and A,’ are pure imaginary. From the 
form of this equation one is led to introduce a matrix C 
such that i 


i= CysC>, (101) 
yat = —CysC (102) 
because then, in terms of ¢’ defined by 
Yela) =C a"), (103) 
and after multiplying by C~!, Eq. (100) becomes 
ð ie mc 
( ar =l suw =: (104) 
Ox,’ he h 


The fundamental theorem ensures the existence of such 
a matrix C, for the y, are a set of Dirac matrices 
satisfying the anticommutation rules Eq. (43) and, by 
taking the complex conjugate of Eq. (43), it is seen that 
73°, — y4! satisfy a similar set of equations. In fact it is 
easily verified that the matrix 


C=Bysys (105) 


satisfies Eqs. (101) and (102), as a consequence of the 
defining equation for B, Eq. (71), and of the fact that 
the y, are Hermitian. The final step is to transform Eq. 
(104) to the unprimed quantities according to Eqs. (92) 
and (93), keeping in mind that a44< — 1. By comparing 
this with the non-time-reflection case discussed above it 
is obvious that if 

$'(x')=Ay(x), (106) 
where 


aup Yo =A yA, (99) 


then Eq. (104) becomes 


D te mc 
(—- —A „w+ == 0, 
OX, tic h 
as required. The complete wave-function transformation 
is found by combining Eqs. (103) and (106). The Dirac 
equation is covariant with respect to time reflections if 
the wave function transforms according to 


¥'°(x!)=CAY (x), (107) 


where A is chosen to satisfy Eq. (99), the same as for 
non-time-reflections. (One could, of course, perform the 
two steps above in the opposite order, first transforming 
to the unprimed quantities and then making the 
transformation with the @ matrix. This leads to the 
alternate transformation rule 


We (x) =ACCH(x). (108) 


In the next section it is argued that only transformations 
such that 

CA=A°C 
should be considered; in that case Eqs. (107) and (108) 
are equivalent.) e 


e 


e 
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Regardless of whether it is a time reflection or not, 
when the Lorentz transformation 


One 
is applied, a matrix A such that 


auo Yp = A'YA (109) 


must be found before the transformation of the wave 
function is known. It would be’a complicated affair to 
write down A explicitly in the general case, but for the 
special cases of two-dimensional rotations and the space 
and time reflections of Eqs. (88) and (89) the matrix A 
has a simple form, as exhibited below. In a sense the 
formulas below give A in the general case because any 
Lorentz transformation can be expressed as a sequence 
of reflections and two-dimensional rotations. To begin 
with, the matrix 


A=cos(w/2)+y,75 sin (w/2) (110) 


(where uv) corresponds to a rotation in the uv plane; 
the parameter w measures the rotation angle. The 
inverse of this matrix is 


A= cos(w/2)— YY» sin (w/2), 


as is easily verified by direct multiplication. When p= 1, 
v=2 for example, it is easily seen that 


Ay,;A=cosw yi+sinw ye, 
AIYA = — sinw yi+cosw Y2, 
Ay: = y3, 

Ayd = y4. 


By comparison with Eq. (109) this is found to corre- 
spond to the Lorentz transformation 


xy =cosw a1-+sinw x2, 


x = —sinw x4;+cosw x2, 
wl 

T3 = %X3, 

ahaa 

Ta = Y4, 


which is a space rotation through an angle w about the 
xz axis. A parallel result is obtained for the case u= 3, 
v=4 with the difference that w must be pure imaginary 
to satisfy the conditions of reality. With the change to 
the parameter v such that 


sinw=icy(1— cr") “}, 


cosw= (1—c-2n?)-}, can) 
the transformation can be written in the form 
x=, 
ts (112) 


xa = (a3— vi) (1—c-*2®)-3, 
t= (t— vr) (1—0), 
which is the usual form for the Lorentz transformation - 


between two parallel coordinate systems with relative 
velocity vin the x; direction. A more complete discussion 
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of the matrix in Eq. (110) is given by Pauli.! For a space 
reflection, the assignment 


A=iy (113) 


gives, on substitution into Eq. (109), the Lorentz 
transformation coefficients required in Eq. (88). Finally 
it is easily verified that the matrix 


A= 7574 (114) 


corresponds to the time reflection of Eq. (89). From 
Eqs. (107), (105), and (114) combined, it is seen that 
the net wave function transformation, corresponding to 
the time reflection of Eq. (89), is 


Y/°(x') =CAY (x) 
= Bysysysy. RA (x) 
= By (x). (115) 


The arbitrary numerical factors in the matrices A of 
Eqs. (110), (113), and (114) have been chosen in advance 
to satisfy conditions wñich will be imposed in the next 
two sections. It will be seen that these conditions deter- 
mine these matrices except for an arbitrary factor of —1. 
In the case of the transformations continuous with the 
identity this indeterminacy is in keeping with the de- 
pendence of the matrices in Eq. (110) on the half-angle, 
because if w is increased by 2r, the matrix in Eq. (110) 
changes sign but the corresponding Lorentz transforma- 
tion is unaffected. 

The matrix A of Eq. (109) has an especially simple 
form in the special case of a space rotation or reflection, 


1 " sh 
Xj =ljktk, X4 = Y4, 


and when the matrices introduced in Sec. II, Eqs. (18), 
(19), and (31), are used for the y,: 


0 Fa) Cs A 
sa ; ev . (116) 
$ e 0 0 1 


First of all, from the fourth component of Eq. (109), it is 
seen that 
ya =A y4. 
If the equation 
Ays= 4h 


is written in terms of two-by-two matrices using y4 as 
given in Eq. (116), it is seen that A has the form 


As 0 
a=( ), 
0 AL 


where As and Az are two-by-two matrices which have 
yet to be determined and 0 indicates the zero two-by- 
two matrix. The subscripts S and Z are used because As 


and A, are the transformation matrices for the small and 
wave function in the non- 


relativistic limit. The next step is obviously to restate 


18 Reference 7, pp. 222-224. 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


DIRAC MATRICES 197 


the equation for A, Eq. (109), in terms of As and Az. The 
first three of Eqs. (109) become 


0 —io, 
i. 
to i 0 
Ge 0 (G A b 
0 sg) Niog O 0 Al 


where the y; have been replaced according to Eq. (116). 


On multiplying out the right-hand side, one finds that 
the equation will be satisfied provided the equations 


ajro k= Ag TÀ L, (117) 
ajs =A Lt ajAs, a 18) 


are valid. These two equations can be uncoupled by 
using the properties of the Pauli matrices. If Eq. (29) is 
operated on from the left by Az’ and from the right by 
Az, the result is 


Ap ojAsig oA r= ienn oA r-t ijk 


where the factor A sA s~! has been added in the middle of 
the term on the left to facilitate the substitution of 
Eqs. (117) and (118): 


Uji 1 kmOm= ejr oh r+ sk. 


This equation can be considerably simplified by ap- 
plying Eq. (29) again on the left to make the equation 
linear in the o;. It is seen that 


lja km (t€tmnT n+ 41m) = tejzud CoA r-H ije 


The terms independent of the g; cancel as a consequence 
of the orthogonality relations. These relations can also 
be introduced into the first term on the left so that the 
equation becomes 


51 em (Apr pn) €tmrTn= ejk L OWL. 
The reason for this introduction is that now the well- 
known fact 
Ajil kml pr€tmr= (deta) €jkp 


brings the equation to the form 
(deta) €j% på pnOn= ejk LTAL 
so that, finally, 
(deta) @ingn=ArorAr. (119) 


This equation is to be compared with Eq. (109); it is 
seen that the Pauli matrices play a part in the theory of 
three-dimensional orthogonal transformations parallel 
to the Dirac matrices in the theory of four-dimensional 
orthogonal transformations. The connection between As 
and Az is easily found by combining Eqs. (118) and 
(119): 


ojAs= ALO; k 
a (detado;Ax, 
~ 


p. 116. 
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and therefore, since the square of the determinant is one 
and g; is 1, 


As= (deta)A ie (120) 


The problem of finding the four-by-four matrix A is thus 
reduced to the problem of solving Eq. (119) for the two- 
by-two matrix Az. Solutions of Eq. (119) in the case of a 
rotation about one of the coordinate axes are easily 
obtained by substituting the y; from Eq. (116) into Eq. 
(110) and using Eq. (29) to simplify the products of the 
Pauli matrices. For a rotation through an angle w in the 
right-hand sense about the k-axis the result is 


A,=cos(w/2)+ic,. sin(w/2). (121) 


For rotations the matrix Az is just the matrix of the 
Cayley-Klein parameters.!® A matrix Az for the space 
reflection of Eq. (88) is 


(122) 


as may be found by using the y4 of Eq. (116) in Eq. 
(113). 

The covariance of the Dirac equation with respect to 
Lorentz transformations leads uniformly in the non- 
relativistic limit to the covariance of the Schrödinger- 
Pauli equation, Eq. (40), with respect to space rotations 
and reflections, Galilean transformations, and time 
reflection. As discussed in Sec. II the nonrelativistic 
limit of the Dirac equation may be taken by introducing 
the matrices of Eq. (116) and discarding the small 
components of the wave function compared to the large. 
For a space rotation or reflection the results of the 
previous paragraph may be used. The small components 
do not even enter in to the transformation rule for the 
large components. It is easily verified that Eq. (40) is 
covariant with respect to the transformation 


AL=1, 


Xj = jktk, 
=t, 
Aj (x)= ajA x(a), 
z P'(x')=8(x), 
Bj (x’) = (deta)a;.Bx(), 
Yr (x’)=Aryx(a), 
where 
0;%0;1= 6x1, 
(deta)a; poz =ArojAr. 
For the Galilean transformation it is convenient to refer 
to the Lorentz transformation of Eq. (112) which, when 
the terms involving (v/c) can be neglected, reduces to 
x= %1, 
Xo! = %2, 
43 =x — 1l, 
« i 


(123) 


16 See, for example, Herbert Goldstein, Classical Mechanics 
(Addison-Wesley Press, Inc., Cambridge, Massachusetts, 1950), 


Aa 
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The corresponding transformation of the space and time 
derivatives is easily seen to be 


(8/3x;) = (3/ðx;), 
(3/3t')= (0/dl)+0(0/ 0x3). 
The potentials A ;, ib form a four-vector in parallel with 


the coordinates «;, ict; by comparison with Eq. (112) 
one has 


(124) 


A= (A tier c vb) (1 i or), 
p'= (b—c vA 3) (1—c-*e*) “3, 


where the nonrelativistic limit has yet to be taken. In 
Eq. (40) it is seen that an effect of the limiting process 
was to multiply the scalar potential by a factor of ¢ 
relative to the vector potential. Therefore in the 
nonrelativistic limit one must carry the transformation 
rule for the scalar potential to one higher order than for 
the vector potential and write 


Aj =Aj, 


@’ = h— (u/c) Az. (125) 


A minor consequence of the first of these is that 
B= B; (126) 


The matrix A corresponding to the Lorentz transforma- 
tion of Eq. (112) is given by Eq. (110) when u=3, v=4, 
and when the parameter w is given by Eqs. (111). When 
(v/c) is small compared to 1, it is seen that w is just 
i(v/c), A is the identity, and 


V (x)=). 


In view of the defining equation for the large com- 
ponents, Eq. (32), they transform according to 


Yr («’)=W1(a) explLme(t—1')/ih] 
=yŅ L(x) exp[ (mox3—4mvt)/ih], (127) 


where the last of Eqs. (112) has been used to find the 
limiting value of the exponent. One can then easily 
verify by direct differentiation that Eq. (40) is covariant 
with respect to the transformation of Eqs. (124) to (127) 
so the covariance with respect to Galilean transforma- 
tions does come out as a limit of the covariance with 
respect to Lorentz transformations. The final transfor- 
mation to be considered is the time reflection 


(128) 


As required by Eq. (93) the potentials transform ac- 
cording to 


xj/= Xj, =i. 


Aj=—A;, =F, (129) 
and so the rule for the magnetic field is 
B/=—B;. (130) 


Substituting for the B matrix from Eq. (82) into the — 


wave-function transformation rule, Eq. (115), and using 
the definition of the large components, Eq. (32), one 


` outlined, and some of the 
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finds that 
pr (a!) = on r(x). (131) 


In order to demonstrate the covariance, it is convenient 


to start with the complex conjugate of Eq. (40) in the 
primed coordinate system: 


1 ho oe ho G 
BN) 
2m\ idm; c \ idx ¢ 


eh Ope 
= (Eyre we- —th = ; 
ðt 


2mc 


If here the primed quantities are eliminated in favor of 
the unprimed by Eqs. (128) to (131) and if then the 
equation is multiplied through by ø», the result is 


1 hð e hod e 
EON i) 
2m 40%; c 40%; ¢ 


ach OWL 
+( )eesterBrt-ob hma, 
Ot 


2me 


The remark that, as a consequence of the definition of 
the Pauli matrices Eq. (28), 


o20;°o2= —0; 


completes the proof of the covariance with respect to 
time reflection. Time reflection in the nonrelativistic 
limit has been discussed especially by Wigner.!” 

In the treatment of time reflections given above, the 
vector-potential transformation rule was assumed to be 


Ap (x)= (a4s/| a44| aud »(x), (93) 
and this led 40 the wave-function transformation 
¥/¢(a’) =CAY(a), (107) 
where 3 
apoyo = A YÀ. (109) 


If, in contrast to this approach, the vector potential is 
assumed to be a regular vector even with respect to time 
reflections, 
A A (x)= tA o 

then the time reflections will not require a separate 
treatment and 
; ye) =A) (132) 
will hold for all Lorentz transformations. This point of 
view has the advantage that it permits a. uniform 
treatment for all Lorentz transformations, and it has the 
disadvantage that it does not lead to the nonrelativistic 
time-reflection of the preceding paragraph. 


V. CHARGE CONJUGATION 


In this section the basic idea of charge conjugation is 
properties of the operation are 


11 E. Wigner, Göttinger Nachrichten 31, 546 (1932). 


discussed. The effect on the transformation matrix A of 
requiring that charge conjugation be a covariant opera- 
tion is discussed in detail. 

The idea of charge conjugation arises in a further 
consideration of the C-matrix substitution made in the 
treatment of time reflections in the preceding section. 
Referring back to Eqs. (100) to (105), one sees that the 
substitution 


y (x) =CH(x) (133) 
carries the equation 
ð te me 
(- a)y aa 
Gat. OY h 
into the equation 
GC) me 
(+4. no +0, (135) 
Ox, he h 
where the C matrix may be defined by 
C= Brits, (136) 
and has the properties 
VE= Ce (137) 
yaf = — Cy C. (138) 


Before discussing the significance of this substitution, 
some properties of the C matrix and of Eq. (133) will be 
pointed out. Two properties of C are that it is symmetric 
and unitary. These are consequences of the properties of 
the B matrix discussed at the end of Sec. III. The 
symmetry of C follows from the antisymmetry of B: 


CT= (Bysys)” 
= 757747 BT 
=— ysu" B 
=— Bysya 
= Byss i 
=C, (139) 
where Eq. (73) was used to eliminate the transposed 
matrices. The fact that C is unitary follows from the fact 
that B is unitary: 
C¥C= (Bysys)” (Byars) 
= 757 BI Byes © 
=Y5SV4AV476 
=i g (140) 
The function ġ(x) connected with ¥(x) according to 
Eq. (133) is called the charge conjugate wave function 
to ¥(x). Charge conjugation is reciprocal in the sense 
that if @ is charge conjugate to y then y is charge 
conjugate to ¢. This can be seen by multiplying Eq. 
(133) by C¥ to obtain 


CEC (x= C#CO(z). % 
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On the right Eq. (140) may be applied, ånd on the left 
C# can be replaced by C® in view of Eq. (139). Then, on 
taking the complex conjugate, one finds 
p° (x)= (a), 

which proves the assertion. Now that these properties 
have been established, the significance of the charge- 
conjugation operation can be discussed. From Eqs. (134) 
and (135) it is seen that, for a certain set of potentials 
A, (x), if Y(w) is a solution of the Dirac equation for a 
particle with charge e then ¢() is a solution for a 
particle with charge —e. In view of the reciprocal 
property of the charge conjugation this correspondence 
between solutions is one-to-one. The corresponding 
solutions have opposite signs of energy for if 


ihdy/dl=Wy 
then it is seen that 


indo/ di=1hd(C%W*)/ dt 
=—C°(ihdy/ at) 
== Weep? 
=—Wọ. 


This is the basis of Dirac’s theory of the positron!®: the 
correspondence between negative-energy solutions of 
the electron equation and positive-energy solutions of 
the positron equation led him to propose a single theory 
which includes both particles. In it he makes the 
assumption that the negative-energy electron states are 
nearly all filled and that a positron is the absence of an 
electron from one of the negative energy states. This 
theory has the advantage that it explains why electrons 
in positive-energy states do not ordinarily fall to nega- 
tive-energy states; such transitions are forbidden by the 
Pauli exclusion principle as long as the negative-energy 
states are already filled. In many of the modern theories 
the postulate of the infinite number of electrons in 
negative-energy states is avoided by quantizing the 
particle field in such a way that electrons and positrons 
appear on an equal footing from the start.!® 

Especially if Dirac’s theory of the positron is used, it 
is to be assumed that the correspondence between 
positive-energy positron states and negative-energy 
electron states is independent of a particular Lorentz 
frame of reference, so that the charge conjugation 
operation is covariant with respect to Lorentz trans- 
formations. This has been pointed out by Pauli.” To put 
this requirement in symbols, if 


A ¥°(x)=Co(z), (141) 
and if a Lorentz transformation is made so that 
¥ («’)=Ay(x), 
p(x) =Ap (x) 


18 Reference 1, p. 272. 

19 See, for example, W. Heitler, The Quantum Theory of Radiation 
(Oxford University Press, London, England, 1954), third edition, 
Chap. III. 

20 Reference 2, p. 129. e 


(142) 


e 
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when the time is not reflected or so that 
W/o(x!)=CAy(x), 
$'°(x')=CAG(x) | 

otherwise, then it is to be assumed in consequence that 
ye (a!)=Co'(x’). (144) 


This assumption imposes a condition on the trans- 
formation matrix A. Considering a non-time-reflection, 
if one substitutes Eqs. (142) into Eq. (144) the result is 


ASYS(x)=CAG(x), 
and if then Eq. (141) is used on the left, 
A°C$(x)=CAg(x), 
so A should satisfy 


(143) 


ACC=CA. (145) 


The same condition is obtained in a similar way when a 
time-reflection is considered. Given the Lorentz trans- 
formation coefficients a,,, A is determined except for an 
arbitrary numerical factor by Eq. (109): 


Qup¥p=A“y, A. R (146) 


It is easily seen that A must satisfy an equation like 
Eq. (145) as a consequence of the conditions of reality 
on the a,,. The complex conjugate of Eqs. (146) is 


asny r — ajya = (A) CYA, 
— daky kl FH aaya = (AT) OYCAS, 
and these equations can be combined with the help of 
Eqs. (137) and (138) into the form 
Gup Yp t= (A) CYC AS 
Here Eqs. (146) may be reapplied to eliminate the app on 
the left 
CAY, AC = (A) CYC AS, 
and if this equation is multiplied from the left by CA? 
and from the right by CA™ the result is 
CAASCAM yp = Yn NCCA. 


Thus the matrix CA°CA™ commutes with the y, and 
so, according to step 5 of the proof of the fundamental 
theorem, it is a number, say &, times the identity. This 
means that 

ACC=RCA. (147) 


Furthermore by taking the determinant of this last 
equation one sees that the absolute value of & is unity- 
Then from Eq. (147) it is clear that k can be adjusted to 
unity by proper choice of the argument of the arbitrary 
complex number in A. One can thus obtain the covari- 


ance of the charge conjugation operation in this simple - 


way. In what follows it will be supposed that this 
adjustment has been made so that Eq. (145) does apply 
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to A. It is clear that Eqs. (145) and (146) determine A 
except for a real (positive or negative) numerical factor. 

A consequence of Eq. (145) is that when a series of 
Lorentz transformations is applied, the net wave- 
function transformation can be found by multiplying all 
the A matrices together and using í 


_ W (20") = AW (x 
or ` 


W(x!) =CAY(x) 


depending on whether the over-all transformation is a 
time-reflection or not. For example suppose the time- 
reflection, 


yl Snes, 
Yel) =CMy (x), 
buyi =M YM, 
is followed by the non-time-reflection, 
Ma ON in 
W(x") =A (x'), 
Ar Yu = ATINA. 
Then the net transformation is found by direct substitu- 
tion to be 
EN = Oal urn, 
Y (l )=ACOMYE (a), 
2,0 pr Y= aruM ~ Y,M 
=M71A7y),AM 
= (AM) (AM). 
Since Eq. (145) applies to A, the wave-function trans- 
formation can be rewritten as 
y” E(x") = ACCMY (x) 
=C(AM)¥ (x), 
which proves the assertion in this particular case. The 
assertion in the other cases can be verified in a similar 
way. The special values of A given in Eqs. (110), (113), 
and (114) have been chosen so that they satisfy Eq. 
(145). That charge conjugation should be covariant 
with respect to those two reflections was first pointed 
out by Racah.2! Any matrix compounded from those 
given in, Eqs. (110), (113), and (114) will still satisfy 
Eq. (145) since if both 
A°C=CA 
and 
M°C=CM, 
then for the product AM, n 
(AM)°C=A°M°C 
=A°CM 
=C(AM), 
as required. 


21 Giulio Racah, Nuovo cimento 14, 322 (1937). 
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VI. COVARIANTS FORMED QUADRATICALLY FROM 
DIRAC WAVE FUNCTIONS 


In this section it is shown how tensors can be formed 
quadratically from Dirac wave functions. These tensors 
are of physical interest because they can be chosen so 
that their components are real or pure imaginary ac- 
cording to whether their indices contain an even or an 
odd number of fours—with such a choice they can 
represent observable quantities. The discussion of these 
tensors leads to an additional condition which is im- 
posed on the wave function transformation matrix A. 

In forming tensors the basic idea is to consider the 
quantities y47,w where y is a Dirac wave function and 
y^ is the adjoint wave function of Eq. (25). Therefore 
the first thing to be discussed is the transformation rule 
for the adjoint wave function. In an unprimed coordi- 
nate system the adjoint is defined by 


pra=ply, (148) 
and in a primed system by 
pic p Hya (149) 


If the two coordinate systems are connected by a non- 
time-reflection, 


(150) 
then, combining these three equations, it is seen that 


yazyp 
=y EAH 


= Yiyi yas. (151) 


On the other hand, if the two are connected by a time- 
reflection 


¥'°=CAy, (152) 


it is found that 


WiA)o= (Eya) 

= (By 

= (Cay? 

=YIA\HCHyC 

= Wy ACY f 

= — piy AEC, (153) 
where properties of the C matrix, Eqs. (140) and (138) 
have been used in the last two steps. One can see that if 
y4 were to transform with a factor of A~!, the tensor 
transformation rules for the quantities Y4ysy would 
result in a direct way from the fact that 


Onyi Ay, A. (154) 


It does happen that the matrix y,A“7 is related to A 
as long as the a,, satisfy the conditions of reality. This 
can easily be seen by taking the Hermitian conjugate of 
Eq. (154), 
jryi tjata = ATA), 
— ayit tay = AB (AA, 


~ 
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noticing that these equations can be combined into the 
form 


Anoy aY Va =A yayaya (AF, 


substituting Eq. (154) on the left to eliminate the a,,, 
WATYA ya = Ayayay A)”, 


and multiplying from the left by Ay4 and from the right 
by Ay, to obtain 


YuAyiA ya = Ay AYY p 


Thus the matrix Ay,A”y, commutes with the y, and so, 
according to step 5 of the proof of the fundamental 
theorem, is a number, say k, times the identity 


Ay y= k. (155) 


It can further be seen that & here is real by taking the 
Hermitian conjugate of this equation, 


yabysA# = kar 
and multiplying from the left and right by y4: 


Ay AE y= ks. (156) 


Equations (155) and (156) show that & is equal to its 
conjugate and so is real. Furthermore from Eq. (155) it 
is seen that by proper choice of the absolute value of the 
arbitrary factor in A one can adjust the absolute value 
of k to be one. This can be done independently of the 
assignment made in the preceding section to obtain Eq. 
(145) since that required a choice of the phase of the 
arbitrary factor only. In what follows it will be supposed 
that the arbitrary factor in A has been chosen so that 
both Eq. (145) and 

AysA¥ y= +1 (157) 


are satisfied by A. It happens that the sign in Eq. (157) 
is just the sign of a44. One can see this by multiplying the 
fourth of Eqs. (154), 


Gapyp=A yA, 


from the left and from the right by y, and adding to 
obtain 


asol YY p+ Ya) = Vd y AHA yA. 


The term on the left can be simplified by using the 
anticommutation relations of the y,. Also according to 
Eq. (157), y4A™ can be replaced by +A”, in the first 
term on the right and Ay4 by --y,(A)# in the second. 
The equation then becomes 


Qas= + [AFL+A-1 (A). 


The sign can now be decided by considering one of the 
diagonal elements of this matrix equation. A diagonal 
element of the product of a matrix and its Hermitian 
conjugate must be real and positive so the term in the 
square*brackets gives a positive contribution. The sign 
must accordingly be identical with the sign of a4, and 
Eq. (157) can be written 


Aysh¥ ys=fass/|044|). 
a 


e 


(158) 


GOOD, 
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In view of this equation the transformation rules for the 
adjoint wave function, Eqs. (151) and (153), are as 
follows: 


AE (159) 
for a non-time-reflection, and 
UDEVA Ce (160) 


for a time-reflection. 

Further discussion of the formation of tensors will be 
postponed for a paragraph in order to make a few more 
remarks about the wave-function transformation matrix 
A. Given a set of Lorentz transformation coefficients app, 
A is to be chosen so that it satisfies Eqs. (109), (145), 
and (158): 


aury =A Yå, (161) 
A°C=CA, (162) 
Ay, fy, = (ası. | asal). (163) 


Equation (161) determines A as far as a numerical 
factor, Eq. (162) fixes the argument of the factor as far 
as a multiple of z, and Eq. (163) specifies the absolute 
value of the factor. These three equations then de- 
termine A except for a factor of +1. For each Lorentz 
transformation there are two matrices A satisfying 
these conditions and these two differ only in sign. The 
special solutions for A given in Eqs. (110), (113), (114) 
have already been chosen to satisfy Eq. (163) as well as 
the other two. Products of these matrices will also 
satisfy Eq. (163) properly, as well as the other two 
equations, for if also 


MysM*y4= (b14/|b44|) 
then, for the product AM, 


(AM) ya (AM) Hy, = AMy,MZA Ey 4 
= (bas/ | baa] )AysA Ey 
= (b44/ | B44|) (as4/ lasal). 


It can easily be seen that the matrix A is unimodular: 
(164) 


To prove this one takes the determinant of Eq. (162) to 
see that detA is real and the determinant of Eq. (163) to 
see that detA has absolute value unity. The last step is 
to decide between the values plus and minus one. For 
the identity Lorentz transformation evidently A is plus 
or minus the identity so the sign of the determinant 
must be positive for Lorentz transformations continuous 
with the identity. For the space and time reflections 
whose matrices A are iy, and ysya, Eqs. (113) and (114), 
the determinants of the matrices y4 and zysys may be 
considered since they have the same value. Each of the 


detA=1. 


matrices y4 and iysy, is Hermitian and so has real 


eigenvalues; each squared is one and so its eigenvalues 
are either plus or minus one; each has zero spur (step 2 
of the proof of the fundamental theorem) and so its 
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eigenvalues are +1, +1, —1, —1; therefore, each has 
determinant plus one. The determinant then is positive 
for transformations continuous with the identity and for 
these two special space and time reflections. It will still 
be positive for arbitrary products of transformation 
matrices so Eq. (164) must hold in general. Another 
property of A is that, when aj, and a4; are zero so that 
the @,, aré all real, A is unitary. This is easily demon- 
strated by taking the Hermitian conjugate of Eq. (161) 


auy = Ay, (A7)E, 
recombining the result with Eq. (161) 
ATIY, A =A Hy (A), 
and rearranging the equation into the form 
YAA = AA Fyn 
which implies that 
AA? =k, 


where k is a positive number. It can be concluded that $ 
is 1 by taking the determinant of this last equation; this 
completes the proof that 


AAH=1 (165) 


when the ap, are all real. 

Returning to the subject of the construction of tensors, 
one can find the transformation properties of the 
quantities ¥y zy from the transformation properties of 
the wave function and its adjoint. The results of argu- 
ments to be given below are the following transforma- 
tion rules, corresponding to the Lorentz transformation 


$4 Shee 
of the coordipates: 
vy =py, (166) 
ip! yw! = (ass/ lasal) apip tY, (167) 


WA Quai = (a45/ | a44|)Oup@re 
xi (Y Ye Vo pV; (168) 


ip! Ays Yy = (deta)anpi p YsY 4, (169) 
ipy' Ay = (deta)iy ysy. (170) 


The five quantities on the left are often called the Dirac 
covariants; individually they are known as the scalar, 
vector, tensor, axial or pseudovector, and pseudoscalar 
because of the way they transform with respect to non- 
time-reflections. Evidently the tensor is antisymmetric. 
The factors of i have been chosen so that a component is 
real if its indices contain an even number of fours and so 
that otherwise it is pure imaginary. For example, for the 
space parts of the vector one finds 


(Ay) =- it yan)* 
= Wy 

= yal 

= iy 
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(p4yg is a one-by-one matrix and so is trivially equal to 
its transpose), and similarly finds for its time part 


lipsy) E — iW yer 

= —ipiyy. 

The proof of the first three transformation rules, Eqs. 

(166) to (168), for non-time-reflections follows directly 

from the transformation rules for the wave function, 

Eqs. (150) and (159), and the first property of the A 

matrix, Eq. (161). For example the proof of the tensor 
rule is as follows: 


iy'4 Gi Vu) = 1A AT (Yav WAY 

= ipi NyA p WA My hy 

= aaplrth Ytp — By yp" Yo 

= Appr (YoYo YT) 
for non-time-reflections. The proofs for the axial vector 
and pseudoscalar follow directly in the same way once 
it is established that 

Atys = (deta)ys. (171) 

This can easily be proven by writing ys in the form 
(4i) l eurgoYuYYgto. This form is allowable because there 
are 4! nonzero terms in the sum, in each of which the 
indices are all different so that both ey,¢. and YuYsYgYo 
are completely antisymmetric—this means that each 
such term can be reduced to ys. The proof is then as 
follows: 


Am*ysA = (41)7 Enpre Vp YETA 
= (4!) l euvgoluplvoðgrlosY Y eV rY v 
= (AN) (deta) eper st eV VV v 
= (deta)ys. 
The next thing to discuss is the derivation of the í 
transformation rules when there is a time-reflection. It 
is convenient to consider all the covariants at the same 
time. From Eqs. (152) and (160) it is found that 
payet =L) yB ye] 
= [y AAC- y BECAY |e 
= [YEA Cyl]. 
It is a property of A, Eq. (163), that y44™! is ~A” 80 
Wy a! = — [VIM CCA], 
and it is a property of C, Eq. (138), that yC™ is 
Cy. so 
Wy ah! = [EAEC Cay] (472) 
The expression has been cast into this form in order to 
make use of the fact that 


CysreC = —«(vsre)", 

when ya=1, YsYm OF Ys 
or yarr). 
Equation (173) is just a convenient way of summarizing 


(173) 
where x=1 
=—1 when ys=7¥z 


ae .. E s 
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the symmetries considered by Haantjes in his proof of 
the antisymmetry of B, Eqs. (74) to (78). Equation 
(173) can be rewritten in the form 


Cysyr= —K(yayn)7C 
or, since C is symmetric, 
CYiy B= —K (C "yay B) si 


Tf C is replaced by Bysys according to Eq. (136) the 
statement becomes 


Bysyn=—x(Bysyx)’, 
and the truth of this is easily verified from Eqs. (74) to 
(78), in which k is —1. This establishes the validity of 
Eq. (173). Since C is symmetric and unitary its inverse 
is its conjugate, and the complex conjugate of Eq. (173) 
is s 
Cya YB C= —r (ysy B)". 


This last equation can now be introduced into Eq. (172) 
so that 


W yep = — KW PAF (yry s) Ay] 
= — EAE yay BAY 
= — piy yay why 
= YAA y BAY, (174) 


where Eq. (163) has been used again in the last step. 
This equation shows that the transformation rule for 
non-time-reflections differs from the rule for time- 
reflections only by the factor x. This difference has been 
taken into account in the transformation rules for the 
covariants, Eqs. (166) to (170), by adding the factor 
(@4s/|as4|) in the vector and tensor equations. This 
completes the discussion of these transformation rules. 
If the time-reflection viewpoint discussed at the end of 
Sec. IV is adopted then evidently the adjoint wave 
function always transforms according to Eq. (151), 


yA = piyd Eya 
or, from Eq. (163) 


V'4= (ay/ | a44 [y 4A. 


It is seen that when this point of view is used all of the 
above covariants transform with the factor (a44/ |@44]) 
rather than just the vector and tensor. 

Since the charge conjugation operation is covariant, 
it is clear that five other covariants of the form ¢4y2¢ 
can be constructed, where ¢ is (Cy)°, the charge conju- 
gate wave furiction to y. These, however, are propor- 
tional to the covariants above: 


pysg =p yay 2d 


z L(Cy) Ty sy BC? |? 
3 =V4C(yayn) "CY 
= — Kp yay Bl 
E — Ky By, (175) 
where Eq. (173) is used to pbtain the fourth line. One 
ee 


GOOD IR. 


may consider next quantities of the form 


by ph = ob yay Bh 
=Y" Cys}. (176) 
First of all it is clear that these are identically zero 
unless yg iS Ya OF Yip), for, as another application of 
Eq. (173), 
O"Cysy a= — Kh" (yay 2) CY 
=— «LY (vays) CY]? 
= — Kb" Cysyn¥, (177) 
and x is +1 except for those assignments of yz. Only the 
quantities $4yy7y need be considered, where yyr is any 


one of the y, or Yim]. With respect to non-time-reflec- 
tions their transformation rule is 


¢ Ayr =p A yyrAy (178) 


and, iņ parallel with the derivation of Eq. (174), with 
respect to time-reflections it is found that 


6 yvry' = —YAd yy rAd. (179) 
In view of Eq. (179) these quantities are not covariants. 
However, one may consider also the quantities y4ygọ 


which are plus or minus the complex conjugates of the 


by py: 


(Biyah) = (Hysa): 

=8" (yaya) Ye 

= WIC (yry B) Ch 

=— Kh" (yy B) h 

=y“y 29, (180) 
where Eq. (173) has been used and the last step is justi- 
fied since yayz is either Hermitian or anti-Ifermitian. In 


view of Eq. (180) only the ¥4yyré are not zero; 
evidently they transform according-to 


W4yy7rd'=Y4A yy rAd (181) 
for non-time-reflections and according to 
Yiyyro' = — $4AyyrAy (182) 


for time-reflections. From Eqs. (178), (179), (181), 
(182) it is seen that the sum and difference of ¢4yy7V 
and y“yyr@ are covariant quantities. In detail in terms 


of the wave function y one finds, corresponding to the 
Lorentz transformation ~ 


Eu = lpr 
the following transformation rules: 
WY? Cyrih Wey CC] 
= (asy/ |as |)aui 4T Crys], (183) 


aly’ TCs Yay» zg YYY Hy Eya (yur = Vu) Ca 
= (os/ |as | )aypQroil y" Cya (Ye%e— Yo 
Wo (Yp¥e— Vo) CW], (184) 
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LY Cyl =p yeye] 
= audy Cyrop pyy C], (185) 
TAO a —W N E OAA 
= aupävo l "Cys Gace Yoo) 
— Ws (y¥o- VV 2) Cy"). (186) 


Here also the factors of i have been added so that the j 
components of the vectors and the jk components of the 
tensors are real and the other components are pure 
imaginary. Covariants can be constructed quadratically 
from two different wave functions in a similar way. The 
ambiguity in the sign of A has no effect on any of the 
transformation rules above because A and A~! always 
enter together into any specific term. 


VII. SPECIAL EXAMPLES OF DIRAC MATRICES 


The effect on the Dirac equation and on the covariants 
of introducing a different set of Dirac matrices is the 
first subject treated in this section. Following that, three 
special sets of Dirac matrices are discussed: the set 
already considered above in discussions of the non- 
relativistic limit, a set in which the matrix C is the 
identity so that charge conjugation is identical with 
complex conjugation, and a set which leads to the spinor 
terminology. 

In order to discuss the effect of using different sets of 
matrices, suppose y, and 7, are two sets of Hermitian 
Dirac matrices: 


VeVi Yeu = 2ôpv, 
Fart WV a= By. 


According to the fundamental theorem of Sec. II these 
are connected by a similarity transformation: 


, Y= SYS. (187) 


Since both y, and ¥, are Hermitian, S can be chosen 
unitary, 

S#S=1. (188) 
This is easily proven by substituting Eq. (187) into 


J= Yu 
O = SypyS-, 
which can be rewritten in the form 
= yS "S= SES, 


However, if S¥5 commutes with each of the y, it must, 
according to the fifth step in the proof of the funda- 
mental theorem, be a multiple, say k, of the identity 


SES=k. 


The diagonal element of the product of a matrix and its 
Hermitian conjugate is real and positive so & is real and 
positive. It is then possible to choose the arbitrary 
factor in S so that Eq. (188) is fulfilled. In what follows 
it is assumed that this choice þas always been made at 


to obtain 
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the beginning. One can see next that the substitution 
W(x) = Spl) (189) 


carries the Dirac equation»with matrices ya 


o te me 
(- es Ay) rb y=0 
Ox, he h 


into the Dirac equation with matrices 7, 


I V2 ces 
(- -— A, Jado, 


Ox, he h 


Therefore the introduction of a different set of Her- 
mitian-Dirac matrices only amounts to a unitary trans- 
formation among the four components of the wave 
function. It will next be shown thgt Eq. (189) also 
applies to the charge conjugate wave function. To make 
this proof one may begin with the connection between 
Band B, Eq. (81), 
B=kSTBS. 

As argued following Eq. (81), B and B are both unitary 
and are undetermined as far as a numerical factor which 


has unit absolute value. This means that the absolute 
value of & is also one, for 


BHB=kk°S4# BE (ST)HSTBS 
= |k|2S" Bu (SS#)T BS 
and, since S also is unitary, this reduces to 
1=[&|?. 


Suppose the relative phase of the arbitrary factors in B 
and B or the argument of the arbitrary factor in S is 
always chosen so that 

B=S™BS 
or 


B=S°BS". (190) 


The corresponding connection between C and C is then 
found directly to be i 
G= B7475 
=S BS Syy S 
=S°CS+, (191) 
Finally then, for the charge conjugate wave function, 
5-054" 
=SC°(S) Sey" 
= SCY 
=S¢, 
as required. However this result holds only if some of the 


arbitrary factors are adjusted until Eq. (190) is valid. 
Since 


M709 = 77 F 
=yY"SE Syry gS SY 


ms 
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it is immaterial whether the barred or unbarred 
quantities are used to evaluate the covariants. Next it 
will be shown that the connection between the two wave 
functions, Eq. (189), is covariant with respect to 
Lorentz transformations. Assume it holds in the un- 
primed system. By operating from the left and right 
appropriately with factors of S, S°, and S~! one may 
convert Eqs. (161) to (163) into the form 


Qury» = (5) 7,45, 
(A7) cG = Cie 
AFA (AT) 2 F4= (a44/| asl), 


where 
ATS SNS Ae, 


Therefore, A~ is the transformation matrix for the 
barred wave functions and, for non-time-reflections 


VY ()=1-9@) 
= SASSY (x) 
= SAy (x) 
= SY'(2’), 
as required. Using Eq. (191), one can easily construct a 


similar proof for time-reflections. 
One special set of Hermitian-Dirac matrices is as 


follows: 
0 
) a% 
1 


0 —icsj —1 
Vea k D Yiz ( 
103 0 0 
where the o; are the Pauli matrices, 
0 1 0 —i 1 0) 
n=( ); 02 ( ) om ( ): (193) 
1 0 3 © 0 —1 
For this set ys=yiyzy37ys, 2 B chosen unitary, and 
C=Byzys are: 


0 1 oe 0 0 —oe 
v=( ), B=( ), c=( ). (194) 
1 0 0 02 o? (0) 


In Sec. TI it is shown how, when these matrices are used, 
the Schrödinger-Pauli limit of the Dirac theory is 
obtained. At the end of Sec. IIT this choice of the B 
matrix is discussed. In Sec. IV these matrices are used to 
discuss the matrix A for a space rotation and to show the 
connection between A and the Cayley-Klein parameters. 
Another special set of Hermitian-Dirac matrices is 


« 0 aN ° 1 0 
v= ( ): v= ( ), 
g1 0 0 —1 
A a) (: 3) 
Y3= x Nis . 
$ oc 0 o 0 


This is a natural type to consider because, as is seen 


3 from the definitions of the Pauli matrices above, the y; 
Be here are teal, and y4 is pure imaginary: The matrices ys, 


| 


GOOD, JR. 


B, C as previously defined are for this set 


i) in OY (1 0 
Y= ; p=( J =( ). 196 
(Ge) Oe, Ot) 


Since the charge conjugation matrix C is the identity, 
when this set of matrices is used charge conjugation, 
Eq. (133), reduces to complex conjugation-and, as is 
seen from Eq. (162), the wave-function transformation 
matrix A is always real. It is clear from Eqs. (137) and 
(138) that C can be the identity whenever y;, zy, are all 
real. 

Since ys is Hermitian, anticommutes with each of the 
Yu, and gives unity when squared, another set of Dirac 
matrices can be formed by interchanging the definitions 
of y4 and ys in Eqs. (192) and (194). Such a set is of 
interest because the diagonal elements of all of the y,, 
when written in terms of two-by-two matrices, are then 
zero. This property permits the Dirac equation for the 
four-component wave function y to be written neatly as 
two coupled equations involvirtig two-component func- 
tions called spinors which, it develops, transform sepa- 
rately with respect to Lorentz transformations con- 
tinuous with the identity. Spinors were originally 
discovered by Cartan” but had no application in physics 
before Dirac’s theory. The spinor point of view has 
recently been reviewed by Bade and Jehle.” The basic 
properties of spinors are developed below in order to 
show how they arise from the general treatment of the 
Dirac equation; for further information the work of 
Bade and Jehle can be consulted. Besides the permuta- 
tion of the matrices in Eqs. (192) and (194) mentioned 
above, there are other sets of matrices which also are 
used to divide the four-component wave function into 
two spinors. To conform with the genera}, practice the 
discussion here will be based on the two-by-two 
matrices 


=i @ io 
n 

0 i D 4 
ae D 5 
p= ; = ; 
TANLO fi (eas) 


which are an arrangement of the Pauli matrices and the 


identity with various factors added. It is easily verified 
that these matrices satisfy the equations 


(197) 


p3°petpx°pj= 263%; 
ps“pj—p;"ps=0, 
psps= —1 , 


(198) 


and that, as a consequence, the matrices 


0 1p; 0 p 
v=( D v=( ‘), (199) 
pj? 0 — pal 0 


2 E. Cartan, Bull. Soc. Math. France 41, 53 (1913). 


re w L. Bade and Herbert Jehle, Revs. Modern Phys. 25, 714 
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satisfy the equations 
WIA w= 2b us, 


and so are a set of Dirac matrices. The Y, are evidently 
also Hermitian. This starting point is similar to Bade 
and Jehle’s—they use a different metric. The matrices 
Ys, B, C defined as above are easily found to be 


(Th CED oe 
To , = , C= . (200) 
0 -1 0 ps 1 n aie) 


Next the four-component Dirac equation 
o We mc 
ta) o 
Ox, he h 


will be broken up into two coupled two-component 
equations. The.four-component wave function y will be 


written as 
(x) 
v (2) = ( n) (202) 


n(x) 


where ¢ and 7 are two-rowed column matrices. Two- 
component functions like ¢ and 7 are called spinors. 
Substituting Eqs. (199) and (202) into Eq. (201), one 


finds 
ð ie mc 
(=-24 » pattem 0, (203) 
Ox, fic h 
ô te mc 
E 0, (204) 
Ox, he h 


where the complex conjugate of the second equation was 
taken to bring the j- and 4-terms together. The charge 
conjugate weve function is 


$(a)=CY(2) = Cay 


so charge conjugation is equivalent to interchanging ¢ 
and 7. This can be verified directly in Eqs. (203) and 
(204) for if ¢ and 7 are interchanged there the equations 
are reproduced except with e replaced by —e. The 
transformation properties of the spinors ¢ and ņ can be 
found from the transformation properties of the wave 
function w; it is assumed that the same decomposition, 
Eq. (202), is made in each coordinate system. With 
respect to the space reflection 

uj =— tj X4'= 2X4, (88) 
it is seen from Eq. (113) that 


¢/¢(x")=ip n(x), 


P (205) 

ne (2!) =i (2). 

With respect to the time-reflection 
xf =Xj, T= — XM, (89) 
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it is seen from Eq. (115) that 


AES) =p (£), 
fy (206) 
n'l (x)= pan(x). 


The remainder of the discussion will be restricted to the 
transformations continuous with the identity. The 
matrix A is of course determined by Eqs. (161), (162), 
(163). However, before applying them, one may notice 
that Eq. (171) 

A~ sÅ = (deta)ys 


means, when ys is as defined in Eq. (200), that A is of the 


form 
Ne 0 
A= ( ); (207) 
OA 


where A, and A; are two-by-two matrices. Then, from 
Eq. (162), 


A°C=CA, 
and C as given in Eq. (200), it is seen that 
A= Ag (208) 


so that the two spinors transform identically: 
(2) =A.6(),] 


(209) 
Ways Ben eee | 


Next it is found that Eq. (163) for non-time-reflections 
AysA#y.= 1, 
rewritten in terms of A,, becomes 
Api AT p= — 1. (210) 


From this equation it can be concluded that the spinor 
transformation is unimodular. By taking the determi- 
nant of the equation one can see that (detA,) is unity 
since, according to the definition of p, in Eq. (197), 
(detp,) is minus one. However, A, is continuous with the 
identity and therefore 


detA,=1. (211) 
Finally it is easily seen that Eq. (161) 
aury =A YA (161) 
is equivalent to 
aurp = (As) pAs, (212) 


which is somewhat in parallel with Eqs. (119) and (161). 

The covariance of Eqs. (203) and (204) with respect to 
Lorentz transformations continuous with the identity 
can be seen directly as a consequence of Eqs. (209) and 
(212). Independent of the choice of the y, it was shown 
that A is unitary when a4;, aj; are zero, Eq. (165). This 
property applies also to A, in view of Eq. (207); for a 
space rotation A, is unitary. For any two-dimensional 
rotation A, can be found explicitly by specializing Eq. 
(110) to the matrices ona (199). If the adjoint 


: 
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spinor is introduced according to 
Cha tH p= tps, (213) 


then it‘is found to transform according to 


eA 


See a 
= — CAA Moe 

=—fAp A." p: 

=CA(A Pe, (214) 


where the conjugate of Eq. (210) was used in the last 
step. As a consequence of this and Eq. (212) one sees 
that i¢4p,¢ is a vector with respect to transformations 
continuous with the identity: 


it Appt = iA (A) ppt 
LOAA pi (215) 


eH, ,C 


e 


The factor of 7 wa$ added here so that the j-components 
would be real and the fourth component pure imaginary. 
To demonstrate that this result has been obtained, one 
takes the complex conjugate of each component and 
manipulates it in the following way: 


(af po) = ilt pac pt) a 
=i pst 
= — ig pa pa” ps76. 
From Eqs. (197) and (198) it is easily found that 
pa°p;ps7 is —p; and that p4°p4¥ p47 is ps. Therefore the 
equation 
(itip) C= itip? 
is valid, where the plus sign applies for the 7 components 
and the minus for the fourth component as required. 
Also it can be shown that —7f17 pate is a scalar with re- 
spect to Lorentz transformations continuous with the 
identity, where ¢, and £2 are two spinors: 


— iti "pate = — iTA, T shoe 

= — ih pA Aske 
i = 161" pate; (216) 
here Eq. (210) was used in the first step. By writing it 
explicitly in terms of the components of the spinors, one 
finds that this scalar can be expressed as the determi- 


nant of the two-by-two matrix whose columns are ¢, 
and fe: 


—1§1"po=det(S1 £2). (217) 


Evidently this scalar is zero when ¢, and £2 are identical. 
An alternative. proof that this quantity is a scalar rests 
on the fact that the transformation is unimodular. It is 
easily seen that 


det (r $2’) = det (Asti Asf2) 
= det[(A.) (1 2) ] 
= =det(A,) det (fı £2) 
=det(t1 f), (218) 


which proves the assertion. « 


GOOD, JR. 


VII. IDENTITIES BETWEEN SCALARS FORMED 
FROM FOUR WAVE FUNCTIONS 


In theories of reactions involving four particles which 
have Dirac four-component wave functions, the problem 
of forming scalars proportional to each of the four wave 
functions arises.” In this section connections between 
some of the possible scalars will be given. The prototype 
of these theories is Fermi’s theory of beta-decay®; the 
arguments below are put in the terms of this theory 
although they have somewhat greater applicability. 

For emission of electrons the reaction considered is 


N=P+e+}, (219) 


where N, P, e, # indicate the neutron, proton, electron, 
and antineutrino. Dirac’s theory of holes is assumed to 
apply to both the electron and neutrino so the creation 
of an electron and antineutrino is identical with the 
destruction of a positron and a neutrino. Therefore the 
reaction of Eq. (219) can also be written in the form 


P2N+é+», (220) 


where ë, v indicate a positron and a neutrino, and thus 
the theory of positron emission is included in the theory 
of electron emission. Various arguments in the theory 
require the interaction term to be real, a scalar with 
respect to Lorentz transformations, proportional to the 
wave function of a particle when that particle is de- 
stroyed, and proportional to the complex conjugate of 
the wave function when the corresponding particle is 
created. Actually in the theory the wave functions are 
second quantization operators. However they commute 
with each other and so, for the present purposes, the 
operator properties can be disregarded. The reactions of 
Eqs. (219) and (220) can be written in the form 


V+veP+e, á (221) 
and then it is easily seen that the interaction terms 
J= [Yr tyyn lb ab +l yyy., (222) 
Jo= [Yr yyt [Vry] 

+h yay belive], (223) 


Ja= 3h yaY Ye Yor WN ] 
Xe ys(reve— eV er] 
Bln "YYY Yo WP] 
x [yy (YoYo y YoY p)We], ; (224) 
Jı=— Mef yaver hy Vys] 
= yas [V Yas., (225) 
Js= We yyy Lyerly] 
Hyrre yayste], (226) 
contain the wave functions and their complex conjugates 
in the proper way. In each case the first term applies 


* See, for example, Enrico Fermi, Elementary Particles (Yale 
University Press, New Haven, Connecticut, 1951). 
E. Fermi, Z. Physik 88, 161 (1934). 
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when the reaction of Eq. (221) is to proceed to the 
right—electron emission or positron capture, The second 
term applies when the reaction is to proceed to the left — 
positron emission or electron capture. The constant 
factors in the J’s have been chosen so that each of them 
can be written in the form 


De yay by IY hy] 
: t+ rd p Yyy me |}, 


where the sum extends only over the appropriate type of 
yn. Referring to the definitions of the yz at the beginning 
of Sec. III, one sees that the negative signs in the 
definitions of J; and J, correspond to the factors of i in 
the definitions of yta») and yas») and that the factor of 
(1/8) in the definition of J, will reduce the sum of 48 
terms to a sum of 6 terms, involving the six yta». For 
brevity Eqs. (222) to (226) will be written collectively 
as 


J r= (bre! y Qrby) We yQry,) 
+ by! yQryr) (W Yre), (227) 


where the constant real factor and the sums on the right 
are not explicitly indicated and the capital Greek 
subscripts run from 1 to 5. These Jr are all real, for the 
second terms are the complex conjugates of the first: 


CW! yQrby) WH y Qs) 1° 
= (Ypy ryn)" (Y Yr)" 
= Lyn” (y42r) MY p IL p” (ys2r) Hy] 
= (Wy? yr) (YYY). 


The last step is permitted since y,Qr is just some product 
of the y, and so is either Hermitian or anti-Hermitian. 
With respect to Lorentz transformations which do not 
reflect the time the wave functions transform according 
to Eqs. (150) and (159), and it is evident that the Jr are 
scalars. When there is a time reflection, Eqs. (152) and 
(160) are to be used. One finds, in parallel with the 
derivation of Eq. (174), that 


Vp Fy Qrby' = py yA OrAvp, 


where x, defined just below Eq. (173), is +1 when T is 
1, 4, or 5 and is —1 when T is 2 or 3. In any case Kis +1 
and so the Jp are scalars with respect to time-reflections 
also, the first terms transforming into the second. These 
five interaction terms thus fulfil all the conditions 
imposed above, but of course there are several other 
possibilities. For example, a linear combination of these 


interactions, 


5 
J=> erJr, (228) 


i. 


where the cr are any real numbers, also fulfils the 
requirements. In beta-decay theory it is customary to 
take the five Jp as the primary interactions and to 


` express other interactions linearly in terms of them 


when possible; this is what is done below. In terms of the 
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matrices aj, 8 defined by Eq. (31) and connected with 
the ya according to Eqs. (18) and (19), the interactions 
have the form 


J = (Wr Bb) pb BY) -HEC 
Jo= Wry) bob) — bray) by Hayh,)+e.c., 
Ja= Wr" Be bn) WBE po) 

+ (pr! Baby) (Y Bap) FEC 
I= (Yran) UH aw) — be" yy) yh) +¢.c., 
Js= (Y Byer) YA By) +ec., 


where c.c. indicates the complex conjugate and 


a; O 0 1 
“CO 
0 4; Eg 


gives the matrices ¢;, ys. These forms.of the interactions 
are frequently used as the starting point of treatments 
of beta-decay theory.” Papers concerned with linear 
combinations of interactions have recently been pub- 
lished by Michel and Wightman” and by King and 
Peaslee.**? Michel and Wightman give a résumé of the 
earlier work done on the subject. 

One can easily construct other permissible interactions 
by forming scalars similar to the Jp but with the wave 
functions arranged differently within the products. The 
main purpose of this section is to show how these other 
interactions can be expressed as linear combinations of 
the Jr. One approach to the study of these other 
interactions is to rewrite the reaction equation in the 
form 


P+N+é+v=0, 
so that all the particles are uniformly created or 
annihilated. This corresponds to introducing the charge- 


conjugate wave functions for the proton and electron so 
that the interactions are written as 


Jr= (pC 21m) lT Cy Qrp,)+e.c. (229) 


The other interactions are then easily fərmed by 
interchanging positions of the wave functions. The first 
type which will be discussed is formed by interchanging 
the wave functions of the heavy particles: 


Kr= (Wx7CyQ2rop)(6-7Cy2ry,)+c.c. (230) 


The Kr are simply related to the Jr, as a consequence of 
Eq. (173): 


YT Cy Qrop= — nbn? (y:2r) TCop 
= — by? (Dr) "Cor? 
= — rop CY ryw, 


and this implies that 
Kr=—«Jr. (231) 


ae E for example, E. J. Konopinski, Revs. Modern Phys. 15, 
209 (1943). 
2? L. Michel and A. Wightman, Phys. Rev. 93, 354 (1954). 
23 R. W. King and D. C. Peaglee, Phys. Rev. 94, 1284 (1954). 
a 
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Evidently a similar result applies when the light particle 
wave functions are interchanged so that 


| Kr= (pr CyiQrpw) We ?CyQrb.) + c.c. 


is an alternative expression for Kr. 

The next set of interaction-terms which will be dis- 
cussed is formed by interchanging the wave functions of 
the charged particles in Eq. (229): 


Lr= (6-7Cy Qrby) (or7CyQry,) + c.c. 
= (PP ysQryy) (VPY) + c.c. (232) 


These interactions also can be expressed linearly in 
terms of the Jr; the formulas were first published by 
Fierz.® The proof given below follows Pauli’s proof of de 
Broglie’s identities,” and is based on the fact that 


16 
ZC) ara) a= ned pr, (233) 
=] 


where (yz) ,, are the elements of the matrix yg. One can 
prove this identity by specializing step 6 of the proof of 
the fundamental theorem to the case when the two sets 
of matrices involved, y, and y,’, are identical. Then, 
from Eqs. (59) and (60), for any four-by-four matrix F 
the matrix S defined by : 


16 
S=} ysF yz 


B=1 
has the property 
: yaS=Sya. 


However, from the fifth step in the proof of the funda- 
mental theorem, a matrix which commutes with the y, 
is a constant times the identity. It can be concluded 
that, for any matrix F, 


16 

2 YsFys=k, 

B=1 
where & is some number times the identity. One may 
now choose in succession for F the sixteen different 


matrices which have one element unity and the rest 
zero; this gives 


16 
& e)a (Ya) = krpôgo- 


The constants k,, can be found by contracting on u 
and o: 


16 


f 4ky p= >, (r2) a(78) px 


B=1 
16 
=D (vs*) p> 
B=1 
= 168p». 


2M. Fierz, Z. Physik 104, 553 (1937). A general treatment of 
such identities has been given by K. M. Case, Phys. Rev. 97, 810 
1955). 
‘ æ Reference 2, p. 131. A 
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The last step here results from the fact that the square 
of each of the yz is the identity, Eq. (46). Substituting 
back for k,» one obtains 


16 


DE: (YB) uv» (¥B) po= 45 n05 p>, (233) 
B=) 


as required. If this equation is multiplied by 


(V1) u(Y2) (Wa) p(Ws)», where Yi, Ya, Ws, Ys are any column 
matrices, and if the sum on B is expanded to show the 
five types of yz, the result is 


4 (Ypa) (Was) 
= (Wal Po) YT pa) + WaT Yp) Wi? yas) 
+ (YTY tahe) WY wots) 
+ (YaTiysYy We) Vit tye us) 
+ (Ws? ysho) Wi? yes). (234) 
In the third term the sum is to be carried over the six 
Yta»). Then, keeping in mind that each Jr involves a sum 


over one type of ys, it is easily seen that the five 
specializations 


1. YT =Y", Yo= Wy, YT =Y piy, w=, 
2. abel YY vo= vn, Ws? =VPP yyy, =p, 
3. YT =e yay, Yo=vn, PT =beX yey yEy 


4. Wave ystysyn, Yo=VN, Ws? =P yis =p, 
5. pT=p Eyy vo=vy, Ws? =VrP yay, w= 


give directly these relations between the interaction 
terms: 


4Liı= Jyt Jot Jat Jat Js, 


4Lo=4Jı—2J2 +2I4—4S 5, 
4L:=6Jı —2J,; +6Js, 
4Lı=4J:ı +27: —2Jı—4Js, 


4L;= Jı— Jo+ J3— J+ Js. 
These equations will be summarized in the form 


5 


Lr=}, AraJa, (235) 
A=1 
where 
iy cal: 1 1 1 
4 -2 0 2 —4 
Ar=3/6 0 —2 0 6 |. . (236) 
a 2 O = l 
il l ah ale al 
Evidently 
Vea (237) 


because if the wave functions of the charged particles. 
are interchanged twice, fhe original interaction is re- 
covered. The interaction terms formed by interchanging 
in Eq. (229) the wave functions of the heavy particles, 


the light particles, and the charged particles have now — 


been discussed, and all other possible interchanges can. 
be expressed in terms of them. 
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Sometimes interactions which are symmetric or anti- 
symmetric with respect to an interchange of the wave 
functions of two of the particles are considered. From 
Eg. (231) it is seen that the interaction 


J=aJ obs; (238) 


is symmetric with respect to an interchange of the wave 
functions of either the heavy particles or the light 
particles and that the interaction 


J=cI,+dJi+e/s (239) 


is correspondingly antisymmetric, where a, b, c, d, e are 
any real numbers. It is evident that interchanging the 
charged particle wave functions is equivalent to inter- 
changing the uncharged particle wave functions. If the 
interaction 


5 


=> erly 
T=] 


is symmetric or antisymmetric with respect to such an 
interchange, then the cf will satisfy the equation 


5 5 
D erlry=+ > crJr 
Tal EA 


or, in view of Eq. (235), 


5 5 5 
D X crA rav a= + SD, Cat a. 
T=] A=1 A=1 
However, this implies the equation 


5 
DE (A T)arcr= EON 
T=1 
which may be used to determine the ca. This is in fact 
the eigenvalue problem for the transpose of the matrix 
A, because its square is unity and so its eigenvalues are 
+1. The result is that the interaction 


T= (12f43—)Ii+3fJot 2ft+g)Jst3fJet3gJs (240) 


is symmetric with respect to an interchange of the wave 
functions of either the charged or the uncharged 
particles, and that the interaction 


T= (—M—NIEh—kAD Jot hIs+kIitls (241) 


is correspondingly antisymmetric, where f, g, l, k, l are 
any real numbers. 
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It is easily verified from Eqs. (238) to (241) that the 
only interaction which is either symmetric or anti- 
symmetric with respect to an interchange of any two 
wave functions is ; : 


S=m(—J +S s+Js5), (242) 


where m is some real number. Since this is a special case 
of both Eqs. (239) and (241), this interaction is anti- 
symmetric with respect to an interchange of any two 
wave functions. It was originally proposed by Wigner 
and Critchfield.4-” Since (—Ji+J,+J5) is antisym- 
metric with respect to interchanges of any pair of the 
wave functions dp, Yx, Øe W» it must be proportional to 
the determinant formed from these four wave functions. 
The proportionality constant depends on what set of 
Dirac matrices are involved. The matrices of Eqs. (192) 
and (194) are used almost exclusively in beta-decay 
theory and the rest of the discussién is restricted to 
them. By calculating out some of the terms it can be 
verified that, with those matrices, 


—SitIitJSs= —2 deter bx pa Yi) tcc. (243) 


In parallel with the treatment of the spinor-scalar, Eq. 
(218), the Wigner-Critchfield interaction can be shown 
directly to be a scalar as a consequence of the fact that 
the transformations of the wave function are uni- 
modular, Eq. (164). For a non-time-reflection it is seen 
that 


det(gp’ Yx’ ġe Wr’) tcc. 
=det(Adp Avy Ap, AY.) + C.C. 
=det[A (pr Yy pe Ws) J+ c-c. 
= det (gp Wy Ge Pr) FEE; 
as required. For a time-reflection one must also note 
from Eq. (194) that 
detC=1, 
and then the result is 
det(gp’ Yx’ ġe Ys )+c.c. 
=det (ppf yr’? pel pr jcc 
=det(CAġp CAyy CA. CAY) +C. 
=det[CA(op wy pe Y) J+ c-c. 
=det(dp Yn pe Yr) +C.C., 


which proves the assertion. 


u C. L. Critchfield and E. P. Wigner, Phys. Rev. 60, 412 (1941). 
2C. L. Critchfield, Phys. Rev. 63, 417 (1943). 
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The earth’s core may be assumed to consist of fluid metal sur- 
rounding a solid inner core which probably contains a source of 
heat to drive convection, but it is not possible at present to select 
between various possible types of convective motion in the 
fluid core. Types considered are characterized by some sort of 
radial flow streams and a tendency for the fluid to rotate on the 
average more rapidly near the axis to conserve angular momentum 
during the circulation. Though the actual flow may be quite 
complicated, proposed mechanisms for generating a terrestrial 
magnetic field are considered for some oversimplified flow patterns 
in an attempt to indicate what features of the flow may provide 
the most important possibilities for field generation. It is sug- 
gested that, without a field to absorb the energy, the flow would 
be accelerated indefinitely and would evolve through a succession 
of flow patterns, some of which would be expected to have the 
properties to generate a field capable of preventing further ac- 
celeration and prolonging the sfatus quo, thus making it likely 
that the carth should have a field. 

The generating mechanisms discussed include two induction 
theories, the dynamo theory of Elsasser and Bullard, which is 
discussed at length both in terms of velocity-current systems 
portrayed by elaborate models and in hydromagnetic terms, and 


the “twisted-kink” theory of Alvén which is discussed only hydro- 
magnetically. Each of these theories depends on’ amplifying an 
initial stray magnetic field up to a point where it dissipates all 
of the available energy, and is at least in this respect analogous 
to a conventional clectrical generator but without a ferromagnetic 
core. Other mechanisms discussed depend either on the thermo- 
electric effect with junctions at the core-mantle interface or on a 
combination of thermoelectric and Hall effects in the core and 
mantle. 

If the convective flow is rather irregular, the observed slow 
westward drift of the detailed pattern of the earth’s field is attri- 
buted to the vanishing of the total torque on the core by the 
magnetic field threading through the core and mantle, as a result 
of an eastward drag on the outer part of the core rotating more 
slowly in space and a westward drag on the more rapidly rotating 
part of the core near the axis, with the presumption that the 
observed magnetic pattern is characteristic of the westward- 
drifting outer part. If the flow insteatl involves a jet stream, the 
flow in the jet may under some circumstances be expected to be 
eastward for reasons comparable to temperate-zone meteorology, 
so the magnetic field should exert a westward drag on it leading 
to the westward drift of the flow pattern. 
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I. THE FLUID CORE AND THE PROBLEM OF 
ACCOUNTING FOR THE EARTH’S FIELD 


1. Introduction—The Earth and the 
Types of Theories G 


E must be made clear at the outset that the theory 
of the existence of terrestrial magnetism is in a very 
unsatisfactory state. To some tastes, this makes a 
review article on the subject undesirable, to others, 
particularly desirable because by pointing out some of 
the difficulties and explaining some of the competing 
possible physical effects even in oversimplified cases 
it may guide and stimulate further development, 
besides helping the casual reader appreciate the extent 
of the problem and the nature of some of the possible 
solutions that are being considered. The general ques- 
tion, “Why is the earth a magnet?” is surely sufficiently 
fascinating to merit such exploratory and perhaps 
wasteful attention in considerable detail. The possibility 
that the magnetism of other celestial bodies may have 
a related origin adds to the interest, but as it is hard 
enough to make dependable surmises about the interior 
of even this planet we inhabit, we shall not go further 
afield here. 

The most pressing unsatisfactory feature at the 
present preliminary stage lies not in the realm of con- 
vincing ourselves that we must be concerned with the 
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earth’s core made of molten metal having physical 
properties in certain ranges, but rather in not knowing 
how such a fluid behaves dynamically under the com- 
bined influence of gravity, rotation, magnetic field, 
and a thermal or other source of inhomogeneity. The 
comparatively simple problem of a horizontal slab 
of fluid heated at the bottom in which one has simple 
Bernal ces of convection in the nonrotating case, 
becomes sufficiently complicated when rotation about 
a vertical axis is introduced, or for a conducting fluid 
when a vertical magnetic field is introduced, and 
especially when both are introduced; so that it is quite 
beyond the present state of the art to make dependable 
surmises about the nature of the dynamical solutions 
in the much more complicated geometry of the earth’s 
core. 

Perhaps the best we can do for the present is to try 
to understand in what ways some of the simplest 
imaginable flow patterns can produce a magnetic field, 
and then try to develop some feeling for how general 
such effects may be for other flow patterns which we 
can only vaguely anticipate. In particular, after ap- 
preciating that there are certain details of a fluid flow 
pattern that tend to build up a terrestrial magnetic 
field to a point where it will absorb all the energy 
available and thus limit the velocities in the flow 
pattern, we may propose that the flow paltern as it 
evolves, accelerated by an energy source, will be very likely 
to encounter a stage in which it has field-producing 
features, at which point the evolution will be halted by the 
field’s dissipation of the energy. Without settling many 
of the details such a discussion can at least make it 
seem plausible that the earth should through most of 
its history be endowed with a magnetic field. 

The transiMission and reflection of longitudinal and 
transverse seismic waves at various depths have made 
possible the picture of the earth’s interior as consisting 
of a core, mainly fluid but containing a solid inner core, 
surrounded by a thick, essentially solid mantle and a 
thin crust of familiar rocks. Comparison with meteorites 
and astronomical indications of the moment of inertia 
suggest that the core is largely iron and nickel, 
the mantle mainly the mineral known as olivene, 
(Mg, Fe):SiO;. The seismological and other evidence 
has been reviewed in these pages by Elsasser (1950a).* 
Secondary evidence that there must be some fluid within 
the earth, having motions more rapid than would be 
expected in plastic flow, is found in the secular varia- 
tion of the earth’s magnetic field, and the fascinating, 
age-old problem of accounting for the existence of 
terrestrial magnetism is thus naturally associated with 
the existence of a fluid metallic core. 

Because the magnetic needle has pointed nearly 
north as long as it has been known, we have the im- 


. pression that the earth’s field is steady. Because the 


* References will be found at the, end of the article listed in 
alphabetical order. 
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line of zero declination was in Europe 400 years ago 
and is in America now, we have an indication of a fairly 
long-term drift of the detailed magnetic pattern, While 
there is order-of-magnitude agreement between this 
average long-term drift and the present rate of drift, 
the agreement is not close enough to, indicate a steady 
rate of drift and indeed the rate seems to be varying. 
While the gross features of the earth’s field are similar 
to those of a dipole near the center inclined slightly 
to the earth’s axis of rotation, the deviations from such 
a dipole field may be attributed to several dipoles or 
current loops about on the surface of the core, and the 
whole pattern has a general westward drift (Vestine 
et al., 1947; Runcorn, 1950). The shape of the pattern 
is itself not steady, but is characterized by localized 
regions of rapid change lasting several decades, the 
most violent one at present being near South Africa. 

A longer look at the variations of the field is provided 
by a study of the magnetism of successive layers of 
sedimentary rocks and of lava flows, which suggests 
that the earth’s main dipole field, while maintaining 
its axis close to the earth’s axis, may have changed in 
direction at intervals of several tens of thousands of 
years. This reversal phenomenon is still disputed 
(Graham, 1953) because of the possibility of reversing 
the magnetism of some complex rocks by heating, but 
it seems possible to establish the reversal by observation 
of selected simpler rocks (Hospers, 1951; Runcorn, 
1954) so it seems at present quite likely that reversals 
in polarity of the earth’s field have taken place. The 
rock indicator saturates and it is thus not possible to 
follow the magnitude of the field. Within this limitation, 
the complete reversal seems to take place within three 
thousand years, so the field does not appear to vanish 
for any extended periods. 

The problem is thus to account not for a steady 
field but for one which may gradually vary, remaining 
correlated with the earth’s axis of rotation but perhaps 
even changing polarity. As for the westward drift, our 
knowledge of its westward direction extends ever only 
part of a thousand years so it might, for all we know 
experimentally, also change direction. Since the ob- 
served drift is westward, it is gratifying that there isa 
satisfactory explanation of the westward drift, as is 
presented below. It would be embarassing if it were 
eastward. 

In seeking a satisfactory theory of the existence of 
terrestrial magnetism, it is unfortunate, though perhaps - 
a source of the fascination. of the problem, that one 
cannot observe the properties of the materials with 
which one must deal, and is forced to rely on conjecture 
and remote extrapolation from laboratory conditions. 
There is as yet no single, outstandingly successful, 
theory of geomagnetism, and we shall discuss several 
theories, any one of which might be adequate 4nd all 
of which may perhaps contribute significantly to the 
final phenomenon observed. The extent to which com- 
peting theories rely on ‘conjecture concerning the 
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properties of materials must be taken into account in 
judging their relative plausibility. 

The theories discussed depend, respectively, on the 
electric conductivity of the matter in the earth, on this 
and the thermoelectric power, on these and the Hall 
coefficient, and on the Nernst-von-Ettinghousen coefli- 
cient. Of these electric properties, surely conductivity 
is the simplest. The electromagnetic induction theories 
which depend only on the conductivity and the velocity 
pattern thus involve less controversial conjecture con- 
cerning electrical properties than do the other theories. 

All the theories depend, of course, on the various 
properties of the earth’s interior that contribute to the 
complicated flow pattern within the core, and this 
involves a large amount of additional conjecture. Even 
after making assumptions concerning such matters as 
viscosity and the distribution of heat sources, the equa- 
tions of motion are too complicated to have been 
solved. One must rely on analogy and careful guess- 
work in selecting plausible flow patterns, and beyond 
this in estimating the probable magnetic effects of the 
flow. 

After one has established the plausible existence of 
energy sources sufficient to maintain the earth’s field, 
the outstanding problem in the induction theory is one 
of sign, the demonstration that the inductive reaction 
to an initial field may reasonably be expected to be 
regenerative to build up and perpetuate the original 
field, rather than degenerative to destroy it. The 
polarity of the original field is immaterial so there 
is nothing to preclude occasional reversals with 
changing flow patterns. So far as the theory has yet 
been developed, there is also nothing to preclude the 
development of degenerative flow patterns and con- 
sequent nonmagnetic eras which seem not to occur more 
than briefly. Either the induction is actually regenera- 
tive and primarily responsible for geomagnetism or 
else some other mechanism such as the thermoelectric 
effect makes the primary contribution, but even in the 
latter case, induction of one sign on the other must be 
present and profoundly modify the result, so it is in 
any case desirable to understand the factors con- 
tributing to the induction in seeking an ultimate 
solution of the problem. For these reasons, a rather 
extensive attempt is here made to present some of the 
details of the “dynamo” inductive process, for a typical 
flow pattern, in a comprehensible pictorial form. 


2. Convection and Relative Rotation 
within the Core 


Progressing out from the center of the earth, the 
presumably solid inner core extends to a radius of 
1250 km, the fluid core from there to 3400km, and 
the essentially solid mantle and crust to 6400 km. The 
inner core thus provides an inner boundary for the 
thick spherical shell comprising the fluid outer core, 
but the inner boundary is sufficiently small that in 
some parts of the discussior below it is ignored. 
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There are several sources of relative rotation of the 
fluid core within the mantle. The astronomical ones, 
the precession of the earth’s axis (arising from differ. 
ential gravitational effects on the earth’s ellipticity) 
and the tidal deceleration of the earth’s rotation, have 
been shown by Bullard (1949a, see also Elsasser, 1950b) 
to be negligible and we need not consider them here, 
We need consider only those motions which arise from 
convection in the core of an earth rotating with constant 
angular momentum. 

The traditional source of convection, the one con- 
sidered by Bullard (1949a), is thermal instability caused 
by an assumed radioactivity of the material within 
the core. The evidence from meteors and mineralogy 
indicates that almost all of the radioacitivity of the 
earth is in the crust on the outer surface of the mantle, 
but only about one percent of this in the central body 
within the core, as could reasonably be assumed, 
would provide enough power to maintain the earth’s 
magnetic field. The amount of,heat generated, even if 
stored in the inner part of the mantle without being 
conducted to the outside, would not produce unreason- 
ably high temperatures in the lifetime of the earth. This 
would thus correspond to an earth that has been heating 
up on the inside since its formation. 

Urey (1951) has assembled evidence for the view that 
the earth and other planets were formed by accretion 
of small cold particles with the heavy iron and nickel 
constituents, which are now mainly in the core, originally 
on or near the outside. He suggests that the process of 
settling of the denser materials toward the inside is 
not yet complete, and shows that this settling could 
produce an entirely adequate driving force for the con- 
vection in the core needed to explain geomagnetism, 
at the same time providing a source of heat. The inward 
flow of matter is thought to be of the order of 10 g/sec 
and the energy developed 10" erg/sec. 

The inward transport of angular momentum is caused 
both by the inward transport of matter provided by 
this settling mechanism (which in itself tends to make 
an eastward drift of the matter in the core relative to 
the mantle) and by the exchange of matter between the 
inner and outer parts of the core in the course of any 
gravitationally driven convection (including thermal 
convection). The exchange of matter in convection is 
sufficient for our present discussion. We shall return 
to consideration of the smaller effect of the secular 
transport of matter when we discuss the problem of the 
observed westward drift. 

In a rigid body, a gram of matter near the axis of 
roation has less angular momentum than a gram 
farther from the axis, their angular velocities being 
the same. In a rotating nonviscous fluid the exchange 
of matter between these regions by convection would 
be characterized by conservation of angular momentum 
of the bits of matter, which would make the average 
angular velocity of the matter near the axis greater 
than that further out. The same thing may be said 
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in terms of Coriolis force: In the convection in the earth 
rotating eastward, a stream of fluid moving inward 
along any radius (except one along the axis of rotation) 
experiences a Coriolis force toward the east, and an 
outward stream a Coriolis force toward the west, 
making the parts of the fluid near the axis tend to 
rotate more rapidly toward the east than those farther 
out. 2 

So long as there is gravitational instability, such as 
would arise from an internal source of heat in excess of 
that which can be carried away by conduction or from 
the settling of a dense constituent of matter as men- 
tioned above, convection may be expected to take place 
perhaps involving approximately radial streams of flow 
and flow along the spherical surface of the fluid from 
the outward to the inward radial streams. These 
features may be constantly changing and may 
be accompanied by other complexities. A simple con- 
vection pattern involving two downward streams along 
the polar axis and twojupward streams on the equator 
is shown in Fig. 1. A similar pattern of four approxi- 
mately radial streams, but all of them on the equator, 
is shown in Fig. 2. 

Whether the flow will take place in a large-scale 
convection pattern such as one of these or in a much 
more fine-scale turbulence depends on the relative 
magnitudes of various forces influencing the motion. 
In particular, we must consider in the equation of 
motion the following terms: 


(1) (2) 
pv=—Vp+e(g+oX[rXe]) 
3) © (5) 
+2p0Xv+JIXH-nv XLV Xv]. (1) 


The balance between the pressure gradient (1) and the 
gravitational term (2) provides the driving force for the 
convection, either through the interplay of temperature 
and thermal expansion to provide a temperature 
gradient in excess of the adiabatic gradient or through 
a supply of a denser (and perhaps more compressible) 
material at the top of the fluid. The Coriolis term (3) 
in a part of the fluid rotating about the central axis 
with angular velocity w provides a deflecting force in the 
direction which would seem to favor turbulent circula- 
tion in the sense opposite to w. The magnetic term (4), 


N N 


Ge 
CY 


(a) (b) 


Fic. 1. A simple but implausible convection 
pattern in a polar section. 
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Fic. 2. A simple convection pattern in the equatorial plane. 


in which permeability has been taken equal to unity 
because the mechanism we are discussing is not a 
ferromagnetic one, represents a magnetic drag on the 
motion of conducting matter through a magnetic field 
because of the interaction of the field H with the eddy 
currents J which it induces. The viscous term (5) 
ordinarily helps to maintain order in a fluid, since small- 
scale turbulence increases the importance of the deriva- 
tive operator V and thus increases the force necessary 
to drive the motion against this term. Without a 
magnetic field, turbulence ordinarily sets in when the 
ratio of the left member (pi in the preturbulent 
motion) to term (5), or rather twice this ratio, which 
may be written dimensionally 


2pi/(nv/P) = 2plo/n=R (2) 


is very large, greater than 2000. R is well known as the 
Reynolds number. The known or plausible magnitudes 
of these quantities for the core of the earth are roughly 


p=10 g/cm’, the density of the core, 
]=2 10% cm, the thickness of the fluid core, (3) 
n=10-? g cm™ sec, 

an estimated coefficient of viscosity, 


and these would make R~2000 for the very 
small velocity »~5 10° cm/sec. Since velocities about 
107 times as large as this are needed merely to explain 
the mobility of the magnetic patterns on the earth, as 
is discussed further below, the motion would, without 
a magnetic field, have a very small-scale turbulence. 
A similar criterion for the expectation of turbulence 
in the presence of a magnetic field may only be stated 
even more roughly and qualitatively, and without the 
same empirical foundation. In the hydromagnetic case 
in which the conductivity g is great enough to provide 
all the current demanded™by the natufal complexity 
of the field H, turbulence may be expected in general 
to twist the magnetic field into a pattern of a com- 
plexity comparable with that of the turbulence and 
represented by the same characteristic length L In this 
case we may put the current density J equal to 
yXH/4r and the analog of the Reynolds number is 
the ratio of the first member to term (4) of Eq. (1): 


pi/ | HX VX BY/4er| = 4rp0?/ HP. (4) 
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This is, incidentally, the ratio of kinetic energy to 
magnetic energy per unit volume. Unlike the Reynolds 
number, it does not contain the characteristic length /. 
If in rough analogy with viscosity we take 10° as the 
critical order of magnitude of this number, with 7=4 
gauss (the presumed value at the surface of the core) 
and p=10 g/cm’, this allows v to become as great as 
10 cm/sec before we might expect turbulence to set in. 
(If we take not 10° but 10 we get 1 cm/sec, which is 
just as good for our rough purposes.) With lower con- 
ductivity, the lines of M are not twisted so much and 
the amount of VY XH that is developed is limited by 
the current J which, if we assume that the velocity 
pattern v is irregular enough that a return circuit is 
easily found, we may express roughly J~ov XH. Equa- 
tion (4) provides the relevant criterion unless 


ovXH<vVXH/4r or 4rovl <i, (5) 


[where V XH and / are characteristic of the turbulence 
in question, as in (4)] in which case we have instead 
of (4) the larger quantity 


pt/|oHX[vXH]| ~p2/ol H°. (6) 


We may presumably expect to have turbulence when 
either this or (4) exceeds some critical value such as 
10°, that is, either when the magnetic energy involved 
in twisting up the magnetic lines completely in the 
flow pattern is not great enough compared with the 
kinetic energy to impede turbulence, or when the con- 
ductivity is small enough to allow the flow pattern to 
slip through the magnetic field easily enough to permit 
turbulence. With a value of the conductivity typical of 
molten metals, 


a~ 10° ohm cm™!= 10> sec cm, (7) 


the ratio (6) is greater than 10? when /<(6 sec)v, that 
is, for very small-scale turbulence at moderate velocities 
in the earth’s core. Thus we find that unlike viscosity 
a magnetic field may, while inhibiting large-scale turbu- 
lence, permit a small-scale turbulence which is, how- 
ever, in this case so microscopic as not to matter on a 
terrestrial scale. For velocities less than about 1 to 10 
cm/sec we may expect essentially laminar flow in the 
large-scale convective patterns except for turbulence 
on a scale of a few cm or less which “slips through” the 
magnetic field, and which may facilitate the transport 
of heat into the edges of the convective pattern where 
it comes near the boundaries. 
In the earth it thus seenas that the complication of 
a magnetic field may in some ways simplify the flow by 
inhibiting turbulence. Let us now return to the elements 
of the problem by ignoring for a time both H and the 
turbulence. With thermally-driven convection alone, 
we expect upward and downward streams of fluid flow. 
With tlie Coriolis effects arising from rotation, we expect 
the matter on the inside, on the average, to have higher 
angular velocity than that on the outside, corresponding 
to approximately constant? angular momentum per 


y 


INGLIS 


unit volume as the matter changes its distance from 
the axis of rotation. If that were all, we might then 
expect the upward and downward streams to be 
wound into ever-tighter spirals. At some degree of 
twisting we might expect such a pattern to break up, 
perhaps only to start over. 

The problem here suggested is at least as complicated 
as the meteorology of the Earth’s atmosphere, to which 
it has considerable similarity in spite of having different 
boundary conditions, and that problem has long re- 
sisted complete understanding even with rather com- 
plete access for observation. In imagining possible flow 
patterns within the Earth, we must at least make use 
of meteorological experience. Phenomena comparable 
to the jet stream and to the thermal wind with geo- 
strophic flow may be important insofar as they are not 
suppressed by the magnetic field. 

There are also important laboratory experiments on 
the effect of rotation on fluid flow, in some ways 
closely comparable with meteorological experience but 
with characteristic differences arising mainly from the 
different boundary conditions. Unfortunately it has 
not been possible in these experiments to reproduce or 
include a substitute for the spherically radial nature of 
the gravitational field which must be crucial for the 
flow pattern in the earth’s core, but with due allowance 
for this and other shortcomings the experiments are 
still extremely suggestive. 

The classical experiments of G. I. Taylor (1921, 1923) 
showed that with constant density and small viscosity 

` [terms (4) and (5) of Eq. (1) absent, term (2) opposed 
by the static part of term (1), and term (3) dominant] 
there is a remarkable apparent rigidity of the fluid 
in the direction parallel to the axis of rotation (z-axis). 
For simplicity of explanation, let us consider the in- 
compressible case in which in Eq. (1) the curl of term 
(2), as well as of term (1), vanishes, the force field in 
(2) being the gradient of a potential. With œ a constant 
vector in the z-direction, taking the curl of the remain- 
ing terms of Eq. (1), the left member and the Coriolis 
term (3), gives 


3V XV=VX[oXv]=(o-V)v=—w(d/dz)v. (8) 


In the first member, the total time derivative of the 
velocity of an element of matter relative to the rotating 
axes is 


V=dv/dl-+(v-V)v, ` (9) 


of which the term ðv/ðt vanishes for steady flow. The 
remaining term is quadratic in v so for steady slow 
motions (that is, slow compared to the linear velocities 
induced by w in a space-fixed frame) we have (0/02)¥ 
~0, indicating that the velocity pattern is not a func- 
tion of z, or that line elements parallel to z move rigidly- 
With a horizontal boundary, as in a flat-bottomed 
vessel, we may then expect two-dimensional flow with 
v-=0 throughout. This Taylor confirmed by moving @ 
short cylinder about in a rotating dish of liquid and 
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watching streams of ink flow around its projection in 
another plane. The dish had a flat bottom. Presumably 
a sloping bottom would impede the motion in the same 
way as do the vertical sides of the cylinder. 

The flow in the atmosphere and perhaps that in the 
earth’s fluid core are driven by thermal convection, 
and recent very interesting experiments by Hide (1953) 
at Cambridge study the flow of a rotating liquid driven 
by _thermal convection. These experiments and a 
variety of related ones have also been carried out by 
Fultz and his associates at the University of Chicago. 
Hide used a deep flat-bottomed vessel rotating about a 
vertical axis with the liquid confined between inner and 
outer cylindrical boundaries as shown in Fig. 3(a), the 
inner boundary being hot and the outer cold, or vice 
versa. With slow rotation, the convective pattern 
would be a toroidal loop up near the inner wall, for 
example, down along the outer, and the outward flow 
along the top surface was observed to be not radial 
but spiral as a result of the small Coriolis term. With 
slightly more rapid rotation, however, the flow is 
observed to become horizontal, displaying a character- 
istic two-dimensional flow pattern to transport heat 
between the inner and outer cylinders. Aside from 
viscous end effects near the bottom, these patterns are 
of the same general shape in all horizontal cross sections, 
with velocity within the pattern varying with height. 
The orientation of the pattern may also vary with 
height in a spiral fashion forming “fronts” leaning for- 
ward. Here the conditions differ from Taylor’s, the 
inhomogeniety being essential to the thermal driving 
force of the convection, and dv/dz is definitely not zero. 

Most of Hide’s photographic observations show only 
the top surface pattern, being made by photographing 
a floating anise@tropic power. A typical pattern is shown 
in Fig. 3(b), and it is immediately seen that it differs 
very essentially from the simple imagined convection 
pattern of Fig. 2 in having a rather narrow “jet stream” 
which, while weaving radially inward and outward as 
does the pattern of Fig. 2, does not loop back on itself 
but continues to circulate in the same sense around 
the central axis and joins up with itself only after it has 
gone the whole way around [see Fig. 3(c) J. The fairly 
small fraction of the fluid constituting the jet stream 
moves rapidly, and the sectors of fluid isolated in the 
loops of the jet stream are impelled to rotate in both 
senses in the manner of idler wheels, but relatively 
slowly (the outer ones frequently hardly rotate at all) 
so that together they account for about as much radial 
flow of matter as does the jet stream. The narrowness 
of the jet stream may perhaps be accounted for in these 
experiments by the limited conduction of heat across 
the streams where they contact the hot and cold 
boundaries (though the corresponding meteorological 
phenomenon appears to be more dynamic in origin). 

The explanation of this flow and the sense of flow, as 
given by Hide, has much in common with the meteoro- 
logical theory of the “thermal Wind.” It is dependent 
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(c) íd) 


Fic. 3. Laboratory experiment exhibiting 
convection with a jet stream. 


on having gravity (or a strong component thereof) 
parallel to the axis of rotation. In the rigid rotation 
(about a vertical axis) of a dish of homogeneous fluid, 
there are equipotential surfaces of the potential giving 
rise to the gravitational-plus-centrifugal-force term (2), 
the parabolic free surface being one of them. We may 
discuss the flow in one of them, or for simplicity in 
moderately slow rotation we may temporarily neglect 
the centrifugal force and consider the flow in horizontal 
planes. 

Along the bottom of the vessel where the flow 
vanishes the pressure is then approximately independent 
of radius. With the inhomogeneity introduced by 
heating the outer cylinder, the lowered density due to 
thermal expansion up along the outer cylinder makes 
the pressure decrease less rapidly there than up along 
the cold inner cylinder, so that any level above the 
bottom (or more accurately on any paraboloid that was 
an equipotential surface before the thermal expansion) 
the pressure is higher at the outer boundary than at 
the inner. This requires a genera! circulation in the 
same direction as the rotation so that the Coriolis 
force may support the pressure difference (adding to 
to the effective centrifugal force), and indeed the jet flow 
is observed to be “forward” with the outer boundary 
hot, cold, or “backward” with the temperatures re- 
versed. The patterns drift in the direction of the jet 
flow, as is discussed further below. This is a model of the 
temperate-zone circulation, with the arctic cold béund- 
ary at the center and horse-latitude warm boundary 
outside, in which the prevailing winds and weather 
patterns indeed move eastward, or “forward.” 
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Since that “thermal wind” explanation accounts in a 
general way only for the direction of circulation, we 
may further consider the effect of the centrifugal force 
in impelling the jet flow. With the hot boundary outside, 
we have in a horizontal plane the centrifugal-force 
analog of a thermally unstable atmosphere, hot on the 
outside toward which centrifugal force is pushing it, in 
a way to drive any circulation which interchanges the 
hot and cold portions of the fluid. We may say that the 
warmer and thicker inward stream “floats” toward the 
inside more insistently than does the colder and thinner 
outward stream. We really have no reason to expect 
the flow to be so accurately horizontal as this, and 
indeed we run into trouble if we continue to expect this 
in the case with the temperatures reversed, for we see 
that the fluid is then stable with no flow. It is, however, 
unstable relative to a sort of spiral motion that de- 
parts from the horizontal plane, a compromise be- 
tween the circulation in planes through the axis it 
would have without rotation and the tendency toward 
horizontal motion introduced by rapid rotation. The 
vertical sheet of fluid that constitutes the jet is then 
presumably becoming thicker and thinner at various 
parts as a result of fairly small vertical components 
of motion within it, but still larger than permitted at 
the upper free surface and especially at the lower 
boundary. The “idler wheel” regions are long rollers 
in which a considerable degree of vertical rigidity is 
provided by the Taylor effect, though this is no longer 
completely effective in an inhomogeneous fluid. 
Let us now consider why the general flow pattern has 
a drift in the same circulatory direction as that in which 
the jet flows. To understand the steady state of the 
“thermal wind” we need only note that while most of 
the temperature drop is across the jet stream, part of it 
is across the “‘idler wheel” regions, so the same argument 
applies to their drift as to the jet flow. One may still 
attempt to trace some of the torques involved in 
achieving this steady state. We may make the simpli- 
fying assumption that the main motivating force of the 
convection acts on the jet flows, so the adjacent regions 
are truly “idler wheel.” (The neglected thermal drive 
of the “idler wheels” merely increases the force of the 
argument.) If the pattern had no drift in the rotating 
coordinate system, the direction in which it would 
start to drift would be determined by the total frictional 
torque acting on the fluid as it flows past the inner 
and outer boundaries. With the direction of the jet 
flow taken as “forward, the frictional force exerted 
on the jet where it rubs past the boundary is backward, 
whereas that exerted by the surface on each of the 
“idler wheels” is forward, since they are rolling over 
backward where they make contact with the surface. 
If the flow pattern is essentially stationary, the torque 
exerted on an “idler wheel” by the boundary is equal 
and opposite to that exerted on it by the jet stream 
wrapped around its other side. The shape of the pattern 
appears to be such that te jet stream makes effective 
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contact with the “idler wheels” (which are actually 
deformable columns of fluid and not really circular) over 
a much greater fraction of its length than it does with 
the boundary, and we may therefore expect the fric- 
tional drag between jet stream and “idler wheels” to be 
greater than that between jet stream and boundary, 
Thus the torque exerted by the boundary forward on 
the “idler wheels” is greater than that exerted backward 
on the jet stream, and the resultant forward torque 
would start the fluid pattern drifting forward. Alter- 
natively, we may think of the pattern as having 
achieved a steady forward drift which reduces the 
frictional drag on the “idler wheels” and increases that 
on the jet stream, making the two equal. 

In the meteorology of the temperate and polar zone 
of the earth’s atmosphere, the isobars are approxi- 
mately lines of flow because of the phenomenon of 
“seostrophic flow,” or the effect of the important 
Coriolis term in orienting the pressure gradient normal 
to the flow lines. The daily charts of isobars of the 
Northern Hemisphere (at the “500-millibar level”) 
present a varying pattern complicated by the geography 
of the continents, but they frequently include an irregu- 
lar multilobed jet-stream pattern vaguely similar to 
Fig. 3(b), and in his thesis Hide is able to present an 
unusually symmetrical weather chart carefully selected 
to be strikingly similar to a pattern observed in the 
laboratory. If we wish to assume that the two patterns 
have the same cause, the polar region and the equa- 
torial belt taking the place of the cold and hot bound- 
aries, we must conclude that the varying vertical 
component of w through the temperate zone apparently 
does not drastically effect the pattern. There is, how- 
ever, a theory of the progressing waves advanced by 
Rosby, which depends essentially on tkis variation of 
wn and it may be that both causes contribute to the 
observed result. 

Figure 3(b) illustrates a simple, steady, three-lobed 
flow pattern, but Hide also finds more complex behavior. 
As w increases the number of lobes increases. Under 
some conditions he also observes a characteristic 
periodic change in the nature of the flow pattern which 
he calls “vascillation.” Starting with a flow similar to 
Fig. 3(c), the jet stream seems to break off at each of 
the points where it approaches the inner boundary and, 
instead of proceeding forward along that boundary, 
winds back around the inner “idler wheel,” as shown m 
Hide’s photograph Fig. 3(d). After this it,returns to the 
original pattern with a continuous jet stream, and the 
cycle is repeated. With a steady flow similar to Fig. 3 (c), 
the “idler wheels” are so thick compared to the jet 
stream that it seems there would not be time for heat 
absorbed from the outer wall to penetrate well into the 
“idler wheel” and out again toward the inner wall ha 
a revolution later. Thus the outer “‘idler wheels” tend 
to remain hot and the inner ones cold, creating local 
instabilities. Vascillation apparently constitutes an 
equalization mechanisfa whereby the matter constitu- 
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ting the jet stream is periodically exchanged with that 
constituting the “idler wheels.” It obviously provides 
a wealth of opportunity to discuss variations in the 
pattern of the earth’s magnetic field. 

Enough has been said to suggest that rather com- 
plicated flow patterns may be expected in the earth’s 
interior. It is possible that they may have some slight 
similarity to some of those described here, but the 
complications introduced by the magnetic field, as well 
as the spherically radial gravitational field, will surely 
make serious alterations. Both the magnetic field and 
the spherical boundaries will greatly alter the influence 
of the Taylor effect. For the sake of trying to explore 
and explain the types of electromagnetic phenomena 
that may arise, we shall at first ignore most of the 
hydrodynamic complexity and undertake a lengthy 
discussion of the simple flow pattern of Fig. 2, realizing 
that it may have in common with the actual flow no 
more than the existence of inward columns of fluid 
flow at some azimuths, and outward columns at others. 


Il. INDUCTION THEORIES 
3. Development of the Dynamo Theory 


An early suggestion of Larmor (1919) that the 
earth’s magnetism might arise from electromagnetic 
induction in the fluid motions within the earth’s core 
was inadequate (Cowling, 1934) in the detail of its 
original form. From it there have developed two 
essentially separate induction theories, the induction 
or “dynamo” theory of Elsasser and Bullard which we 
consider first, and the “twisted kink” theory of Alvén 
which is discussed below (Section 11). Elsasser (1946- 
1947) initiated the development of the dynamo theory 
by pointing out the importance of the toroidal field, and 
later Bullard (1949) suggested a possible way of com- 
pleting the regenerative cycle, into an at least somewhat 
plausible theory of which the main outlines are fairly 
clear. The quantitative details remain as yet incom- 
pletely considered and are clouded by our ignorance of 
the physical properties of the earth’s interior. The main 
appeal of the theory in the form in which Bullard has 
tentatively proposed it, is that almost the simplest 
imaginable convective motions, those of Fig. 2, within 
the fluid core of the earth seem as though they would 
rather probably give rise to inductive effects, if they 
could be calculated, of the correct sign to regenerate 
an accidental initial field and bring it up to a magnitude 
determined by the power provided by the driving forces 
of the convection. In this latter respect the proposed 
inductive effects are quite similar to those in a series- 
wound dynamo when the polarity of the connection 
between the field coils and the armature brushes is 
properly suited to the direction of rotation. The dynamo 
postulated within the earth is more complicated than 
a simple series-wound generator, as we shall see in 
detail. It is more nearly similar to a generator equipped 
with an auxiliary or field generator, in which the 
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current from the armature of the main generator feeds 
the field coils of the auxiliary generator and the arma- 
ture of the auxiliary in a regenerative fashion feeds the 
main field coils. One of the main problems is to show 
that the convective currents in the earth are “hooked 
up” in a regenerative, not a degenerative fashion. With 
a pair of terminals reversed, such a system becomes de- 
generative and the currents and magnetic fields vanish. 
A start on the problem of the existence of terrestrial 
magnetism can be made by trying to understand what 
types of convective patterns and what types of sub- 
sequent inductive effects seem to contribute to regenera- 
tion. The problem is much more complicated than the 
system of generators because the earth’s core is pre- 
sumably built without any insulators and the possible 
current loops analagous to the field coils are shorted out 
by alternative paths in parallel, making it necessary 
to estimate which paths are most important in con- 
tributing to the final regenerative or degenerative result. 

Attempts at mathematical analysis of the inductive 
effects are extremely difficult and appear as yet not to 
have yielded a definite result, in spite of valiant efforts 
by Bullard and his co-workers to find a solution of 
Maxwell’s equations for an assumed flow pattern. The 
trouble seems to be that, even after one has, within 
the simplified general pattern of Fig. 2, selected a 
sample detailed radial dependence of the velocity dis- 
tribution, it is necessary in a power series expansion 
to make a sample selection of the terms in the expansion, 
and one cannot be sure that the sampling of the terms is 
adequate. It is to be hoped that further progress along 
those lines may be made. 

In order to see what the nature of the inductive 
process may be, and what local features of it may be 
most important, we here resort instead to a graphic 
approach in which by means of a model we attempt to 
trace the electric and subsequent magnetic effects of 
various volume elements of the flow pattern. 


4. The Simple Generators ° 


By way of preparation for a discussion of the magnetic 
induction effects within the earth’s fluid core, let us 
review the workings of the simplest model of a con- 
ventional series-wound generator, as depicted in Fig. 
4(a). It is a device for regenerating and amplifying an 
initial accidental field (which is usually supplied in 
practical generators by the remnant magnetization of 
a ferromagnetic environment not present ip this simple 
model). It consists of two loops of wire, one rotating 
within the other, connected to one another by means 
of brushes on a commutator arranged to maintain the 
correct polarity of the connection. With the commutator 
connection as shown, it will regenerate only if rotating 
in such a way that the top of the rotating coil is moving 
into the paper. Then an accidental field H, downward, 
for instance, induces an electromotive force E=vX H; 
toward the left in the top ald toward the right in the 
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Fic. 4. Two simple generators. 


bottom of the rotating coil, and thus a current J which 
continues in such a direction through the stationary 
horizontal coil to produce a downward magnetic field 
in the same direction as the original accidental field. 
Whether it is large enough to regenerate the original 
field depends on the conductivity o of the wire, the 
angular velocity w and the dimensions. An order- of- 
magnitude estimate suffices for illustration, in which 
the inhomogeneity of H and numerical factors near 
unity are neglected. The electromotive force is about 
Ea=avH =wa?H, the resistance is about a/cd?, so that 
the induced field may be written 


Ho~J/az~ (wa?) (cd*/a)/a=wod?H. 


[The same thing may be taken directly from one of 
Maxwell’s equations 


V XH=47J=cvXH, 


since VX# in the stationary wire is of order of magni- 
tude H,/d, and H., the tangential field at the edge of 
the wire, is about equal to (a/d)H».] The condition for 
H»= H; leading to successful regeneration is thus that 
w=1/(cd?). With c=10-‘cm™ sec, d=1 mm, as in 
wire used in ordinary genérators, this requires w =~ 106 


o A 


r 
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sec™!, which seems rather fast. However, in an ordina 
generator, this figure is cut down by perhaps a factor 
of 1000 by increasing the number of turns, and a 
factor 1000 by use of a ferromagnetic circuit, and then 
increased by a factor 100 by introducing an externa] 
resistor, giving a reasonable angular speed of 100 sec, 
Such a series winding is, of course, not the usual one, 
Without the ameliorating effects of ferromagnetic 
saturation, it has the characteristic that it does not 
regenerate until it reaches the critical speed, then uses 
all the power available without permitting that speed 
to be exceeded, since beyond that speed the field would 
successively amplify itself to infinity. 

The critical angular speed scales as 1/d*. In a con- 
ducting circuit within the fluid core of the earth, if we 
had a commutator there, the conductor would be very 
fat with d+ 108 cm= 10° mm. Thus the estimated critical 
angular speed would be reduced by a factor 10!8 to 
about 10-” sec", implying linear speeds of the order 
of 10-4 cm/sec. This gives hope that a considerably less 
efficient generator might require speeds 100 or 1000 
times as great, such as seem to be required by the rate 
of westward drift of the nondipole geomagnetic field. 

Another simple generator is shown in Fig. 4(b). In it 
the commutator is replaced by the even simpler sliding 
contacts on the axle and on the edge of the rotating 
disk. It functions by the same principle of electro- 
magnetic induction, which in this case amounts to the 
fact that the magnetic field in the stationary coordinate 
system transforms into an electric plus magnetic field in 
the coordinate system of the conducting material which 
makes up the rotating disk. It, too, regenerates a stray 
magnetic field only if it rotates sufficiently rapidly in 
the sense indicated. 


5. Drag of Magnetic Lines in Moving Conductors 


The induction effects implied by the Maxwell 
equations 


vXH=4r) =400(E+vXH). v-H=0. (10) 
vxXE=—-H v-E=4mp, (11) 


bring it about that magnetic lines of force tend to stay 
with the material in a good conductor, lagging behind 
the material at a rate determined by the imperfect 
conductivity. A simple and instructive example of this 
phenomenon is provided by the consideration of three 
conducting infinite slabs moving transversely to an 
exterior applied field H as indicated in Fig. 5. The 
motions of the slabs are in the vertical direction z, and 
the applied field is in the horizontal direction y. Every- 
thing is uniform in the «z-plane, and the only non- 
vanishing derivative is d/dy. A steady state is assume 
without accumulated charges p, and from Eq. (11) we 
take E=0. From Eq. (10) we then have 


dH,/dy=0, H,=const, (12) 
and 
EL Nth Peas hl pa (13) 
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which has opposite sign in the adjacent slabs. The lines 
of force are thus parabolas, joined at the boundaries 
as indicated to satisfy the boundary condition of a 
horizontal field outside the conductors, which also 
requires that the velocities v and vı be related to the 
thickness of each outer slab a, the half-thickness of 
the inner slab b, and their respective conductivities 
Ta, On, thus? 


02/01 =0 10/00), (14) 

and that 
(Hz) max = 4a yo wa = 4r Tyo tod (15) 
Z= 2Iroxb(a-+b), (16) 


where Z is the maximum over-all lateral deflection of a 
line of force. As the velocities and the conductivity in- 
crease, the lines of force are dragged increasingly far 
downward and crowded together more nearly parallel 
to the interfaces between slabs as H, increases. The 
force per unit area of the magnetic drag acting between 
the slabs, is the numbe? of lines of force crossing the 
unit area times the z-component represented by one 
of them, or 


F=H,(H,) max=4r0H,pva=4roH,00b, (17) 


while no vertical force is exerted across the outer 
boundary of the outer slab with the particular boundary 
condition we have imposed. 

It is sometimes convenient to think of the magnetic 
lines being dragged by the moving conductor as a result 
of the first and second members of Eq. (13), without 
reference to the current, and sometimes convenient 
to think of the current induced by the motion according 
to the second and third members of the same equation, 
then to think, of the secondary field, in this case H., 
induced in turn by this current according to the first and 
third members of Eq. (13). The two procedures are, of 
course, equivalent and we shall use them interchange- 
ably. The former procedure, ignoring the currents, is 
convenient for problems of hydromagnetics where the 
conductivity is effectively infinite because its greater 
simplicity facilitates intuitive reasoning and makes 
possible the treatment of some problems that otherwise 
would not have been undertaken. In the present applica- 
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Fic. 5. Bending of lines of force by ‘moving conducting slabs, 
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Fic. 6. A fanciful merry-go-round as a model for the fluid 
velocities in the earth’s core which are considered important to 
the inductive process in the dynamo theory. 


tion the conductivity is finite and this procedure in- 
volves curvatures in three dimensions which may to 
some readers seem less tangible than the lengthier dis- 
cussion in terms of currents. For this reason and for 
the historical reason that it has not been presented 
elsewhere the hydromagnetic approach is postponed to 
Sec. 10, and we first undertake to understand the 
regenerative process in the procedure that involves a 
multiplicity of currents and frequent recourse to a 
familiar right-hand rule for the sign of the induction. 


6. Visual Aids for Describing the Velocities, 
Currents and Fields Involved in 
the Dynamo Theory 


The earth may conveniently be described as a 
magnetic merry-go-round, a rather complicated car- 
rousel. Such a caricature is, in fact, a useful pedagogical 
tool in attempting to picture the relative three-dimen- 
sional motions postulated in a combination of con- 
vection and shearing rotation within the earth’s fluid 
core, and in visualizing the four successive steps of 
magnetic induction to which they give rise in the re- 
generative process of the dynamo theory. It seems 
desirable to be more’ specific concerning the question, 
“What moves relative to what?’’, than previous pub- 
lished remarks have been (Bullard, 1949), and sub- 
sequently, “What current induces what field where?” 

The carrousel shown in Fig. 6 helps to picture and 
point out the places and relative motions. It represents 
the northern hemisphere only, and would have to be 
mirrored in the equatorial plane (containing B and F) 
to represent the interior of the whole earth. The top 
T and the base B and the posts that connect them repre- 
sent a conducting spherical shell within which rotates 
a fluid sphere. The fluid sphere with convection currents 
similar to those of Fig. 1 is represented in Fig. 6 by the 
rotating machinery inside. (The slightly more com- 
plicated machinery representing the convection pattern 
of Fig. 2 is discussed later, at the end of this section.) 
The floor F of the carrousel, driven by the pulley above, 
rotates in a direction indicated by the arrow vı. (We 
shall see that the result is no, affected by this arbitrary 
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Fic. 7. First step in the inductive process. 


choice of the conventional positive angular velocity 
vector, upward along the z-axis.) The thick cylindrical 
wheel IV is mounted on F by a framework and is driven 
by the drive-wheel D which rubs on its lower edge and 
on the inside of the base B. The wheel W thus rotates 
about a horizontal axis in the sense shown by the 
arrow v while it rides around on the carrousel. The 
velocity vı thus represents the rotation and və the 
convection. 

The fluid core of the Earth is, of course, a continuous 
conducting medium, providing conducting paths in 
all directions everywhere. On the carrousel, by means of 
rods and sliding contacts, we have pictured those 
typical conducting paths to which we wish to call 
attention for purposes of explanation. The possibility of 
pointing them out in this way facilitates subsequent 
discussions of the relative importance of various paths 
in the regenerative process. 

Figure 7 shows the first of the four steps in the re- 
generative process. The accidental initial field Hu, 
which we wish to regenerate, is assumed downward to 
correspond to the situation in the earth. This is quite 
arbitrary and a matter of definition: The earth’s north 
pole is a magnetic south pole because it attracts a 
“north-seeking pole,” or north pole of a compass, which 
defines the direction of a magnetic field outside the 
earth. Inside the core of the earth we have the return 
circuit from geographic north to south. The velocity vı 
of the floor F traversing the field H, induces the electric 
field £Z,=2;XH, which gives rise to the current J; 
through the complete circuit indicated, across the floor 
F, up the central shaft, across the top 7, and down the 
outer post and through a sliding contact between B 
and F. This«is, of course,-typical of the current at all 


Fic. 8. Dragging of lines of force 
into a*toroidal form, result of the 
first_step. 
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four center posts in the carrousel, or, indeed, at all 
azimuthal angles in the earth, and gives rise to the 
“toroidal” field H which forms closed circles about 
the central axis, eastward in the northern hemisphere 
(Fig. 7) and westward in the southern hemisphere, 
This is the step that was pointed out and emphasized 
by Elsasser (1947), and is fundamental to any process 
of inductive regeneration. The vector sum of M, south- 
ward and Hp» eastward is a field in a general south- 
easterly direction, so the net result of this first step is 
to drag the lines of force around in a spiral toward the 
south east in the northern hemisphere and to let them 
spiral back toward the south west in the southern 
hemisphere, as suggested in Fig. 8, which is taken from 
Fig. 16 of Elsasser’s review (1950a). This represents a 
solid sphere rotating within a conducting spherical 
shell, very similar to its two-dimensional analog 
discussed above in connection with Fig. 5. 

The second step of induction is shown in Fig. 9, The 
toroidal field H» passes through the conducting wheel 
W and in the outer part of this wheel the upward 
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Fic. 9. Second step in the inductive process. 


velocity və traversing Hə induces the inward electric 
field E» (in the coordinate system of the conductor W) 
which drives the current J» in such a way as to produce 
a magnetic field H; downward on the right side of W 
(and also upward on the left side of W, though this is 
not indicated in such detail). The circuit of J2 rides 
around on the carrousel, and H; is a magnetic field in this 
moving coordinate system. It loops up through the top 
T on the left side of W and down through it on the 
right side as indicated. 

Figure 10 shows the third step. The coordinate 
system of H; moves eastward with a velocity repre- 
sented at some radius by 2. An equivalent way O. 
saying this is that the top T moves with the reversed 
velocity similarly represented by —1 relative to the 
coordinate system of H3, and thus traverses H3 toward 
the left in such a way as to induce an outward electric 
field E; at a point in the top T to the right of the wheel 
W, where the vertical component of H is downward 
as indicated. This produces eddy currents J, in loops 
both ways around this region, but the important loop 
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Fic, 10. Third step in the inductive process. 


is the one which includes the inward E; over on the left 
side of W. This loop produces a downward H4 in the 
central plane of the wheel W. We thus see that, from the 
point of view of an attendant riding on F, the upper 
part of the field H; has been dragged backward 
(westward) in this step.« 

The fourth and last step of the regenerative process is 
shown in Fig. 11 and all four steps are summarized in 
Fig. 12. The field H, is traversed by the outer half of 
the top of W with a velocity v inward, inducing a 
westward electric field E4. This (in series with a similar 
field induced in the similar wheel shown mounted 
opposite and possibly other such wheels not shown) 
drives the current J, around a large loop, which produces 
H; downward, in the same sense as the assumed original 
field H», which is thereby regenerated if the magnitude 
is great enough. We thus see, by comparison with our 
previous discussion of a simpler generator, that this 
curious carrousel would work as a dynamo if large 
enough and driven fast enough. To show that the 
earth’s interior would do likewise requires much 
further discussion, a little of which follows. 

We note that here (and also in the process we have 


Fic. 12. The four steps of 
the regenerative process, as- 
sembled. 


Fic. 11. Fourth step in the inductive process, which 
reproduces the original field. 


described in Section 7) two of the four steps involve 
vı and the other two v2, so the regeneration is quadratic 
in v; and in v and independent of the direction of either. 
Thus the wheel W turning in the opposite way would 
produce the same final current Jj. 

The convection with four currents in the equatorial 
plane, two up and two down, sketched in Fig. 2 is 
roughly represented in the northern hemisphere by a 
carrousel with four wheels W, two of them rotating in 
the opposite sense. This motion sketched in Fig. 2 is 
the case considered specifically by Bullard (1949a) in 
his pioneer discussion of the sign of the regeneration. 
Our discussion of the carrousel which follows his of the 
sphere closely for the first two steps in Figs. 12 (and 
14), is motivated mainly by the impression that his 
qualitative discussion, especially of the subsequent 
steps, is too brief for easy comprehension and much 
less specific than befits the inherent interest of the 
dynamo process. 

Our explicit discussion has been confined to the 
northern hemisphere. The patterns we have considered 
are symmetrical relative to reflection in the equatorial 
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Fic. 13, A dynamo model invoking horizontal shear. 


plane. The fact the regeneration is independent of the 
direction of H; is then sufficient to establish the applica- 
bility of the discussion to the southern hemisphere. 
More explicitly, the vertical component of v2 has the 
opposite direction in the southern hemisphere and all 
other motions are the same, while H> has the opposite 
sign (Fig. 8). These each enter the sequence of four 
steps only once, so there are two changes of sign in the 
chain from H, to Hs, which leaves the sign of the regen- 
eration unchanged in going to the southern hemisphere. 


7. The Compound Carrousel—A Model 
Incorporating Internal Rotational Shear} 


Let us now discuss another pedagogical contraption 
which illustrates another type of relative motion. The 
simple carrousel discussed above may be used as an 
analog of a sphere which is solid so far as its rotation is 
concerned, but with convection within it, rotating 
within a conducting shell, with electrical contact be- 
tween them. In turn, this may be used as a rather crude 
model of the interior of the earth. In a rotating fluid 
fluid sphere experiencing some sort of frictional or 
magnetic drag with its surroundings, it would be 
expected that, at a given distance p from the axis of 
rotation, the fluid near the equatorial plane might 
rotate fastest, that is, there would be rotational shear 
across planes parallel to the equator. 

To represent this shearing motion within the core we 
make our carrousel one step more complicated, as shown 
in Fig. 13. In it the top T, the base B, and the floor F 
are much the same as before, but the drive-wheel D is 


Frc. 14. The second, third, and fourth inductive 
steps involving horizontal shear. 


+ The casual reader may omiț Secs. 7 and 8 without serious loss 
of preparation for following’sections. 
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mounted on a horizontal radial axis which is part of an 
intermediate framework which supports the thick wheel 
W. The drive-wheel D rolls on T and on F in sucha way 
that the intermediate framework, which includes the 
axle of d, rotates about the central vertical axis with 
just half the angular velocity with which F rotates, as 
indicated in Fig. 13 by the vector 311. The drive-wheel 
D also rubs against W in such a way as to impart to it 
the same rotation as before, as represented by the 
typical velocity vo. 

Steps 1 and 2 are the same as before, up to the for- 
mation of Jz and Hs. Steps 3 and 4 are indicated on Fig, 
14. In step 3, the vertical component of H; penetrates 
the floor F downward on the right of the wheel W and 
upward on the left. The floor F moves to the right with 
a rotation indicated by vı relative to the loop in which 
J2 flows to make Hs, and this gives rise to the electric 
field E; inward along a radius of F on the right of W 
and outward on the left. The current J; thus circles 
around the outer part of the,bottom of W in such a 
way as to make the magnetic field Hy upward. (Again 
a matter of the motion dragging H;, this time forward, 
to become H4.) In this field Hy, the outward motion vs 
of the bottom of W asa fourth step induces the electric 
field E; to the left, which again makes J; in the large 
core-encircling loop to produce Hs in a sense to re- 
generate Hy. 

Both of the processes of regeneration suggested by 
Figs. 12 and 14 could contribute simultaneously to the 
magnetization of the earth, along with others and in 
competition with several similar processes of degenera- 
tion, all with this one combination of motions. Bullard 
(1949a) pointed out that there are also several possible 
combinations of motions which may regenerate the 
original solenoidal field. w 


8. Some Other Current Loops 


As yet the sides of our carrousel have been left open 
so that we may see in and also for the historical reason 
that the relative motions thus represented correspond 
most closely to Bullard’s original explanation of the 
regenerative process and its probable sign. The core of 
the earth is surrounded by a sphereical shell which 
extends all around it, not just on the top and bottom, 
so we may improve our carrousel by surrounding it by @ 
cylindrical shell to cover the sides, as shown in Fig. 15 
with part of the cylinder cut away. The motions in the 
interior are the simple ones of Fig. 6 [not compound 
as in (Fig. 13) ]. 

A very important current loop is the one shown 
carrying J, approximately in a vertical plane, giving 
rise to H; inward alongthe cylindrical radius. This Hs 
is dragged toward the west (in the rotating coordinate 
system) making H4 as the result of eddy currents in the 
conducting cylindrical shell as indicated, and in Step 
4 this H; is traversed by the wheel W with v2 upward, 
inducing E, toward the west as before and thus Te- 
generating Hı. This loop for J, together with the one 
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Fic. 15. Some current loops"in- 
volving the equatorial regions¥of 
the mantle. 


shown in Fig. 9 may besrepresented by an oblique one 
emphasizing the outward-and-upward motion of the 
convection suggested in Fig. 2, but the outward motion 
is more important than the upward motion because only 
part of the outward-moving material flows upward, 
while part flows horizontally as it approaches the sur- 
face. Thus the J» of Fig. 15 is expected to be more im- 
portant than the J» of Fig. 9 in the regenerative process. 
(Its prototype in the earth is also geometrically more 
favorably situated, as will be clear after discussion of the 
westward drift.) 

In the carrousel of Fig. 15, the current J,’ arising 
from the motion of the upper half of W circulates in the 
opposite sense, and this is the only reversal in the 
associated four steps, so this constitutes a competing 
degenerative process. The same is true of the inner half 
of W in Fig. 9, though the corresponding current loop 
is not there drawn. These loops in the carrousel do not 
cast serious doubt on the adequacy of the regenerative 
process in the earth because the parts of the motion 
involved do not correspond so directly to motions we 
wish to assume in the earth, in a simplified convective 
pattern similar to Fig. 2, rather than Fig. 1. We use 
the carrousel to help visualize motions within the 
earth, but it also contains some motions which we do 
not want to assume even in a simplified pattern in the 
earth bécause they are so strongly suppressed by the 
Coriolis force. 


9. The Inapplicability of Cowling’s Theorem 


One of the serious objections that may be raised to 
the dynamo theory of geomagretism as here presented 
is that it is too complicated to be called elegant. One 
is requested to twist his wrist several ways and almost 
stand on his head in the course of trying to under- 


stand it. An induction mechanism by means of a 


simpler and more elegant configuration of currents or 
lines of force would be preferable if one could find it. 


A sort of lower limit to the complexity is provided by 
Cowling’s theorem (Cowling, 1934), which was de- 
veloped to show that the very simple induction mecha- 
nism originally proposed by Larmor (in connection 
with sun spots) was too simple to provide a self-sus- 
tained field in the earth, though a comparable simple 
rotation may be of interest in concentrating a field in 
sunspots which is maintained by a more complicated 
mechanism elsewhere in the sun. 

In reporting the induction mechanism outlined above 
as a possible source of the earth’s magnetism, it is 
necessary that we show that it is not subject to the 
contradiction of Cowling’s theorem. It is hoped that 
this discussion of the theorem and its inapplicability 
will also serve to bring the reader into a mood of willing- 
ness to consider a fairly complicated mechanism by 
showing that for an induction mechanism to work, 
some such degree of complexity is necessary to evade 
the theorem. 

Cowling’s theorem in its simplest form refers to the 
situation shown in Fig. 16(a), in which attention is 
directed to a closed ring, indicated by a double line, 
about the axis of cylindrical symmetry of the system. 
Through this ring thread magnetic lines of force H, 
surrounding the ring in closed loops. (The ring is then 
at a maximum or minimum of the vector potential.) 
The theorem states that the currents induced around 
the ring by possible motions of the conducting medium 
in the magnetic field canrot maintain-the field, in 
particular in the immediate vicinity of the loop, so the 
inner closed line of force will soon disappear, and then 
the successively larger ones will collapse about it. 

In proving the theorem one appiies to a very small 
loop of H about the ring the familiar line and surface 


integrals 
fros- f H-ds= te (18) 
> 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


} 
\ 
i 
| 
| 
: 


PST SST 


Sore 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


226 De eRs 


cS 


(b) 


A 
(a) 


(c) 


Fic. 16. Twisting of bundles of lines of force is encountered 
only in complex geometrical situations. 


Here s is the distance around the loop (not the ring) and 
Ho is thus an average field around it. If we neglect the 
contribution of potential differences to the current 
(as we may in this case because of the cylindrical 
symmetry assumed also for the flow pattern), we have 


J=ovXH <otmaxH. (19) 


Here dmax is the greatest velocity at any point in the 
loop. The first integral in Eq. (18) is thus less than 
Umax oS, where S is the area of the loop, so we have 
the inequality 


Hos <oVmaxH 0S. (20) 


Since S is quadratic in s, the inequality is not satisfied 
in the limit for a small loop with finite c%max, which 
shows that the current is insufficient to maintain the 
field. It also happens that Ho decreases as the loop over 
which it is an average gets smaller (as follows from 
Eq. (18)), but this cancels out in Eq. (20). 

The more general form of Cowling’s theorem applies 
to the situation of Fig. 16(b). Here again a closed ring 
is discussed so that the integral of grad V around it 
shall vanish, but this time the nearby lines of force are 
twisted about it, always threading through it in the 
same direction. Because of the happenstance just 
mentioned, that the compgnent of H normal to the ring 
and circulating around it vanishes in the limit as one 
approaches the ring, the ring itself is a line of force. 
Even without cylindrical symmetry, Eq. (19) still 
applies to an average around the ring of the component 
of J along the ring, and the theorem applies that the 
field threading through the ring cannot maintain itself 
by induction arising from velocity of the medium. If the 
twisting lines did not always thread the ring in the 
same direction, then /"H-f's in Eq. (18) would be at 
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some places positive, some negative, and (20) could be 
satisfied in the average by cancellation. 

In the terrestrial inductive process as we have been 
describing it, the relevant magnetic loop analogous to 
the ring of Fig. 16(b) is sketched schematically in Fig. 
16(c). It loops around outside the conducting core and 
dips into it in such a way as to bend (at least part way) 
around the axis like a curved hairpin. The bundle of 
adjacent lines of force, one of which is shown in Fig, 
16(c), twists around the two prongs of the curved hair- 
pin. Corresponding to the fact that the final toroidal 
current, Js of Figs. 11, 14, and 15, is toward the west 
in both hemispheres, these lines of force twist around 
in one sense in the northern hemisphere and in the 
opposite sense in the southern hemisphere, from the 
point of view of one going around the loop. Topolog- 
ically, they wind up on one leg of the hairpin and un- 
wind on the other, in a manner in which originally 
separate continuous bands could be intertwined. For 
this reason Cowling’s theorem does not apply. 


10. The Dynamo Theory from the Hydromagnetic 
Point of View 


Hydromagnetics, or magnetohydrodynamics, as it 
has usually been called, is in many cases greatly 
simplified by forgetting about induced currents and 
considering only the tendency of lines of force to be 
dragged along by moving conducting material. Even 
though it may seem plausible that the current loops 
we have portrayed in Figs. 7-14 and particularly Fig. 15 
may be the most important ones, it would make the 
theory seem simpler and additionally convincing to see 
from this hydromagnetic point of view that the lines 
of force are kept from collapsing by being dragged 
along by the rotational and convective streams of 
matter. In the discussion of Fig. 8, we have already 
seen this for Step 1. The next steps are a little more in- 
volved and it is not so easy to make them as graphic, 
but with all their complexity they provide a true 
portrayal of the main regenerative process proposed in 
the dynamo theory. From the discussion of Cowling’s 
theorem, it is clear that we must discuss not just one 
line of force, but a bundle of lines of force, and consider 
how they become twisted about one another. Consider 
the part in the northern hemisphere, the upper leg of 
the “hairpin” in Fig. 8 and Fig. 16(c). The bundle of 
lines of force H in Fig. 17(a) is pushed upward (i.e., 
outward along the radial direction p, toward the right 
in the diagram) and northward by the convective 
current v, which has components v, and 2., as shown in 
Figs. 17(b) and 17(d). By way of a simple model (in 
which a gradual rotational shear is replaced by @ 
sudden shear) we consider the fluid not rotating, but 
contained in a conducting spherical shell which rotates 
so as to have a westward velocity —2 at the surface. 
This drags toward the west the upward (p) component 
of the bulge, as shown in an exaggerated fashion in 
Fig. 17(c). Even the lines that do not bulge out into 
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the surrounding shell are thus affected by the crowding 
together of the lines on the east side of the bulge. For a 
given p a little inside the surface, the component v, has 
a maximum in the equatorial plane, while v, is there zero 
and has a maximum value some distance above this 
plane. In the equatorial plane, v, approaches zero 
near the surface. The effect we are about to describe 
depends on both v, and v., so is most important where y 
is directed near the 45° direction and fairly near the 
surface. Let us consider the effective v in this direction 
having maximum value along some line, and consider a 
bundle of lines of force H which before deflection is 
centered on the line of this maximum y, as suggested 
by Fig. 17(f). It is deflected along p and z, but less along 
p than z in the central azimuthal plane of the convection 
current, because of the geometrical effect shown in Fig. 
17(c), (the side of the bulge being not so far out as the 
extreme of the bulge, while the z deflection is not thus 
modified down in this corner near the equator). The 
maximum v now pushes near the outer edge of this 
bundle of lines of force, in such a way as to give it a net 
counterclockwise twist, as in Fig. 17(g). The corre- 
sponding twisting moment is clockwise in the southern 
hemisphere. (For a downward convection current there 
is an inward bulge and y is reversed; the twist is still 
counterclockwise in the northern hemisphere, clockwise 
in the southern.) If we consider the lines of force to be 
effectively “anchored to the wall” where they penetrate 
through the conducting mantle (by the local eddy 
currents therein), then we see that these twisting 
moments are just what is needed to twist up the bundle 
of lines of force in the manner shown in Fig. 16(c). 

The cumulatively important part of the twisting 
occurs in the long stretches of the bundle, between the 
azimuthal planes of the convection currents where the 
twisting torque is applied, just as in the consideration 
of currents in Fig. 15 (for example) the important re- 
generating current J, flows in a large loop almost 
around the sphere as a result of electric fields confined 
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Fic. 17. Hydromagnetic equivalent of the dynamo theory. 
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to the vicinity of the radial convection streams. Within 
the bulge shown in Figs. 17(b) and 17(c) there exist 
local twistings and turnings in both senses which appear 
to be unimportant because they affect so short a 
length of the bundle, corresponding to the local currents 
set up by Ey in Fig. 14, for example, in the small space 
between E; and the outside of the conducting region 
in a return loop not shown in the figure. In addition to 
the central bundle selected in Fig. 17(f), there are other 
bundles on either side of the maximum y on which 
twisting torques act even before the displacement, but 
these tend to cancel out between the two sides and the 
change in torque induced by the bulging displacement 
is in the same direction as in the representative central 
bundle. 

To complete the schematic hydromagnetic picture 
we must now imagine the irregularly-shaped loop in Fig. 
16(c) repeated several times around the sphere in such 
a way that the “hairpin” part of each penetrates into 
the region between the two branches of the next “‘hair- 
pin.” Each of these twisted bundles making up a 
“hairpin” is defined by the way it passes through a 
particular upward or downward convective stream, so 
there is not a continuum but a discrete set of them that 
need not intertwine where they are locally adjacent. 
The reason that the external field represented by the 
outside part of the loop cannot collapse to nothing is 
then that it surrounds the “hairpin” of the next loop, 
in which the twisting mechanism explained above is 
operating to maintain a curl M. Thus as a result of 
motions not possessing cylindrical symmetry, we have 
an over-all situation in which a circulating magnetic 
field is prevented from collapsing in the equatorial 
zone, and this part of the field can then support the 
outer part of the field which threads through this ring- 
shaped region somewhat in the manner of Fig. 16(a). 


11. The “Twisted Kink” Theory 


When a rather stiff cord or a bundle of rubber 
strands is twisted it relieves the local twisting strain 
by looping itself into a kink. Alvén (1950) has Suggested 
that the tendency of twisted bundles of magnetic lines 
of force to do likewise may form a basis for inductive 
regeneration of the earth’s magnetic field. In particular, 
he points out the possibility that the adjacent lines of 
force can join together as the kink closes, allowing the 
column and loop each to close up and separate from 
one another. The mechanism of the twisted kink in a 
magnetic field has been treated analytically by Lund- 
quist (1951) and more recently by Dungey and Lough- 
head (1953), who show that there is another type of 
instability that may set in besides kinking, but do not 
show that it would preclude kinking, since they in- 
vestigate only the initial small deviation from cylindrical 
symmetry. ` s 

A possible way in which the kinking may contribute 
to the regeneration of the earth’s field is suggested in 
Fig. 18. We assume that the jet stream or other con- 
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Fic. 18. Formation of magnetic kinks. 


vective pattern will, through frictional drag provided 
mainly by local magnetic fields, cause the inner core to 
rotate faster than the mantle. As in any inductive 
theory, we start with (the z-component of) an initial 
stray field, however small. The bundle passing through 
the core tends to concentrate as a thick twisted bundle 
about the axis. Here we may apply Alvén’s process of 
kink formation and subsequent breaking away of a 
loop, as in Fig. 18. Once out on its own in the fluid core, 
the magnetic loop begins to decay with a characteristic 
relaxation time. From Maxwell’s equations (10) and 
(11) we have the wave equation VX_VXH |= —4rcH 
which order-of-magnitude-wise is simply 


H/H=—1/4rol? 


and defines the relaxation time, the time characteristic 
of the exponential decay of a field in a conducting block 
of linear dimensions /, as 


T =4rol? = 10-3(I/cm)? sec (21) 


if we use the conductivity (7). For a body the size of the 
core, this gives about 10" sec= 3- 10° years, or for a body 
ten times smaller, as one of the loops arising from 
kinking might be, about 10” sec. For a tightly twisted 
loop, it appears that this would not be very much less 
in a fluid than in a solid, though we do not have an 
adequate treatment of this question and the answer may 
depend on the magnetic environment. The conductivity 
might be less than (7) and the time nearer 10" sec. 
This is still much greater than the time of rotation of the 
inner core at a peripheral speed of 1 cm/sec, comparable 
to that of a jét stream, forexample. 

We may thus assume that a separated loop as in 
Fig. 18 may last during the subsequent generation of 
several more such loops by further twisting and 
kinking. Once ten or more of them have been generated 
in this fashion, they might be expected by squeezing 
each other from the side to cause an expansion of the 
size of the magnetic loops, until eventually the loops 
from the northern and; southern hemispheres might 
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join, making more lines of force thread through the 
inner core to be further built up by the same process. 
This forms what appears to be a fairly promising basis 
for the regeneration of the earth’s field, though much 
more careful consideration is required to validate the 
picture. It is not at all clear what effect the convective 
streams in the equatorial region would have on the 
expanding loops. Alvén in his discussion considered only 
the events in this region and suggested that the twisting 
exerted here by the convective streams would form 
kinks in this region which might in turn be stretched 
by the convective streams in such a way as to regenerate 
the original field. 


Il. THE WESTWARD DRIFT AND OTHER 
DETAILED CONSIDERATIONS 


12. Convection and Magnetic Drag Driving 
the Westward Drift 


We turn now to the problem of the westward drift 
of the geomagnetic patterns on the earth’s surface 
(Vestine ef al. (1947); Bullard (1949a); Elsasser 
(1950a)). Discussion of further details of the regenera- 
tive process of the dynamo theory will depend on details 
of the mechanism of westward drift. On the other hand, 
the mechanism of westward drift is largely independent 
of the details of the regenerative process; it depends 
merely on the same coexistence of rotation and con- 
vection within the fluid core, and on the observed end 
result of the regenerative process, the main (dipole) 
component of the earth’s field threading through the 
fluid core. It is even independent of the nature of the 
origin of the field, whether it is primarily inductive or 
thermoelectric, for example, and thus, unfortunately, 
does not help us to choose between these theories. 

Physical theories are considered well founded when 
many observed facts are deduced from few hypotheses. 
The geophysical phenomena of the deep interior are 
remarkable for their scarcity and remoteness, which 
makes it difficult to come to definite conclusions con- 
cerning the behavior of the core of the earth. Seis- 
mology has made important strides to delineate the 
boundaries of spherical shells of various density and 
rigidity characteristics, and astronomical data make 
suggestions concerning their contribution to the total 
moment of inertia. The magnetic phenomena consist 
not only of the primary effect, the existence and‘declina- 
tion of the main dipole, but also of a bewildering wealth 
of detail concerning deviations from a simple dipole 
field and their secular variation. It is perhaps too much 
to hope, at least of the simple outlines of a theory, that 
it account for much of the smaller detail. It is gratifying 
that the mere existence of radial convection currents, 
which provides the basis of the induction theories here 
outlined but might also coexist with some other origin 


of the main field, provides an entirely natural and very | 


simple explanation of the most striking feature of the 
secular variation, the westward drift of the general 
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pattern of the detailed nondipole field. This drift takes 
place at the rate of about 0.18° per year. 

f The westward direction of the drift may seem at first 
sight enigmatic because the earth rotates toward the 
east, and the Coriolis forces in a rotating convective 
fluid are in such a direction as to make the inner part 
rotate faster than the outside, that is, eastward relative 
to the outside in the case of the carth. However, the 
detailed structure of the field observed at the surface 
may be presumed to arise from motions in the outer 
part of the core, and this is expected not only to move 
westward relative to the inner part of the core, but 
also to move westward relative to the mantle, as we 
shall see immediately. 

We have seen that the rotational motion within the 
core is expected to be determined largely by a balance 
between Coriolis and magnetic forces. The Coriolis 
forces are virtual forces in a rotating (terrestrial) 
coordinate system which serve to conserve angular 
momentum in a stationary (astronomical) coordinate 
system. If there were no magnetic drag or other forces 
acting within the fluid, the radial convection would 
bring about a state of motion wherein the angular 
momentum (about the axis in the stationary system) 
per unit mass would be the same at all distances from 
the axis of rotation of the core. (In this nonrigid rota- 
tion, the axis is defined as a line of particles which are 
stationary in space.) Constant angular momentum 
per unit mass at various radii would imply tangential 
velocity inversely proportional to the distance from 
the axis, or angular velocity inversely proportional to 
the square of this distance, much greater near the axis 
than further out. With magnetic or viscous drag or 
both, this tendency for the inner part (i.e., the part 
nearer the axis) to rotate faster than the outer part 
persists to a lesser extent, the extent depending upon 
a balance between Coriolis forces and the drag forces. 

With magnetic drag forces, the axis of rotation of the 
core would be expected (Inglis, 1941) to coincide rather 
closely (within a few degrees) with the axis of rotation 
of the mantle, even if the boundary of the core were 
spherical, in spite of the gradual change in direction of 
the axis of rotation of the mantle described as the 
27 000 year precession arising from quadrupole-moment 
torques exerted by the sun and moon on the oblate 
mantle. For an oblate boundary between the fluid and 
solid, there is a theorem by Poincaré which, although 
proved (Lamb, 1919) only for an infinitesimal angle 
between the axes, appears to show that the two axes 
would coincide in spite of such a precession caused only 
by torques on the mantle. Furthermore, quadrupole 
torques acting on the core, with an ellipticity similar 
to that of the mantle, tend to give it independently 
approximately the same rate of precession as the mantle. 
We may thus take the axes of rotation of the core and 


_Iantle to be the same. 


In an earth rotating with constant angular velocity 
Q, and in the limit for small magnetic and other drag, 
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one would then expect the angular velocity of the core 
to be a function of the distance p from the axis of 
rotation, 


olp) =B p. (22) 


If the length B in the constant of proportionality 
QB? is less than the radius A of the core, then there is a 
cylinder of radius B which is the locus of points at rest 
relative to the mantle as shown in Fig. 19(b). With 
small drag, the only possible steady state is one in 
which B has the value, B <A, for it is only thus that the 
total torque exerted by the mantle on the core, or by 
the core on the mantle, can vanish. The inner part of 
the core, with p< B, rotates faster than the mantle, or 
eastward relative to the mantle, and the outer part, 
with p>B, rotates more slowly than the mantle, or 
westward relative to the mantle, as suggested in Fig. 
19(a). Thus the inner part exerts a torque of one sign 
(north) and the outer part a torque of the opposite sign 
(south) on the mantle, which can make a total of no 
angular acceleration in the steady state. Expressed 
otherwise, the inner part of the core experiences a south- 
ward torque from the mantle and a northward torque 
from the outer part of the core which must cancel one 
another, while the outer part of the core is subject to 
equal and opposite torques from the inner part and 
from the mantle. If the steady state has not been 
reached, on sees easily that the angular acceleration is 
in the direction to approach the steady state. With 
strong drag forces, w(p) will vary with p less rapidly 
than in Eq. (22), but the same sort of division of the 
core into two oppositely rotating parts must take place 
in the steady state. 

An attempt to picture the mechanism of the magnetic 
drag responsible for the westward drift of the outer 
part of the core is made in Figs. 19(b) and (c), where 
the deflection of a typical line of force in the inner part 
of the core is shown on the left and in the outer part 
of the core on the right. The torques exerted by the 
drags in opposite directions on the core as a whole 
balance one another. In order to appreciate the balance 
of torques on the two parts of the core separately, one 
must imagine a torque acting between the two parts 
of the core across the cylindrical boundary. This 
torque is associated with a net inward flow of angular 
momentum with the convection, angular momentum 
having been imparted to the inward flowing material 
by the drag in the outer part, and vice versa. 

For the sake of an estimate, let us assume that the 
field in the core is homogeneous and parallel to the axis, 
which makes it the field of a dipole in the mantle. In 
analogy with the discussion of Fig: 5, we may take the 
magnetic drag force per unit area of projection on the 
equatorial plane, Fig. 19(c), to be roughly proportional 
to the relative velocity of the core and mantle at a given 
p. The vanishing of the total torque exerted by the 
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mantle on the core may then be written" 
A A 
f (Q—o())e'dp=20 | (1—B*/p*)p*dp=0 (23) 
0 “o 


by use of the approximation in Eq. (22), which then 
gives 


B=374. (23.1) 


The fraction of the area of the spherical boundary be- 
tween the core and mantle which bounds the outer 
part of the core is thus (2/3)!=0.82, and only the 
remaining 18 percent of the outer surface of the core is 
found in the “polar caps” of Fig. 19(b) at the ends 
of the approximately cylindrical inner part of the core. 
Thus a very broad equatorial band of the outer surface 
of the core drifts westward in this approximation. 

If we assume that the observed drifting pattern of the 

earth’s field is caused by local irregularities such as 
vortices moving with the material just beneath the 
surface of the core, the mantle is thick enough to 
obscure to some extent the external observation of these 
irregularities in the small “polar cap” so the observed 
secular variation is dominated by a westward drift 
characteristic of the broad equatorial band. This does 
not correspond in detail with the conclusion of Vestine 
that the motion of the nondipole field is consistent with 
an angular velocity independent of latitude. It must also 
be remembered that the Taylor-effect tendency for 
convective flow parallel to the equatorial plane tends 
to bring the convective streams, and thus the turbulent 
irregularities, to the surface in the equatorial belt. 
The polar caps might thus be quite without distin- 
guishable character and all contributions to the irregu- 
larity of the pattern might come from the westward- 
drifting equatorial belt. However, the independence of 
latitude cannot be claimed with great accuracy, the 
patterns studied being sufficiently vague and ephemeral 
as to leave some freedom of interpretation, and it is not 
yet clear that the observations contradict the expecta- 
tions from this simplified model. On the other hand, the 
slight refinements of the model discussed below, in Sec. 
16 reduce the theoretically expected dependence on 
latitude. 
} A still more crude model for the discussion of the 
westward drift could consist of the two parts of the 
core pictured in Fig. 19(b) as solids sliding on one 
another and within the solid mantle, one rotating for- 
ward and the other backward relative to the mantle. 
Then one needs to think of The torques mentioned above 
acting on these rigid rotating bodies. 

Bullard, Freedman, Gellman, and Nixon (1950) have 
discussed the westward drift in terms of a somewhat 
similar, though in reality quite different, model. They 
likewise consider rigid inner and outer part of the core 
sliding on one another, but separated by a sphere 

rather than a cylinder of radius B. They point out the 
tendency of the inner part to rotate more rapidly than 
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the outer part, and can conveniently consider the in- 
duction mechanism to proceed within these two parts 
since the outer part is analogous to the top T of the 
carrousel in Figs. 6-12. They point out that current 
J, induced in the first step (Fig. 7) would also leak out 
into the more weakly conducting mantle and would 
couple with the field H, produced there by the currents 
in the inner part of the core in such a way as to make 
a westward drag on the core resulting in a westward 
drift. This model may also be discussed in close analogy 
to the linear problem of Fig. 5: v2 downward corresponds 
to the eastward motion of the inner sphere of the core, 
vı upward to the westward drift of the surrounding 
spherical shell of the core, and the magnetic drag across 
the boundary between conductors compensates the 
inward flow of angular momentum. The horizontal 
emergence of the line of force in Fig. 5 corresponds to 
the lack of torque between the core as a whole and the 
mantle, when averaged over all radii p. In the spherical 
problem, the core moves too fast to permit normal 
emergence at small p, and at large p the lack of penetra- 
tion through the core makes a deviation from normal 
emergence in the opposite direction out into the con- 
ducting mantle. The weakness of this spherical model is 
that one can see no mechanism in nature to provide 
the analog of the rigidity of the spherical shell B<r<A 
while it rotates westward relative to the mantle, and 
without this rigidity the model does not even approxi- 
mate the motion it is intended to represent. 


13. Speed of the Westward Drift 


The observed speed of the westward drift is very 
much less than the maximum that would be possible 
according to the mechanism proposed. If the Coriolis 
term (3) of Eq. (1) were very much lager than the 
others, we would have Eq. (22) asa good approximation, 
whence, with Eq. (23.1), we find for the angular speed 
of the almost-free relative motion between mantle and 
adjacent fluid of the core at the equator, 


|(A)—O| = (2/3)= (2/3) 2m day} 
=5X105 sec™!. (24) 


The angular speed of the observed westward drift is, as 
already mentioned, about 0.18°/yr=10— sec™. If the 
westward drift is to be attributed to the motion of the 
fluid core at all (and there appears to be no other 
possibility), the rapid relative rotation in Eq. (24) 
must be very effectively suppressed (to the extent of a 
factor f~2X10-5) by strong coupling forces between 
the mantle and the various parts of the core (or directly 
between the parts of the fluid core). This still leaves 
valid the explanation of the westward direction of the 
drift. 


Tf the magnetic assumption applying to Eq. (4) were 


strictly valid, that is, if the magnetic lines of force were | 


frozen in the conducting matter of the core and mantle, 
then as long as any relative rotation continued, the 
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Fic. 19. Differential rota- 
tion and magnetic drag in 
the core as a source of the 
“westward drift.” 


lines of force would continue to be wrapped further 
around the axis in the manner suggested by Fig. 8 or by 
Eq. (15) with c=, making the restraining force 
approach infinity. Thus with infinite conductivity the 
relative rotation would eventually stop and there would 
be no westward drift. 

The westward drift is analogous to vz in Eq. (17) in 
the linear slab problem, and its existence is allowed 
by the finite product of conductivity and dimensions, 
analogous to ob of Eq. (17). The rotating sphere 
problem differs from the linear slab problem in having 
electric charges built up on the surface, and in the fact 
that the plane of the paper containing a loop of the line 
of force shown in Fig. 5 is wrapped about an axis 
(Fig. 8). In Fig. 5 the lines of force slip through the 
matter only as a result of the curvature shown in that 
plane, but the third dimension of the spherical problem 
makes it possible for them to slip toward the axis of 
rotation and thus oppose the rotation of the inner part 
more than that of the outer part of the core. This intro- 
duces another tendency for a westward drift, and in- 
creases the fraction of the core that experiences a west- 
ward motion relative to the mantle. We neglect this 
effect in pushing the analogy so far as to use Eq. (17) 
for an order-of-magnitude estimate of the forces in- 
volved in the spherical problem (Eq. (28), below). 

Bullard, Freeman, Gellman, and Nixon (1950) [see 
also Bullard (1949a), Sec. 6] have made an estimate of 
the rate of change of angular momentum and of the 
forces required in their spherical-shell model of the west- 
ward drift, and it seems appropriate to make a similar 
estimate, which need be no more than an order-of- 


_ magnitude estimate, for the flow across the cylindrical 


boundary shown in Fig. 19. With B=3"1A=0.584 
from Eq. (23.1), the inner cylindrical part contains 


“4g X WESTWARD 
A DRIFT 


(c) 


46 percent of the volume. Thus the volume of the 
sphere is divided approximately into two halves by the 
cylindrical boundary. If the matter in the two halves 
is interchanged while rotating at approximately the 
angular velocity of the earth 9, the gain of angular 
momentum by the material put into the outer half is 


AL=9AI=0.5612=0.56(8x/15)SA5O, (25) 


about half the angular momentum of the core.{ The rate 
of change of angular momentum of this material is then 


L=AL/T, (26) 


where T is the mean time of interchange of the material 
in the two halves. 

We presume that this angular momentum is im- 
parted by the torque of magnetic forces, the magnitude 
of which depends on the relative velocity of the core 
and the mantle. It also depends on how much the 
magnetic field can “slip through” the material of the 
mantle (or of the core if it should have the lower con- 
ductivity). This relative tangential velocity v:(p) we 
may for simplicity assume to be less than that of the 
free convective rotation (24) by a constant factor f: 


v= f2((B*/p*)— 1)p, (27) 


(though in reality the strongly retarded distribution 
may be more nearly linear than quadratic in 1/p.) At a 
given p<B at one end of the inner cylindrical part we 
may think of the line of force represented by the left- 
hand loop in Fig. 19(b) as curving.and slipping through 
the core as through the inner slab of Fig. 5, through the 
conducting layer of the mantle just as in the outer 
slab of Fig. 5, and passing normally out into outer space 


t Here we use S for density, keeping p for cylindrical radius. 
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as a part of the dipole field we observe. We approximate 
the force per unit area, F(p), from the linear problem, 
Eq. (17), 


F(p) =4roH n(A? — p) = hro nA 
=4rolH?A (Co./(Co-t-Ac))1r~4r0CH*%,. (28) 


Here C is the thickness (corresponding to a of Fig. 5) 
and ge the conductivity of the conducting layer moving 
with the mantle; 4 and o refer to the core; v; corre- 
sponds to vı+%: in Fig. 5, and from Eq. (14) we have 


v= (Co./(Co-+ Ac) (Co./Ac)%, (29) 


for the case wherein the Co,/Ac<1. We have used this 
approximation in Eq. (28). 

In the coordinate system of Fig. 5 the field in the 
outside space is purely magnetic and perpendicular 
to the surface. In the terrestrial analog this system is 
moving at —2 relative to the observer sitting on the 
slab (mantle), in whose system there is according to the 
Lorentz transformation also an electric field E~ (w/c) H, 
but 2 is so small compared to the velocity of light c 
that this is presumably unobservable (of the order of 
10-8 volt/cm). 

The moment of force on the two rounded ends of the 
cylindrical region is 


B 
L= ar f F (p)p*dp~ 41° B'Co HH’ fQ, (30) 
0 


and from Eqs. (25), (26), (30) and B=3-?A, that is, 
equating the magnetic torque to the rate of change of 
angular momentum in the interchange, we have 


1/T~1f(Co./Ac)oH?/S =} fgo.H?/S. (31) 


Here we have introduced a “geoconductive ratio” g 
defined as the effective conductivity of the mantle 
relative to that of a normal metal as thick as the core, 
g= (Co./Ac)(c/o0)=(Co-/Aoo), where oo is the con- 
ductivity of a typical normal metal, 10-4 sec cm~. Since 
there are so many uncertainties that we cannot hope 
to derive the rate of the westerly drift with significant 
accuracy, we have used it to determine f~2-10-° as 
discussed in connection with Eq. (24). With o.=10- 
sec cm~*, H?=10 gauss?= 10 g cm™ sec”, and a density 


S=10¢g cm, we have as a unit of reciprocal time in 
Eq. (31) 


ooH?/S=10~ sec, (32) 
with which, and with f=2-10-*, 
T= 710" sec= g 300 years. (33) 


The time required fer the interchange, that is for half 
the volume of the core to flow across half the cylindrical 
boundary, 2rB(A?—B?)! with B’=0.37A?, at the mean 
radial speed 3, is 


T=0.1A/tr (34) 
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This with E 


0,=7 


q. (33) gives us 
10-''gA sec™!= 0.024 g cm/sec. (35) 


This is the mean radial speed necessary to make the 
interchange of matter between the two parts require 
the magnetic torque on the inner cylinder estimated 
by use of the observed westerly drift, not just the radial 
speed obtained simply by equating the speed of con- 
vection to the speed of the westerly drift. With g~1, 
this mean radial velocity is, however, of the same order 
of magnitude as the tangential speed v,= 10-4 sect 
~0.03 cm/sec corresponding to the observed westward 
drift (cf. remarks below Eq. 24): 


0,=0.7g0,~0.7%. (36) 


Thus a conductivity comparable to ordinary metals 
in a thick layer of the mantle, would suffice, with this 
simple model, to let the mean flow velocities within the 
flow pattern in the core be about as great as the drift 
velocity of the pattern relative to the mantle. Such a 
high conductivity makes the ‘relaxation time of the 
earth’s field rather long, a million years or more as 
we have seen in Sec. 11 [Eq. (21) ], too long to be com- 
patible with some of the local variations of the earth’s 
field which seem to take place in a time scale of more 
nearly a thousand years. If the conductivity of the 
mantle (or of the core) is an order of magnitude smaller, 
so as to act as a less effective brake on the relative 
rotation of the inner and outer parts of the core, the 
velocities within the flow pattern must be correspond- 
ingly slower in order not to drive the relative rotation 
of the inner and outer parts of the mantle much faster 
than the observed drift. More rapid flow is permitted 
if instead of averaging over the material of the core we 
consider the flow concentrated in a jet stream, as dis- 
cussed in Sec. 18. 

The rate of flow of matter across the interface im- 
plied by Eqs. (34) and (35) is of the order of 101g g/sec, 
which even with a small value of g is still so large com- 
pared to the flow in Urey’s settling process, 10" g/sec 
mentioned at the end of Sec. II, that the transport of 
angular momentum by the settling matter may be 
neglected. That is, the convective matter is much more 
abundant than the settling matter which may be 
driving the convection. 

The extent of the spiralling of the lines of force in the 
manner of Fig. 8 may be estimated by use of Eq. (16), 
which we express as follows [using Eqs (27) and (29) ] 
in adapting it to the notation of the spherical problem: 


Z= 2mo mwb? —>2ro(Coo/ (Coot A0)) 
XSJ%(1— B/A (4— P). (37 


For a typical radius p= B/2, and still with g= 1, this is 
evaluated approximately 


Z/A=10'. (37a) 


The lateral excursion Z of the lines is thus determined 
by the hold the conductivity of the mantle has on them. 
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With g=107 the lines of force are wrapped around the 
axis something like 100 times, though the number 
would be considerably reduced by the radial migration 
of the lines toward the axis which is neglected in this 
approximation. 


14. The Dynamo Theory as Modified by the 
; Westward Drift 


An unsavory aspect of the dynamo theory of terres- 
trial magnetism is that it involves nonanalytic relative 
motions in a continuum, and it is not quite easy to 
assure oneself that one has considered all of them and 
properly assessed their relative contributions. With the 
conventional positive rotation our exploratory discus- 
sions of the regeneration have treated explicitly a core 
rotating eastward within the mantle. It has appeared 
plausible that the relative motions most important to 
the dynamo action are those near but not in the 
equatorial plane, including the rotational shear relative 
to the equatorial belt surrounding the core represented 
by the cylindrical shel of Fig. 15. This primary con- 
tribution to the regeneration was discussed in connec- 
tion with Figs. 15 and 17. Other significant contribu- 
tions appear to be those involving shear between stacked 
disks, as discussed in connection with Figs. 12 and 13. 
Only the parts of these models far from the axis were 
considered to correspond roughly to convection in the 
earth consisting mainly of rising and falling currents 
near the equatorial plane (Fig. 2). Thus in either case 
it is mainly the outer part of the motion that matters, 
so the fact that the inner and outer parts rotate in oppo- 
site senses in keeping with our discussion of the west- 
ward drift does not greatly alter the regenerative 
process. That the important outer part rotates toward 
the west rather than the east within the mantle is also 
unimportant, as we have seen from the remark that 
the process is quadratic in vı, or in the hydromagnetic 
considerations of Fig. 17, from the fact that the twist 
is the same for either sense of relative rotation. 

No matter whether the rotational velocity varies as 
in Eq. (27) or less drastically with distance from the 
axis, there must be some rotational shear across 
cylindrical elements throughout the volume, of sign 
opposite to that encountered at the outer boundary. 
While this is deserving of more complete treatment, it 
is not expected that this should contribute over- 
whelmingly to the regeneration because we have found 
reason to emphasize the upward and northward part of 
the convective flow (outward and upward in the 
diagram), and this presumably exists only near the 
outer boundary where the upward convection is dis- 
persed horizontally. (If there is stratification of the 
convection, a sort of large-scale turbulence in layers 
divided by cylindrical surfaces, the horizontal spreading 
of upward and downward streams at such a surface 


_ would tend to cancel.) 


The problem of making a quantitative estimate of 
the field produced by a given.driving force of the con- 
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vection is considerably modified by this conception of 
the westward drift. Even the choice of a model to 
simplify the calculations may depend on one’s judge- 
ment of the conductivity of the mantle, on the parameter 
g. While it would be most interesting to have it shown 
analytically that velocity distribution of a simple con- 
vective pattern does act as a dynamo, it must be 
remembered that the patterns that have yet been 
discussed are too simple to be compatible with the 
westward drift. 


15. Estimate of the Fields and Energies Involved 


A basic question about the general problem of ac- 
counting for the earth’s field is the adequacy of the 
available energy source to maintain the field. A rough 
order-of-magnitude estimate indicates that it is possible 
to postulate a heat source within the earth supplying 
plenty of power, or alternatively that adequate power is 
involved in Urey’s hypothesis of the settling of denser 
matter. As is discussed further in Sec. 17, the thermal 
source at the center of the earth may well be of the 
order of magnitude S= 10” ergs/sec. The energy resident 
in the earth’s field is at least of the order of magnitude 
(H?/8r) (4 A?/3) = 10% ergs, calculated here simply as 
the part inside of the core of radius A if the surface 
field H~4 oersteds persists within. Without an energy 
source the field would, with a relaxation time T=10" 
sec given by (21), lose energy at a rate of about 10” 
ergs/sec and a source at least this great is required 
to maintain it. We thus see roughly that we have a 
generous factor of about 103 to spare to take account of 
the various inefficiency factors, thermodynamic 
(Carnot), electromagnetic or thermoelectric, and me- 
chanical, and of the excess of the internal toroidal field 
H, over the surface field H. 

Let us now look in more detail at the energy dissipa- 


‘tion specifically in the induction theory. One indication 


of the magnitude of the “toroidal” field Hz is given by 
an analysis of typical details of the secular variation. 
Bullard (1948) has shown that the secular, variation 
during the past century in the region of rapid change 
near South Africa may be accounted for by the devel- 
opment during this time of a dipole moment M~2 10 
gauss cm? just beneath the surface of the core. One 
mechanism for the development of such a dipole moment 
is the turning over of a mass of conducting fluid which 
was originally threaded horizontally by H+. That is, a 
turbulent surface eddy twists the lines of force. As- 
suming 400 km as a reasonable magnitude for the diam- 
eter of such a surface eddy in the earth, and that the 
velocities in it are comparable to those suggested by 
the westward drift, Bullard has concluded that Hz is of 
the order of magnitude of 10H;. Since this involves 
somewhat arbitrary choice of eddy size and indicates 
only conditions near the surface, and since another 
mechanism such as the transport of a magnetic mono- 
pole, which is equivalent to the development of a 
magnetic dipole, might be involved, we prefer to leave 
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H, as an undetermined parameter in discussing the 
magnitudes which might be encountered in the re- 
generative process. 

As we have seen, the proeess is sufficiently compli- 
cated to involve several unknown factors, and we here 
attempt a discussion which is intended to be merely 
illustrative of the sorts of considerations that must be 
involved, and cannot be considered in any way quanti- 
tative. In cases wherein the magnetic lines are essen- 
tially frozen to the matter the methods of hydro- 
magnetics which ignore the currents are particularly 
appropriate for quantitative estimates, but here where 
the slipping of the lines through the matter is essential 
to the process, we find it preferable to include the 
currents in the discussion. 

We consider the process typified by Figs. 7 and 13, 
with the crucial part of the regenerative process taking 
place near the equator. The process consists of four 
steps, i=1.. . 4. Ina typical step, H; and v; induce an 
electric field E;~2;H;. This field acts throughout a 
roughly cylindrical region of length a; and area A,, for 
example, and drives a current which outside the cylinder 
may, in a typical step, loop back from one end to the 
other of the cylinder in all azimuths so that only a 
fraction, say h, of the current through the cylinder is 
effective as J; in the current loop, giving rise to H;1 in 
which we are interested. Since the current returns 
through an area much greater than A;, most of the 
resistance of the circuit is in the cylinder and the 
current through it is 


Ji/h= Eis;A i= vHis;A i. (38) 


The magnetic field produced by a current J; in a loop of 
area a; at a distance along its axis a; is about 


H= 2irJ;/ai™= 2hvH;0;A i/ai =k;H;. (39) 


There are so many quantities of which the estimates are 
uncertain that we introduce an ignorance parameter h 
whenever it is needed to represent a number which we 
expect should probably lie between 3 and 3. In Eq. 
(39) we put a;=hA, A;=/2A2, A being still the radius 
of the core. If further we put 2;~22.~7 in keeping with an 
assumption of a simple pattern of convection as dis- 
cussed in connection with Eq. (36) and take o every- 
where the same, we have 


Hij41= 2h’ AH. (40) 


As the result of the four-step regenerative process we 
have : F 

Hs=H= (2ks A)*H3. (41) 
Thus v and o are related to each other and through 4 
to the geometry of the convection by the necessary 
condition that the dynamo shall function in a re- 
generative manner: 


WocA =1. (42) 
If we consider » to be of the, order of magnitude of the 
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velocity of westward drift, v=0.03 cm/sec, and express 
conductivity in terms of the typical value for metals 
oo= 10-4 sec cm~, we find from Eq. (42), c= 1000/2, 
a not unreasonable figure. 
The power dissipated electrically as heat in each 
step is 
W~ Eiai (J;/h) = korA’H?, (43) 


and in the sum over the four steps it is probable that 
one step may predominate because of the variation of 
H; from step to step, which is expected as a consequence 
of having different effective values of 4? in each step. 
From Eq. (43) with Eq. (42) we thus estimate the 
total power 


W=4A?H 2=$h(A m/H) Wo, (44) 


where Hm is the largest of the four values of H;. H, is 
about 4 gauss, and v~v,= 10~A sec, and we introduce 
as a convenient unit of power 


(45) 


The power available is probably somewhat greater than 
this unit. Urey gives as the power available from his 
setting process 10 10°!(Ap/p) ergs sec—!, where Ap is the 
effective difference between the densities of the settling 
material and its environment as it settles and com- 
presses across a large pressure difference. Since densi- 
ties may differ and change considerably, it would not 
be surprising to find (Ap/p) as large as 10-* even at 
these high pressures, which would then lead us to expect 
Hm~ 10H; or somewhat greater. The power that might 
be available in thermal convection has been discussed by 
Frankel (1945) and by Bullard (1949a) and in Elsasser’s 
review article (1950a). The figure given in the latter, 
1.1X 10 cal/sec per cm? of the surface of the central 
body (radius 1250 km), corresponds to 10! ergs/sec, 
the same order of magnitude as is reasonable from the 
settling process, and again Eq. (44) suggests Hn~10Hi. 
If Hm were He, this would be in keeping with Elsasser’s 
original emphasis on the first as an amplifying step and 
with Bullard’s conclusion from the secular variation, 
and in no way contradicts the explanation of the re- 
generative process. 

A determination of which H; should be the largest, 
Hm, or, just beyond this, an estimate of the earth’s 
external field from the power available, would involve 
more careful estimates of 4%. It-should be labeled /;; to 
distinguish the various ways in which it enters: in 
Eq. (39) we call it hi, representing the inefficiency of 
E; in producing J;, and put a;=hpA, A;=h;7A. Then 
Eq. (41) would be, more generally, Hs=Hyik; with 
TEE and instead of Eq. (42) we would 

ave 


Wo=vA°H?=5 10" ergs sec. 
sn 


Ie (46) 


i=1 


Thus instead of having the product, Eq. (42), equal to 
unity for all steps, we will have k; larger than unity for 
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at least one (amplifying) step and smaller for others 
The preliminary discussion of Fig. 7 would make it 
seem that the first step should be the most strongly 
amplifying one because the magnetic lines thread 
through the whole moving sphere in such a way that 
hyo and hiz might be expected to be about unity, and fy, 
in the preliminary discussion appears to be less than 
unity only because E, exists only part way around the 
circuit, whereas in the other steps the motion of only a 
small part of the sphere may be involved and there 
exist ineffective circuits competing for J». Our further 
discussion has shown that this simple conclusion is 
dependent on the presence of fairly good conductivity 
in the mantle but not more than is reasonable to assume. 
The ratio H/H, is roughly equal to Z/A of Eq. (37a), 
which, by neglecting unknown pressure effects on con- 
ductivity, we estimated to be about 10° or 104. This 
leaves room for quite a lot of deamplification by some of 
the other steps as seems consistent with the smaller 
amounts of moving matter and the competitive geome- 
tries involved. 


16. Inclination of the Magnetic Axis 


Each of the induction theories, the dynamo theory 
and the twisted-kink theory provides a natural connec- 
tion between the mechanical and magnetic axes of the 
earth; in their simplest form they seem to make the 
two axes parallel because of the manner in which Hs is 
formed from H, by the dragging around of the lines of 
force by the differential rotation associated with the 
mechanical axis. The 19° inclination of the magnetic 
axis is then to be explained as some sort of a perturba- 
tion. We shall see that in the dynamo theory it is almost 
too easy to obtain a sufficiently large perturbation. 

So long as the convection currents have a symmetry 
that leaves them unchanged by reflection in the equa- 
torial plane, no inclination is to be expected, and the 
models we have as yet discussed, with the convection 
currents centered in the equatiorial plane (and possibly 
the axis itself as in Fig. 1), have had this symmetry. 
It seems likely that the threading of the lines of force 
through the convection loops will tend to bring about 
a symmetrical configuration, unless the conductivity 
of the mantle (through which the lines of force must 
slip, as mentioned in Sec. 12) is appreciably non- 
uniform. We shall here see that in the dynamo theory 
there is an amplification of any chance, small deviation 
from symmetry such that a small deviation of the flow 
pattern can cause a large magnetic inclination. 

We consider a small deviation from this symmetry, 
with two of the opposite radial convection streams, for 
example, outward, deviating from the equatorial plane 
by an angle a, as shown in Fig. 20. The thickness of the 

th velocity v2 is kA. At a distance 


stream moving out wi ) : i 
p from the axis it is lifted a distance s=ap in the linearly 


varying field HHO OEE (47) 
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where H, is now the maximum value of the toroidal 
field which occurs at a distance almost A from the 
equatorial plane. The electric field induced by this 
change frorn the symmetrical position is 


Ey (p) = voll» (3) = (ap/A Joell. (48) 


This gives rise to an electric current density approxi- 
mately Ez (p) within the stream which is to be multi- 
plied by the width of the stream 4A and by dp to give 
the current flowing across the stream between p and 
p+dp. Of this a fraction 4 flows around the sphere as 
part of Jz’, the rest shunting back around the convec- 
tion stream above and below the plane of Fig. 20. We 
assume as an approximation that it flows circularly, so 
that the current circulating between p and p+dp is 


Jd (p)dp= kasv: spdp. (49) 


If treated as a current in a circular wire in the plane of 
Fig. 20, this contributes to the field at the surface of 
the sphere (at a point A along the x-axis) 2mp(p?+ A*)+ 
XJ (p)dp which when integrated gives for the field 
at the surface from the equivalent transverse dipole 


Hf =2r(1—71/4) adore 
=14/PaAorfl,~alls. (50) 


The last expression can be considered only an order-of- 
magnitude estimate as it involves cancelling out /? in 
substituting Eq. (42). The inclination of the main 
magnetic dipole moment y of the earth from true north 
is in this approximation 


=H i /Hy=oH2/ Hy. (51) 


Since the first step in the regenerative process is 
probably an amplification, with H>H, we probably 
have ¢>>a so that only a very small deviation of the 
convection streams from symmetry about the equa- 
torial plane is required to account for the observed 
magnetic inclination. 

One might hope for a more casual theory of the in- 
clination, relating it to other phenomena, but lacking 
it we may at least note that the inclination could thus 
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Fic. 20. Inclination of the axis. 
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perhaps be the chance result of rather small inhomo- 
genities. 

Inhomogeneities in the mantle, perhaps associated 
with differential cooling under the continents and 
oceans, might seem more likely than spontaneous un- 
symmetrical behavior on the core. The westward drift 
has been attributed to competition between various 
torques exerted between the mantle and core, with 
lines of force slipping through the material of the 
mantle in various directions, and a greater ease of 
slipping through one part of the mantle than another 
could provide another reason for the inclination of the 
axis. 


17. The Possibility of a Jet-Stream Flow Pattern 
in the Earth’s Core 


To avoid excessive complexity in presenting some 
of the inductive effects that might be important, we 
have in Secs. 6-15 confined the discussion primarily 
to an admittedly oversimplified flow pattern within 
the core, essentially that of Fig. 2. From simple analogy 
with the observations cited in Sec. 2 it seems likely 
that there is actually a more complicated flow pattern 
perhaps involving a jet stream. 

We have seen that the stability of a jet stream with- 
out a magnetic field is associated with its effectiveness 
in transporting heat without requiring transport by 
conduction over more than a very short path laterally 
into the narrow stream. A magnetic field, if sufficiently 
strong, may provide an additional reason for the 
stability of a jet stream in a conducting fluid. If a rubber 
band is stretched around the end of a right cylinder, 
but not across a diameter, it has a tendency to slip off. 
Correspondingly, if we have a cylindrical stream of 
conducting fluid moving parallel to the axis within a 
surrounding conductor, a magnetic line of force tra- 
versing the cylinder and not passing through the center 
has a tendency to be squeezed out, as suggested in 
Fig. 21. At high velocities or with large dimensions 
this effect; analogous to a “skin effect,” greatly reduces 
the magnetic drag opposing the motion in the cylinder. 
A simple dimensional estimate such as we have made 
above indicates that the characteristic velocity for 
this effect is væ (10° cm/L) cm/sec, where L is the 
thickness of the stream, so that the magnetic field may 
be expected to be squeezed out of any large stream 
within the core. When the magnetic field is squeezed 
out of the moving matter, the question then arises 
into what spate is it repeiled, and this, of course, 
depends on the flow pattern and may help determine 
which flow patterns are possible. 

In Sec. 2 we have discussed the “thermal wind” 
explanation of the eastward flow and drift in the 

meteorological case with cold at the center of a cylin- 
drical vessel rotating eastward. This explanation is 
dependent on the assumption of sticking at the bound 
surface at the bottom, which through the rather 
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(a) (b) a 


Fic, 21. Magnetic lines avoid a flow stream. 


small viscosity keeps the motion slow near the bottom 
and thus permits practically no Coriolis contribution 
to a radial pressure gradient at low altitude. This is the 
basis for assuming equal pressures at the inner and 
outer edges of the bottom surface, and in this process 
friction is not negligible at least in the bottom layers. 
It has been mentioned that with cold in the center we 
have a natural thermal convection in a horizontal plane 
driven by the centrifugal force, but that with a warm 
boundary in the center, the flow pattern must deviate 
from the horizontal plane. This distinction is of interest 
in connection with the principle of constancy of average 
angular momentum per unit volume with varying radius 
in a nonviscous rotating convective fluid, of which use 
was made in the more general considerations of Sec. 2. 
In the explanation of the thermal wind with cold at 
the center, no use is made of viscous drag between the 
upper layers and the constancy of average angular 
momentum with radius must apply to the motion 
confined in each horizontal slab (if the motion is indeed 
horizontal). That this principle is consistent with the 
simple geometrical situation, typified by Fig. 3(c), is 
apparent from the fact that the rapidly moving matter 
in the jet stream constitutes a larger fraction of the 
matter at small radii than at large, making the average 
tangential speed greater at smaller radii. With a thin 
jet stream and with the outer radius much larger than 
the inner, this requires that the flow in the jet stream 
be much faster than the velocity Qp due to the rotation. 
With the hot boundary at the center, the thermal wind 
explanation requires circulation in the opposite direc- 
tion, as is also observed, and the motion is no longer 
thermally driven if it is purely horizontal, so there must 
be transport of momentum across horizontal boundaries 
and the conservation of angular momentum within any 
horizontally bounded region no longer applies. The 
simple qualitative geometrical consideration of flow 
in a plane (Fig. 3(c) with v reversed) still leaves con- 
servation possible, but only if the jet speed, which is 
now oppositely directed to Qp, is much greater in 
magnitude so that the average tangential speed is in the 
direction of the jet flow, backward. So great a-jet speed 
m need not be, and apparently is not, achieved, and 
similarly in the case first discussed with cold at the 
center, there is no assurance of horizontal flow and no 
need to satisfy the simple conservation requirement at 
all exactly. Insofar as there are deviations, they are 
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transmitted up from the closed boundary at the 
bottom by viscosity and by vertical components of flow. 

In the earth’s core there is no question about the 
source of the thermal drive, if thermal it be with a 
source of heat at the center, since the inwardly directed 
radial component of gravity provides ample motivation 
for the hot part of the stream to float toward larger p 
(so long aş the thermal gradient exceeds the adiabatic 
gradient). But even with the jet stream thus driven, 
and at the same time retarded by a magnetic field, some 
of the thermal-wind considerations might apply to help 
determine what type and direction of motion may be 
stable. 

Hide (private communication) has suggested that the 
direction of flow permitted in the earth’s core may be 
determined by comparison with his cylindrical experi- 
ment, with a careful use of inversions and neglecting 
the p component of gravity even though it is important 
in driving the convection. Let us replace the spherical 
core by a cylindrical one, similar to Hide’s experimental 
arrangement, Fig. 3(a); but with a lid on it in contact 
with the top surface of the liquid, and with gravity 
pointing downwards in the top half, upwards in the 
bottom half, as the z-component of gravity does in the 
earth [left side of Fig. 22(a) ]. The two halves differ in 
two factors, the direction of gravity and the apparent 
rotation looking down on the appropriate pole, and 
with these two reversals the resultant phenomena are 
the same in the two halves. The boundary between them 
in the equatorial plane thus acts as a free surface for 
both bodies of fluid. The upper half differs from the 
laboratory experiment, Fig. 3(a), only in the inter- 
change of free and bound surfaces between top and 
bottom, for gravity is down in each. The lower half 
differs from Fig. 3(a) only in the direction of gravity, 
for the bound surface is on the bottom for each case. 


Fic. 22. Various flow patterns. 
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Thus each half has one mechanical reversal from 
Fig. 3(a). But there is also a thermal reversal, for we 
must now make the center of our model hot if it is to 
correspond to thermally driven convection in the earth. 
These two reversals leave the direction of flow the same 
as in Fig. 3(a), or the same as in the temperate-zone 
atmosphere, eastward. 

Since the magnetic drag, though strong, provides no 
preference for eastward or westward flow, it seems 
reasonable to presume that this is the dominant effect 
in determining the direction of stable flow. 

The complexities introduced by the spherical bound- 
aries of the core and spherically radial gravitational field 
may affect the motion in several ways. With no Coriolis 
force, we would expect convective streams along the 
spherical radius 7, in patterns of which Fig. 1 is a simple 
special case. With a dominant Coriolis force alone we 
might expect the thermal convection to take place 
mostly in planes normal to the axis, at least in the region 
between the dotted lines on the right side of Fig. 22(a). 
The Taylor effect would lead one to expect the fluid 
above and below this region to follow the convective 
pattern in part, but there are difficulties in trying to fit 
the curved upper and lower surfaces and also the surface 
of the core. With limited influence of the Coriolis force 
a compromise something like Fig. 22(b) might perhaps 
be reached, involving not quite cylindrically radial 
flow, with convective columns (like the “idler wheels”) 
either bent, as shown, to try to fit the boundary, or bent 
in another direction to try to accommodate the “hair- 
pins” of the magnetic field. Another possibility is a 
cylindrically radial stratification, as suggested in Fig. 
22(c). Such stratification is observed by D. Fultz in a 
laboratory experiment with a fluid container of this 
spherical symmetry rotating rapidly about a vertical 
axis. The tendency of the Taylor effect to induce 
vertical rigidity of the flow pattern is less flagrantly 
violated if the flow pattern is confined to a fairly thin 
cylindrical shell of which the ends, while not flat, do not 
deviate from flatness as much as in a thicker shell. 
Among the possibilities for the transport of heat out 
through this sequence of shells would be a jet-stream 
flow pattern within each shell, as suggested in Fig. 
22(d). Such a pattern might break up into a collection 
of cylindrical vortices with a tendency to avoid radial 
drifts because of the change of length required, though 
a change in length of such a vortex in a nonideal fluid 
is possible, with a corresponding change in thickness 
and angular velocity about its own axis. The modifica- 
tions of any of these patterns by a magnetic field might, 
of course, be quite profound. 


18. Influence of a Jet-Stream Flow Pattern on the 
Mechanism of the Westward Drift 


Let us consider a jet-stream flow pattern similar to 
Fig. 3(c), or even Fig. 22(d), near the equatorial plane 
of the fluid core, perhaps somehow fading off gradually 
toward the poles. Some of. the places where a magnetic 
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field parallel to the axis could thread*through this 
pattern without being squeezed out of moving matter 
are the inner core and the “idler wheels.” If these bodies 
continue to rotate in spite of the magnetic field, there 
will be a tendency to concentrate the field in twisted 
bundles of magnetic lines near the axes of rotation, in a 
form of bending within these smaller bodies somewhat 
resembling the “hairpins” discussed on a larger scale 
in connection with Figs. 8 and 18. (In the smaller bodies 
the migration toward the axis would be relatively more 
important and the lines would probably not twist 
around the axis so many times as estimated for the 
simple flow in the whole core, even without the in- 
stability that leads to kinking.) The torques exerted 
on the core as a whole by the twisting of these bundles 
contributes to the drift of the pattern but only in a 
small way, both because they are of opposing signs and 
because of the comparatively small radii of the bundles 
acting as “lever arms” of the torques. 

We have mentioned that it is possible to postulate 
an energy source about 108 times as strong as needed 
for a generator of high efficiency, but we have no as- 
surance that the energy source is this great. The 
property of a dynamo like the one here discussed to 
build up the field to such a strength as to use up all the 
driving energy, with whatever inefficiency it may have, 
makes it seem likely that the field would slow down 
but not stop the rotation of the central core and the 
“idler wheels,” and become somewhat bunched near 
the rotation axes. 

Besides these twisted bundles through the rotating 
regions, magnetic lines are to be expected, perhaps 
trapped only temporarily as they are pushed through, 
in the space between the idler wheels and the jet 
stream, penetrating into the edges of each. The main 
contribution to the drift of the flow pattern would be 
expected to come from the drag on these magnetic 
lines, since this drag acts with the lever arm p to produce 
a large torque about the z-axis. We have seen in Sec. 
16 that there is reason to expect the jet flow to be 
eastward. This flow is analogous to the paddle wheels of 
a river boat always moving aft and pushing the boat 
forward, in this case pushing the fluid core with its flow 
pattern westward. There are various sources of resist- 
ance to the general rotation of the core with its flow 
pattern, analogous to the hull friction of the boat, in 
particular the frictional or local magnetic drag between 
the outer “idler wheels” and the mantle where they 
come in contact. 

We keep the primary result of the “thermal wind” 
mechanism, that the jet flow should be eastward, 
because the magnetic field does not introduce a prefer- 
ence. The secondary result of the “thermal wind” 
mechanism is that the drift also should be eastward, but 
this is opposed and presumably overcome by the 
dominating magnetic effect arising from the drag on the 
eastward jet flow. 

The mechanism of the westward drift is then rather 
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similar to that discussed in Sec. 12 and Fig. 19, where 
the fluid core was considered to be divided into two 
parts, the volume inside and the volume outside of a 
cylinder of radius B. Here instead we again divide the 
volume into two parts, the jet stream and the rest of 
the fluid. The jet stream flows eastward for reasons 
associated with keeping the angular momentum per 
unit volume of the inner half equal to that of the outer 
half. The magnetic drag on the jet-stream part is west- 
ward, so the general drift must be westward in order 
that the drag on the rest of the fluid may be eastward, 
permitting the two magnetic torques to balance in the 
steady state. In this case the entire jet stream pattern 
drifts westward and there is no contradiction with 
Vestine’s conclusion that the angular velocity of west- 
ward drift appears to be independent of latitude, as 
discussed in Sec. 12. 

The type of flow pattern suggested in Fig. 22(c) and 
(d), which divides the fluid core into a number of in- 
crements of cylindrical radius dp and may contain a jet 
stream in each dp, may be expected in a general way 
to tend to establish a flow with the average angular 
momentum per unit volume independent of p, as 
previously discussed in Sec. 12 Eq. (22), ff, since 
the drag between adjacent sections provides for a 
radial transport of angular momentum. The general 
remarks there made about the westward drift might 
then apply, with the irregular pattern of the observed 
field arising from the outer section dp which drifts 
westward. 


19. Influence of a Jet-Stream Flow Pattern on the 
Induction Process 


The salient feature of the induction theory that 
became clear from the discussion of Secs. 6, 8, and 10, 
especially in connection with Figs. 15 and 17, is that 
it depends not only on the existence of upward and 
downward streams in the equatorial plane, such as are 
present also in jet-stream patterns, but also in an essen- 
tial way on the spreading out (or drawing together) 
of these streams at the surface in such a way as to 
provide a north-south component of velocity, such as is 
absent from a strictly two-dimensional jet-flow pattern. 
Since the presence of a magnetic field, the inhomo- 
geneity, and the spherical geometry all induce deviations 
from the two-dimensional flow of the Taylor effect, it 
is in a general way not to be excluded that there may 
be some flattening of the jet stream as it comes against 
the surface to provide the essential north-south com- 
ponent flow. From the above discussion it appears not 
to matter if the spreading is more nearly in the shape 
of a cobra’s hood than of a mushroom. It is, however, 
crucial that it should be the outward stream that 
spreads and the inward one which contracts, correspond- 
ing to the way in which, for example, v in Fig. 17(f) _ 
points upward to the right. The change in sign of just 
one component of this velocity would make the differ- 
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ence between a regenerative and a deg 
tion process. 

It would require a quite drastic deviation from the 
almost-two-dimensional flow patterns we have been 
discussing to provide sufficient spreading of the out- 
ward stream near the surface. The deviation suggested 
Fig. 22(b), with flow in shallow cones is, for example, 
not enough. Here the outward stream in such a cone 
still makes an angle with the radius vector at any 
point such as to have a component toward the equator, 
whereas a component toward the pole is required. Also 
the pattern suggested in Fig. 22(c) and (d) does not con- 
tain the north-south component required for the 
dynamo process, but would provide a rotation of the 
inner core as a basis for the twisted-kink process. 

It was mentioned at the beginning of Sec. 16 that a 
magnetic field should encourage the formation of a jet 
stream. The tendency to form a large Elsasser toroidal 
field Hz as suggested by Figs. 8 and 19 may be quite 
strong, perhaps as a result of a transient period of 
haphazard radial transport before establishment of a 
jet-stream pattern (as is observed in the laboratory 
demonstrations of jet streams). Such a field would favor 
a z-direction stratification of p-direction flow, and thus 
might give rise to a jet stream or other patterns of 
radial flow in the equatorial plane and perhaps others in 
planes parallel to it (e.g., just two others about 1000 km 
north and south to conduct the heat away from the 
polar regions of the inner core) separated by intervals 
dz in which Hz is strong and the flow inhibited. The 
magnetic lines could then weave between the regions 
of radial flow to form hairpins approximately as in 
Fig. 8. The radial streams might then have an approxi- 
mately round cross section, like the body of a cobra, 
spreading out like the cobra’s hood where the flow ap- 
proaches the surface and turns in the direction of H» so 
as not to be impeded by it. It is in this extreme, in 
which the magnetic field completely suppresses the 
Taylor effect and associated “thermal wind” tendencies 
for flow resembling two-dimensional flow, that the 
Bullard dynamo induction process may perhaps be 
realized. 


enerative induc- 


Iv. THERMOELECTRIC AND OTHER THEORIES 
20. Properties of Matter at High Pressures 


In units used in the study of atomic structure, the 
hydrostatic pressure throughout the core of the earth is 
of the order of magnitude one electron-volt per cubic 
angstrom. It is capable of supplying enough energy 
per atom to excite the atom from one electron state 
or band to the next, but this is not exactly what it does. 
Instead it is expected to compress the atom enough to 
broaden and raise the bands. : 

At high pressure a metal is apt to go into a body- 
centered cubic lattice in which the volume attributed 
to each atom may be approximated by a sphere of 
radius r.. The behavior of the energy bands asa function 
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Fic. 23. Electron energy bands as affected by compression. 


of r, is represented schematically in Fig. 23. At large 
r, the levels are the atomic energy levels. As the atoms 
are brought together, first the upper unoccupied levels 
start to spread out into bands, then the level occupied 
by the outermost electrons, and at still smaller r, the 
lower electron states fully occupied by some of the 
inner electrons start to spread out into bands. The 
spreading may be thought, on a simple Heitler-London 
picture, for example, to arise from the fact that there 
are many states of the polyatomic system corresponding 
to one state of each atom, and that these degenerate 
states of the system start to “repel” each other because 
of the electron interactions when the wave functions of 
the corresponding electrons in the atoms begin to 
overlap. Thus the smaller the radius of the outer part 
of an electron wave function in an atom, the closer the 
atoms must come before the electron level starts to 
spread out into a band. 

As the atoms are brought closer together, another 
effect is the gradual rise of the centroid of the band to 
higher energies. This is analogous to the way the 
quantized states in a box are elevated in energy as the 
box is made smaller. More kinetic energy is prescribed 
as the atoms are squeezed into a smaller space. The 
cohesion and equilibrium radius for ordinary metals are 
determined by the fact that the highest occupied band 
is only partly filled, so the electrons may settle pre- 
dominately in the depression in the lower half of 
the band. 

As pressure is applied, at the radius at which it has 
imparted to the atom one or a few electron volts, most 
of this energy will be accounted for by the rise in energy 
of the electrons in the highest occupied band, though 
the contribution of the rise of the next lower band may 
also be significant. At the compressed radius, some of 
the lower bands will have begun to spread out, and the 
empty band next above the first occupied band may 
have begun to overlap it, as r,=A in Fig. 23. At normal 
pressures, r,= B, there is a gap between these twa bands 
and the material is a conductor or an insulator depend- 
ing on whether or not the first occupied band is partly 
or completely filled, that is, on whether or not the elec- 
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trons at the top of the “Fermi sea” may become con- 
ducting to other states in the same band, by being 
excited only a little by an electric field. If the gap 
between the two levels is not much wider than &7, or 
if it is divided into small jumps of this order by inter- 
mediate levels provided by impurtiy atoms, the sub- 
stance is a semiconductor with conductivity increasing 
with increasing temperature T. Thus we see that the 
application of pressure of the order of magnitude avail- 
able can change an insulator or poor semiconductor 
into a good semiconductor or a conductor by reducing 
or eliminating the gap. 

The magnitude and sign of the thermoelectric effect 
depends in a complicated way on the difference between 
the behavior of the electrons in the conduction bands 
of the two materials, and on the matching of the bands 
at the junctions. It is affected by the distribution of the 
density of levels in energy throughout that portion of 
the bands to and from which electrons may be thermally 
excited. The distribution of level density is, of course, 
different in the two materials, and the amount of 
thermal excitation is different at different parts of each 
because of the temperature gradient. While these effects 
are complicated enough that we cannot expect to find 
them completely calculated, it appears that the general 
nature of the thermoelectric process is not greatly 
altered by the application of high pressure. The tabu- 
Jated thermoelectric powers of the metals (relative to 
Pb) near room temperature range in absolute magni- 
tude from about 0.003 to 300 microvolts per °C differ- 
ence between the two junctions, with only about 
one-fourth as many above 10 uv/°C as below. The few 
exceptionally large values may be expected to arise 
from cases in which the bands in the two metals 
happened to fit together in some particularly favorable 
way, perhaps with the top of a band in one metal just 
overlapping the bottom of one in the other, for instance. 
High pressure spreads out the bands, causing them to 
overlap and lose some of their distinctive features, 
and high temperature smears out the surface of the 
“Fermi sea,” and averages over a wider region, so we 
may expect, in general, that the main effect of high 
pressure and high temperature will be to seek the 
average of thermoelectric properties and avoid extreme 
values. Thus it is a reasonable guess that the thermo- 
electric power encountered between two different sub- 
stances deep in the earth will be comparable with the 
representative, not the exceptionally large, laboratory 
values, say 1 to 10 uv/°C. 

The Hall effect depends in sign presumably on 
whether the conduction is primarily attributable to 
electrons or “holes” (that is, the empty states not 
occupied by electrons at the top of an almost-filled 
band). Since the pressure tends to make the relevant 
bands overlap, it voids the concept of “holes” in the 

simplest sense, and therefore might be considered to 
favor the probability of a negative Hall effect, due to 
electrons. However, the distinction is not quite so 
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simple, and depends on the derivative of level density 
at the energy of the conduction electrons. The magni- 
tude of the Hall coefficient depends on the effective 
mean free path of the electrons, that is, on how far 
they go before they suffer a collision and forget about 
their recent magnetic deflection, and this may vary 
with energy. The high temperature means that one 
must average over a quantity of varying sign and 
magnitude over some energy range, which again should 
tend to avoid extreme values. and give relatively small 
values of the Hall coefficient, still, however, of the order 
of magnitude of most laboratory values. The effective 
mean free path is shorter at higher temperature because 
of the greater disorder in the lattice, and this again 
tends to make both the Hall coefficient and the con- 
ductivity deep in the earth somewhat small compared 
to laboratory values, though this effect may be com- 
pensated by the ordering effect of high pressure. 


21. Thermoelectric Effect as a Possible Origin of 
the Earth’s Field 


Because of the easy distinction of the presumably 
metallic fluid core and the solid mantle which may be 
assumed to consist of minerals not very different from 
those we know near the surface of the earth, particu- 
larly olivene, (Mg, Fe):SiO4, the first modern attempts 
to treat the origin of the earth’s magnetism did so on 
the assumption that the primary conductivity takes 
place in the core, while the mantle was assumed to be 
almost an insulator and to conduct only enough to 
attenuate the fluctuations of the fields produced in the 
core. With this assumption, Elsasser (1939) proposed 
that geomagnetism might have its origin in thermo- 
electric currents within the core. The convection within 
the core was assumed to be driven by a radioactive 
heat source near the center and to consist of upward 
and downward streams, in a pattern perhaps similar 
to Fig. 2 though a smaller-scale convection was favored 
in the discussion. The thermocouple was obtained by 
assuming that the warmer upward stream had also a 
somewhat different material composition from the 
downward cool stream, perhaps as a result of random 
inhomogeneities in composition. The effect of the 
Coriolis force tends to keep the circulatory motion 
parallel to the equatorial plane, and the otherwise 
random inhomogeneities in the temperature-composi- 
tion pattern are then expected to produce a magnetic 
moment parallel to the axis of rotation. Difficulties 
were encountered [Inglis and Teller (1940) ] in permit- 
ting large enough velocities for a sufficient Coriolis 
force and large enough temperature differences for a 
sufficient thermoelectric force without requiring more 
heat transport than seemed compatible with the 
observed flow through the crust. 

The more recent realization that the conductivity 
in the mantle can be great enough to contribute to the 
primary process makes it possible to improve on this 
early mechanism, but the same difficulties still make 
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Fic. 24. Possible thermoelectric currents. 


it implausible that thisis the main source of the earth’s 
magnetism, as we shall see. In Fig. 24, a pattern is 
drawn of six convective loops in the equatorial plane, 
and it is assumed that the loops rotating counter to 
the earth’s rotation are larger than the others because 
they are favored by the Coriolis force. Alternatively, 
we could just as well consider the jet-stream pattern of 
Fig. 3(c), in which the angle between the upward and 
downward streams of a given loop where they make 
contact with the mantle is less than the angle occupied 
by the adjacent ‘‘idler wheel” region. Let us now con- 
sider the thermocouples formed by the mantle as one 
conductor and the core as the other. The hot and cold 
junctions at the surface of the core are marked H and 
C, respectively, in Fig. 24. If the thermoelectric power 
of the mantle relative to the core is positive, current 
flows in the six loops as shown, the three larger loops 
clockwise and the three smaller ones counterclockwise.§ 
There is thus a north-south asymmetry that may be 
expected to give rise to a net magnetic dipole moment 
parallel to the axis. 

The upward flow of heat through the crust is observed 
to be about 1.5-10-® cal cm sec (Jeffreys 1952, page 
282) which, if assumed to apply also to suboceanic 
flow, amounts to 8-10” cal/sec for the whole earth. 
This may be accounted for entirely by radioactivity i 
the criist. Thus we have no indication that there is 
any upward heat flow through the outer mantle. We 
have no reliable indication of its temperature. Urey’s 
mechanism for convection driven by the settling of 
heavier constituents provides a very diffuse source of 
heat, spread out wherever there is viscous friction, or 
electric dissipation from magnetic drag opposing the 


54) has discussed a thermal and current distribu- 
Ee An Boma REE Legendre polynomial Pi™@)e?™? (de- 
pending upon further interaction with the flow to change an 
“electric-mode” field into a “magnetic-mode field), but it should 
be noted that the distribution of Fig. 24 cannot be approximated 
in this way without introducing ,additional symmetry, which 


would nullify the proposed effect. 


large-scale convection. A reasonably efficient thermo- 
electric generator requires a more concentrated source 
of heat. Meteoric samplings, presumably of a shattered 
planetary mass perhaps similar to the earth’s interior, 
make it seem unlikely that it contains a concentrated 
radioactive source of heat, but this is not direct proof, 
The solid inner core of radius 1250 km within the fluid 
core would be a favorable place to have a source of 
heat. Elsasser (1950, page 28) has postulated that UO; 
and ThOs might provide such a source, but Urey (1951, 
page 270) objects that it is difficult to imagine any 
geochemical process which would result in the segrega- 
tion of these oxides. 

For the sake of examining the thermoelectric theory, 
let us simply assume as large a heat source in the inner 
core as may be needed, and still not contradict directly 
observed terrestrial data such as the flow through the 
crust. We may assume that the earth started out 
fairly cool and has been heating up from the center ever 
since, so long as the heat wave has not penetrated out 
to the surface. The heat conductivity over the large 
distances is so small and the heat capacity of the large 
mass so large that this process will permit a somewhat 
greater heat transport across the fluid core than is ob- 
served up through the crust. It is thus not necessary 
to assume a steady state as was done by Inglis and 
Teller (1940). 

We assume a steady source of heat in the inner core, 
some of which is used to heat up the core to a tempera- 
ture which throughout the fluid core is almost inde- 
pendent of radius because of the convection. Let the 
rest of it, the rate of flow of heat up through the inner 
surface of the mantel, be S. If an inner layer of the 
mantle of thickness r— Re, density p and specific heat 
s, is heated to an average temperature T in the age of 
the earth /, we have roughly 


St=4aRe*(r—Re)psT. (52) 


If this heat flow passes into the mantle by conduction 
rather than convection, there will be a temperature 
gradient at the inner surface somewhat larger than, 
but of order of magnitude, T/(r—Rc), and with heat 
conductivity A, we have 


S=4rRAAT/(r— Re). (53) 


In terms of electric conductivity s which is expected 
to be not so strongly dependent on temperature we 
have, from the Wiedemann-Frantz relation (Chapman 
and Cowling 1952, page 316) z 


A= 2.44 10°To g cm? sec deg’. (54) 


Combining these three equations we obtain a rough 
estimate of the electrical conductivity ¢ required for an 
inner layer of the mantle of thickness (r— Rc) to be 
heated to a temperature T by thermal conduction in 
the age of the earth: 


o= (r—Re)*(ps/2.4 10°TH g cri sec de O 
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In evaluating this roughly we may take ps=1 cal deg ! 
cm™? and ¿= 10" sec. Because, as was mentioned in the 
discussion of Sec. 20, the pressure may be assumed to 
have spread the upper occupied electron energy bands 
in olivene enough to wash out the distinction between 
conductor and semiconductor, and it is plausible that 
the conductivity might have a value even as high as 
one characteristic of very good metallic conductors, 
a= 10° ohm™ cm™= 10 cm™ sec, ten times as high as 
in Eq. (7). The temperature at the surface of the core 
cannot be higher than the melting point of silicate 
rocks there. At laboratory pressures they expand on 
melting so high pressure inhibits melting. At zero 
pressure the melting point is about 1300°C and at the 
pressure of the surface of the core Uffen (1952) has 
estimated an upper limit of 6000°. For purposes of 
orientation, we find from Eq. (55) that the conductivity 
required so that the inner half of the mantle, with 
(r—Rc)=1500 km, may be heated by conduction to 
half of this temperature, 3000°, is about 


o=10-> cm™ sec. (56) 


It is thus very likely that the conductivity of the 
lower half of the mantle is great enough to have heated 
it to a rather high temperature, if a sufficient source of 
heat is available in the core. At the lower pressures of 
most of the outer half of the mantle, the conductivity 
would be expected to be considerably lower than that 
of metallic conductors, and perhaps less than Eq. (56), 
and without transport by conduction alone this could 
prevent the heat from penetrating too abundantly out 
to the surface even if the inner half were heated to, 
say, 3000°. 

With the enormous dimensions available in the earth, 
plastic flow is considered to be a likely mechanism of 
heat transport throughout most of the mantle, in spite 
of its being sufficiently solid to transmit rapidly 
transient transverse seismic waves. If there is no 
regularly stratified chemical inhomogeneity to inhibit 
convectiqn, this mechanism could limit the temperature 
of the inner part of the mantle to rather smaller values, 
perhaps near 2000° (Verhoogen, 1954). 

For the sake of all reasonable generosity to the 


thermoelectric theory of terrestrial magnetism, let us | 


assume that the inner half of the mantle has been 
heated to a temperature of 4000°. The rate of heat flow 
required to do this in the age of the earth is 


S=2X 10" cm*psT/t=1.5X10" cal/sec, (5 7) 


which is only about twice the flow out through the outer 
crust. Since most of the latter must be attributed to 
radioactivity in the crust, we thus gain perhaps a factor 
ten over a permissible steady-state assumption. 

In order to obtain as large a thermoelectric effect as 
possibie with this limited heat flow, we assume that a 
circulation in the core similar to that in Fig. 24 takes 
place in quite narrow filaments, of diameter D, cross- 
sectional area D?. Each rising filament heats a consider- 
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able area, of order of magnitude R?, of the core-mantle 
interface. Presumably the conduction in the mantle is 
poorer than in the core (perhaps because of less loca] 
convection within the layer) and the temperature at the 
interface at any point will be nearly that of the stream 
in the core. We may thus take the same temperature 
difference AT between the centers of the junctions H 
and C in Fig. 24, and between the rising and descending 
filaments in the core. The temperature of the interface 
will vary continuously between the limits at H and C, 
so in effect we have two large thermal junctions, 
each of area about R?, with an effective difference of 
temperature something like AT/2. 

We then have a thermoelectric current approximately 


I~RoQAT/2 (58) 


going around a circuit of area roughly R2, to make a 
magnetic moment 


u~IR?= RoQAT/2 (59) 


for each of the six loops. Becduse of the cancellation 
between the loops, the total dipole moment may be 
comparable to that for one loop. Here Q is thermo- 
electric power, ø is electric conductivity, and R the 
radius of the core, 3400 km. Taking the representative 
values Q=10~* volt deg= 10° g? cm? sec? deg and 
o=10% cm™ sec, we have 


u=2X10AT g? cm! sec deg!= 10% dyne? cm?, 


the latter being the observed value of the earth’s 
moment. We thus estimate the order of magnitude of 
the temperature difference required to be 


AT=5°C, (60) 


or perhaps more because of possible further inefficiency 
of the process. 

If for the product of the density and specific heat of 
the core fluid we again take ps=1 cal deg cm, the 
net upward heat transport by WV pairs of upward and 
downward streams, each of speed v and area D?, is 


ND*1psAT =6 X 10!"N (D/R)?0 cal/cm. (61) 


Tf we equate this to the heat flow of Eq. (57), taking 
N=3, we find the speed 


v= 10-5(R/D)? cm/sec. (62) 


This is an upper limit: if AT is larger than Eq. (60), 
v must be correspondingly smaller in order not to carry 
too much heat. 

Unless the stream diameter D is very small, this is a 
small velocity and the question arises whether the 
Coriolis force is large enough to have any appreciable 
effect on the motion. The most favorable case is the 
limit in which Eqs. (60) and (62) apply. The Coriolis 
force per unit volume arising from the earth’s angular 
speed Q~ 107 sec™ is 


Fe=2pi=2X10-(R/D)2p cm sec 
=2X10-8(R/D)? dyne cm™, (63) 
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with Eq. (62) and p=10 g cm”. A fair indication of the 
other forces acting on the fluid is obtained by estimating 
the buoyant driving force, which is the difference 
between term (2) of Eq. (1) for the upward and down- 
ward streams, 


F'p=apgAT=3X 10 cm sec~?, 


with the céefficient of expansion a= 107° deg and with 
g=600 cm sec~*. In the complicated local balance be- 
tween the terms of Eq. (1), it is not clear how large the 
Coriolis term would have to be to introduce an appreci- 
able asymmetry in the convection pattern. The Coriolis 
force acts in a direction normal to the driving force, and 
is compensated directly rather by gradients of the 
pressure that are built up, while the driving force is 
compensated more directly by magnetic drag and 
perhaps viscous forces. It seems safe to assume that 
the term C, if not quite as large as the others, will have 
to be within an order of magnitude of them to have 
any appreciable influence. Putting Fe=Fp/10 we find 
that the stream diameter D must be very small, 


D=10R. (64) 


(This large v is required only by the thermoelectric 
theory; an induction theory allows AT, and thus the 
driving force per unit volume to be much smaller and 
a smaller v in a thicker stream may still leave the 
Coriolis force significant.) 

Though the very narrow stream required by the 
thermoelectric theory seems extreme, the crowding of 
the stream by the magnetic field discussed in connection 
with Fig. 21 might perhaps provide a sufficiently effec- 
tive mechanism to confine the flow to such fine filaments. 
It could also provide the drag to prevent further 
acceleration, and it is needed for this since even with 
such narrow streams ordinary viscosity is not sufficient. 
If the flow just outside of the filaments should remain 
locally laminar, the retarding force per unit volume of 
the filament would be about 


Fy=Vn/D?=10-"y cm™ sec. (65) 


For this to equal the driving force, Fy=Fp with p= 10 g 
cmc? would require an enormous coefficient of viscosity, 
3 10 g cm™ sec“, about that for cold pitch and much 
larger than the usual values for liquid metals, 10~° to 


10 g cnr sec. 
22. Combined Thermoelectric and Hall Effects 


Thus we see that rather extreme assumptions are 
required to make a thermoelectric theory of the earth’s 
field adequate, either an extreme geometry or extreme 
and implausible values of some of the physical proper- 
ties of matter such as the thermoelectric coefficient Q. 
These unpleasant requirements arose from a desire to 
ough to influence the 
mmetry in the spacing of the hot 


flow, to attain an asy 3 
; fd cancellation of the con- 


and cold junctions so as to avo 


Fic. 25. Hall effect influencing thermoelectric currents, 


tributions to the earth’s magnetic moment. An equiva- 
lent asymmetry may be supplied by invoking the Hall 
effect on the currents in the mantle, in a comparatively 
simple and attractive scheme suggested by Tatel, Tuve, 
and Vestine (Vestine, 1954). It requires temperature 
differences of the order of magnitude given by Eq. (60), 
but the velocities may be as small as or smaller than 
Eq. (62) with D~R, since the process is not critically 
dependent on Coriolis force. Just as with the thermo- 
electric effect alone, the streams must carry enough 
energy to supply the Peltier heat, but the general 
thermodynamic argument mentioned above indicates 
that they can. 

Let us assume an even simpler convective situation 
that we have been considering, with simply a cold belt 
C, around the equator and a warm region W, at each 
of the poles of the core-mantle interface. This could 
come about because the velocities are so low that 
Coriolis effects are negligible compared to the magnetic 
field which inhibits the sort of flow we have been 
discussing in the equatorial plane but permits a modified 
Bernal cell flow between the inner core and the polar 
regions of the mantle, parallel to the magnetic field. 
The opposite situation, with cold at the pole and a warm 
belt at the equator, could he established if the Coriolis 
effects are more influential than the field. Either situa- 
tion is as good as the other for the first step of the 
process. In this step it is only the symmetry that 
matters unless we go farther and worry about the 
signs of the thermoelectric and Hall coefficients which 
are difficult to guess from experience confined to the 
laboratory. In Fig. 25, the circle represents the core- 
mantle interface, and the system of currents J; is set 
up by the thermocouple. With this system of currents, 
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the Hall effect in the mantle (where it is expected 
to be much more important than in the molten metal 
of the core) if it happens to have the appropriate sign 
can produce a regenerative’ process. A stray field H 
produces Hall currents J» in circles about the axis in 
the directions indicated (cross, into the paper: dot, 
out), which tends to regenerate the part of the field 
threading through it. 

Examined in a little more detail, the scheme is not 
quite as simple as it seems at first. Since a circle of 
current Ja contributes only to the field threading 
through it, attention should be drawn to the question 
“What current supports the field immediately sur- 
rounding the equatorial circle?” In Fig. 25 are indi- 
cated successive loops of H about the point on the 
equator marked C. For each such loop we have 


if H-ds= i J2-dS, (66) 


where the first is a line integral around the loop and the 
second is a surface integral over the area enclosed by it. 
Considering these quantities as functions of the distance 
r from C (after separating off a common angle factor 
in the neighborhood of C where the geometry is simple), 
we may take H(r) proportional to the Hall current 
Jə(r) that it produces, H(r)=k-J2(r), and consider 
|dS|=dr Sf |ds]. We then find 


J(= f Jo(r’)dr’ or J:=4Ae*. (67) 
0 


Here & is essentially the Hall constant divided by the 
conductivity and has the sign of the Hall constant. 
Depending on whether & is positive or negative, 
the circulating current either builds up or decays 
with increasing 7 from the value of J2=A at the 
appropriate point near the equator at which H 
vanishes. This r=0 value of Je is not supported by the 
Hall effect. With a Hall constant of the appropriate 
sign (positive for the Jı shown in Fig. 25), the effect 
here considered can merely act as an amplifier of an 
initial Js at the equatorial point r=0, and then only 
if this initial Jz has the appropriate sign (the one 
indicated in Fig. 25 for the stray H there shown, or 
the opposite one if H is reversed). 

The rectifying action of this last condition provides 
the necessary north-south asymmetry, to take the place 
of the Coriolis-force used in-Sec. 22. Consider the case 
in which the equatorial plane, while in general perhaps 
warmer than the poles, has convection and temperature 
differences within it. If in Fig. 24 the Coriolis influence 
is relaxed so much that there are hot and cold junctions 
equally spaced around the equator, then the thermo- 
electric current 7 at or very near the equator would 
have directions alternating with longitude, and only in 
alternate sectors the amplification is described by Eq. 
(67) would be effective {unless the stray H is more 
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complex, alternating between the sections, in which 
case it would be expected to be unstable and grow in 
some sections at the expense of the others). At this 
stage we would have a net magnetic moment of the 
earth, but with a field more dependent on longitude 
than observed at the surface. As a further step, the 
amplified Jz arising from the Hall emf in the favored 
sectors might overwhelm the primary currtnt i near 
the equator in the unfavored sectors, leading to ampli- 
fication in those sectors also and smoothing out the ob- 
served field. 

Though various other motions and effects would be 
superposed on this thermal mechanism, it does not 
appear that they would much modify it unless they are 
so strongly degenerative as to nullify it. In particular, 
if there is any convection parallel to the equatorial 
plane, the mechanism of the westward drift (Sec. 12) 
would be operative and tend to make toroidal magnetic 
fields in the core, the “hairpins” of Fig. 16. The sche- 
matic straight lines of H drawn in Fig. 25 would be 
bent also by the p components of the convection. This 
flow might include a jet stream warming the equatorial 
belt more than the polar regions, when averaged over 
longitude. 


23. Regeneration by the Electric Field Induced 
by Compression 


Quite independent of the complexities of fluid flow 
within the core and even independent of the rotation of 
the earth, there are possibilities for a relatively simple 
regenerative mechanism for maintaining the earth’s 
field arising from the gravitational field within the 
enormous mass of the earth, and the consequent electric 
fields. In a series of early papers on the subject, Gunn 
(1929) has suggested some of these possibilities. He 
discusses in particular the motion of conduction elec- 
trons in an electric or gravitational field crossed by a 
magnetic field and their contribution to regenerating 
the magnetic field. In more recent unpublished re- 
marks he has suggested that the source of the electric 
field is probably the temperature gradient (Nernst 
effect), but it seems that an even more potent source of 
electric field is to be found in the compression caused 
by gravity, and it will suffice to present this as perhaps 
the most important representative of the theories of 
this general type. i 

In laboratory physics one makes a habit of thinking 
that electric fields cannot exist within an ordinary 
conductor in which no current is flowing, but with 
thermal gradients or in a conductor so large that gravity 
is important this is no longer true. Gravity acts on the 
nuclei but practically not at all on the electrons, and 
thus tends to cause a separation of positive and negative 
charges. This tendency is strongly opposed by the 


attraction between nuclei and electrons, accounting - 


for the high stability of atoms, but in a large conductor 
Such as the interior of thé earth, a small partial separa- 


8 CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


H Digitized by Arya Samaj Foundation Chennai and eGangotri ia 


orn 


{ 
$ 
, 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


THEORIES OF THE EARTH'S MAGNETISM 245 


tion may be effected and electric fields m 
in the process. 
pon in ee 26 the small elements of volume 
Vi and V2 containing identical collections of atoms 
(or single atoms), but with V, deeper and more highly 
compressed than Vy. The conduction electrons are 
piled up to a higher kinetic energy or “Fermi energy” 
in Vo thar in Vy, that is Er:> Eprı. This is a rough way 
of speaking of the phenomenon illustrated in Fig, 23 
in connection with which it was mentioned that the 
pressure well within the earth amounts to an energy 
per atomic volume of the order of magnitude of one 
electron volt. 

3y way of a simple estimate of this quantity, a 
column of matter 1 cm square and 3400 km high with 
an average density of 7 g/cm? in an average gravita- 
tional intensity (2/3)g~700 cm/sec? exerts a pressure 
at the bottom of 1.710" erg/cm*= 10% ev/cm*. Ordi- 
nary iron at zero pressure, for example, with a density 
7.85 g/cm? contains 8.710” atoms/cm*. If the atoms 
were compressed to twice this density at the center 
of the earth, or about 1.710" atoms/cm’, this pres- 
sure would correspond to 6 ev per atomic volume. 

Simple quantization in a box of one electron per 
atomic volume shows that the kinetic energy per 
electron is 


KEn= (3/40)h?/mV3=2.25 107 ev-(cm?/V3), (68) 


ay be developed 


where V is the atomic volume and / is Planck’s constant. 
The atomic volume for ordinary iron, V=(1/8.7) 10 
cm? makes KE,,=4.4 ev. Reducing V to half this in- 
creases KE, to 7 ev, an increase of 2.6 ev per atomic 
volume, which is about half of the pressure energy 6 ev 
per atomic volume estimated for the center of the earth 
if iron were compressed by a factor two. Thus about 
two “conduction electrons” per atom would be required 
in order that their increase in kinetic energy would 
account for this compressibility. Actually, iron has more 
valence electrons than this but they are not all equiva- 
lent to conduction electrons. The compression reduces 
their potential energy by squeezing them closer to the 
nucleus at the same time that it increases their kinetic 
energy, in a manner in which even Fig. 23 is a simplified 
representation, so the net effect may be about the same 
as the increase of kinetic energy of about two simple 
“conduction electrons” of which the potential energy 
is ignored. It is, on this rough theoretical basis, plausible 
that so drastic a compression should take place. Empi- 
tical evidence seems to require almost this much. 

For the sake of a rough estimate, let us assume that 
most of the mantle is conductor—our treatment will 
well to the core alone, or to the two 
together considered as one large conductor. In a deep 
element of volume Va the Fermi energy Ere exceeds 
Ep, higher up by one or two electron volts, say 2 ev. In 
there is no dam to support the higher 
until an electric field develops by 
raction of the conduction 


apply about as 


such a situation, 
electron level at V2 
the migration of a very small f 
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Fic. 26. Effect of radial electric field arising from pressure gradient. 


electrons. The migration stops when the electric field, 
or difference in electric potential, has lifted the bottom 
of the “Fermi sea” of electrons at V, until the top 
levels coincide at the two positions. This requires a 
potential difference (Ep2—Er:)/e=2 volts between Vi 
and Va, the region around V, having acquired a net 
negative charge and around V2 a net positive charge, 
say Q in the whole inner part of the mantle. We thus 
have concentric spherical shells, the inner one with Q, 
the outer one with —Q, and an electric field confined 
to the region between them of magnitude E=Q/r 
=~ 2 volts/10% cm=7 10” esu/cm, with r varying near 
2 10% cm, requiring Q~10-* abcoulomb or 6 10" elec- 
tron charges, about 10" e per atomic volume. 

The top of the electron sea is fuzzy because of the 
high temperature, as indicated by the short curves 
near the upper right corner of Fig. 26 representing the 
shape of the Fermi distribution in energy. The “tailing” 
of the distribution extends over about kT ‘which for 
T=3000° is about + ev. This is enough smaller than 
Er2— Er: to show that the temperature is only a sec- 
ondary cause of electron migration. 

Perhaps the simplest of the many inadequate 
“theories” of the earth’s magnetism is the old sugges- 
tion that this charge distribution Q rotating with the 
earth has a magnetic moment. The outer, negative, 
shell, of course, predominates and incidentally gives 
a moment of the observed signg with the magnetic north 
(-seeking) pole at the geographic south. The magnetic 
field at the center of a loop of radius 7 with this charge 
flowing around it at angular velocity 2 is QQ/r= 10-7 
oersteds for the earth with Q= 10° sec, r~10* cm, 
about 107*5 too small! s 

Let us assume that there is an initial stray magnetic 
field H in the negative z-direction (into the paper in 
Fig. 26) and discuss the possibility of regeneration by 
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the electron motion in this and the electric field Æ in the 
positive y-direction (radially outward). The path of a 
(hypothetical) free electron in such crossed fields is a 
cycloid, the superposition of a uniform rotation on a 
uniform translation, since there is a moving coordinate 
system with its 7,=—cE/H, in which the magnetic 
force due to the translation just compensates the 
electric force and relative to which the motion is there- 
fore circular, with Hr’=mv’c/e. There are two possible 
types of cycloidal motion as shown in Fig. 26, one with 
loops in which the translational v,<v’, the speed of the 
rotational motion in the moving system and one without 
loops in which v,>0’. 

The statistical-mechanical concept of electrons travel- 
ing in free paths between collisions with atoms provides 
a useful approximation to the more refined discussions 
in terms of the states in energy bands and the interac- 
tion of these with lattice irregularities. Comparison of 
the two approaches shows that the mean free path, A, 
is to be taken to have the order of magnitude 5 107" cm 
for a very good conductor under ordinary conditions 
(Seitz, 1940, page 184). This is much less than the period 
(in x) of the cycloids encountered in the fields with 
which we are concerned. The curvature of each electron 
free path will, however, correspond to some fairly short 
segment of a cycloid. We make the usual assumption 
that the free paths start in random directions, and we 
provisionally neglect any dependence of on energy. 
The problem is to average over the a-displacements of 
all these free paths of length A, and it would not be 
surprising if the general tendency of the cycloid to 
sweep toward the negative «-direction should also 
appear in this average, as it obviously must for long À. 

Instead of averaging properly over angles, let us 
examine the four simplest examples, with electrons 
leaving their collisions in the positive and negative 
x- and y-directions. With the initial velocity vo up or 
down, at point a or b on the upper cycloid in Fig. 26, 
the energy is changed in a path A by -+LAeE so the 
average energy during the flight is 


md =m +eE/2, (69) 


and roughly we may take the average radius of curva- 
ture to be 


p=meb/eH = po(1tdeE/2mn,?- - -). (70) 


In either case the average x-component of the excursion 
is s=p(1—cos(A/p)=)2/2p and the average over paths 
starting at a aŭd bis + 


(Zat Es)/2=MeE/4miypo=MEEH/4mcv (71) 


primarily toward the right in Fig. 26 and against the 
general cycloidal drift because of the greater curvature 
of the path starting at a than at b. Consider now the 
paths starting at c and d in Fig. 26, with vo horizontal. 
Here we have 


1/a=0%y/ds= (0"y/00)/v8=(—EszvoH/cje/mue (72) 
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in which cases = +5 sin(A/p) = +A (1—)?/6p?) and we 
have 


(&.+44)/2= —-XEEH /3 mcv? (73) 


toward the left in Fig. 26 and with a coefficient 3 
slightly larger in magnitude than the coefficient } in 
Eq. (71). Thus the average over the four starting 
points is e 


T= —MCPHLH/24m?co93, (74) 


indeed in the direction of the general cycloidal drift. 
We use this as a rough estimate of the average over-all 
directions of vo. In electron experiencing %/A collisions 
per second has a corresponding drift velocity (vo/A)Z 
and contributes (e/c) (vo/A)@/R? to the magnetic field at 
the center of the earth. Taking two electrons per 
atomic volume V=10~* cm? throughout a volume of, 
say, R? with R=1.5X10° cm we have for the order of 
magnitude of the regenerated field 


H=KH, K=()2/12v) (e/mc*) (eER/mè) 
~2X10-%. (75) 


In evaluating K we have also used e?/mc?=2.8 10-* cm, 
eEr=2 ev, mvọ=ż4 ev, the value of 2kT for about 
3000°. In this rough estimate, the mechanism proposed 
thus lacks a factor 500 of being regenerative. The sign, 
at least, appears to agree with observation (and leaves 
no room for the controversial reversals). 

This estimate contains many roughly guessed factors, 
but still makes it seem extremely unlikely that a 
mechanism of this general type could work. Perhaps 
the greatest uncertainty is in the effect of very high 
pressures on effective mean free path, particularly in 
minerals, but in making the estimate we have already 
been fairly generous in extrapolating from known data 
for metals, taking a value of A for silver, about 7 times 
as great as for iron at zero pressure. Extrapolation 
from data at laboratory pressures (Bridgeman, 1949) 
would indicate an increase of A for iron by about a factor 
3 at the pressures near the earth’s center at room tem- 
perature, but this expected increase is more than offset 
by extrapolation of the decrease with temperature. 

There is a possibility that the effect is considerably 
enhanced by the chemical discontinuity between the 
core and mantle. Though contact difference of potential 
at zero pressure is commonly of the order of magnitude 
one volt, comparable to the two volts we have just 
used arising from pressure difference in a homogeneous 
medium, it seems likely that the contact potential will 
be considerably enhanced by very high pressure such as 
encountered at the surface of the core. If the effective 
number of conduction electrons per atom in one sub- 
stance (iron-nickel alloy): is half that in another 


(olivene), the Fermi energy may be raised by the pres- ~ 


sure twice as much in the first as the second. It does not 
seemilikely that this type of contribution will yield 
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more than a factor ten and it may have the w 
because the lighter atoms of olivene h ; 
trons in the closed shells to oppose co 
does iron. i 

It has been noted that this mechanism is not de- 
pendent on the rotation of the earth, but if it were the 
primary source of the regeneration the direction of the 
field coulde still be coupled to the axis of rotation b 
the differential rotation of core and mantle, and ie 
general mechanism of the westward drift already dis- 
cussed would still apply. 


rong sign 
ave fewer elec- 
mpression than 


24. Inadequate Generation by Differential Rotation 


Among the other theories depending on the mean 
free path in transport phenomena under peculiar cir- 
cumstances, there is one depending on the radial 
gradient of horizontal velocity that is of some slight 
present interest because it has fairly recently been 
given an experimental test which shows it to be inade- 
quate. Consider the component circulating about the 
axis, va, of the velocity of the core fluid as a function of 
distance from the axis, vg(p). In familiar statistical- 
mechanics fashion, the electrons passing a point at p 
carry in addition to a random motion the velocity 
characteristic of the place predominantly near p=) at 
which they had their last collision, and this velocity 
may be expanded 


velp) = 04(p)AdI4/ Oot (X*/2)d*v4/dp?. (76) 


There will thus be, on the average, a drift velocity 
proportional to the last of these terms and a current 
i=eN\20%04/dp? giving a magnetic field H~ip at the 
center of a thick, doughnut-shaped loop which scales 
as vg/p if we take à and the electron density M to be 
the same in the cases compared. The experiment 
(Lochte-Holtgreven, 1953, and personal communica- 
tion) involves a vigorously churned pot of mercury with 
p=1cm and 2%~100 cm/sec. A field of about 10~* 
oersted was observed. Scaling this to the earth with the 
1/p factor only, even taking this too-high value of ve 
for the earth, gives a result too small by a factor 10”. 


25. Concluding Remarks 


The more natural title “Conclusion” for this section 
must unfortunately be avoided because the present state 
of knowledge permits no definite conclusion. In the 
light of our very limited knowledge of the earth’s 
interior, we have examined the strong points and the 
weaknesses of a number of proposals, and cannot with 
finality select any one as along plausible. We are struck 
not by the paucity but by the wealth of possible origins 
of the earth’s magnetism within the framework of 
Nature’s laws as we ordinarily postulate them, and thus 


- find it quite unnecessary to give heed to any possibility 


of a new law to account for just this phenomenon (such 
as Blackett’s hypothesis of & universal relationship 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


between rotating matter and magnetic moment, for 
which empirical backing has vanished with newer 
astronomical measurement). 

The induction theories are concerned primarily with 
details of the fluid motions within the earth’s core and 
it seems extremely likely from what has been said that 
there must be a considerable variety of hypothetical 
flow patterns that will cause regeneration, and many 
others that will not. It is likely that there is an entirely 
adequate source of energy to maintain the field with 
the rather low efficiency that would be expected of 
such a mechanism. The evolution of flow patterns that 
would be induced by the accelerations possible with 
such a source would be expected to encounter some 
flow pattern that would generate a magnetic field and 
be stabilized by the decelerating influence of that field. 
Thus even without any comprehensive understanding 
of the details it seems likely that a mechanism similar 
to one of the two induction theories, or to a combina- 
tion of them, should be providing us with the earth’s 
field unless the field is already provided by some even 
more favorable mechanism (in which case the evolution 
might be stopped by the decelerating influence of the 
field before reaching a regenerative flow pattern). 

The two induction theories, while quite similar in a 
general way, differ markedly in the fact that the dynamo 
theory contemplates that the magnetic field will 
modify the flow pattern mainly by opposing the flow 
along paths it might follow without the field, whereas 
the twisted kink theory, while opposing acceleration of 
the general flow pattern, may also cause local disturb- 
ances in the flow by carrying matter along with the 
kink-forming process. For the dynamo theory with a 
given flow pattern, the main question is one of sign, 
whether it is regenerative or degenerative, and only 
incidentally whether it is strongly enough regenerative. 
For the twisted kink theory there is essentially no 
question of sign; it is regenerative if the twisting Is 
rapid enough to make kinks at all, but perhaps not 
strongly enough regenerative to overcome both the 
tendency of the kinked loops once formed to attenuate 
and the possible degenerative tendency of any dynamo 
action (and thermoelectric action for that matter) that 
may be present. 

Among the other theories, the combination of thermo- 
electric and Hall effects seems the most attractive. 
Here again it is a question of sign whether it is con- 
tributing to, perhaps beinf primarily responsible for, 
the earth’s field or opposing it. 

Sincere thanks are expressed to W. M. Elsasser, E. 
Teller, M. A. Tuve, H. C. Urey, and E. H. Vestine for 
stimulating discussions during 1950 as well as later, 
when a first draft of this review was being written, 
and to E. C. Bullard, T. G. Cowling, D. Fultz and 
colleagues, R. Gunn, R. Hide, S. K. Runcorn, S. Chan- 
drasekhar, and H. Tatel for, more recent suggestions 
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and deeply valued discussion and criticism of various 
aspects of this problem. 
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REVIEWS OF > 
MODERN PHYSICS 
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| 


Memorial Symposium Held in Honor of Enrico 
Fermi at the Washington Meeting of the 
American Physical Society, April 29, 1955 


| Hans A. BETHE, presiding“ 


at the same time being one of the greatest theoretical 


Dr. Bethe: Members of the Physical Society 


E are here assembled to honor the memory of 
| Enrico Fermi who died on November 29 of last 
| year at the age of only 53. We had all expected that he 
‘would continue for many years to give to physics as 

much as he had given before. In Fermi, the Society lost 
| its most recent past president and the world of physics 
| lost one of its greatest members. Fermi was unique 
among the great physicists of the twentieth century in 
being one of the greatest in experimental physics and 

* The first pages of this issue are dedicated to Enrico Fermi, 


recent president of the American Physical Society, whose untimely 
death occurred November 29, 1954. Editor. 


University of Illinois 


INTRODUCTION 


| WN 1924 Einstein received from S. N. Bose! of the 
I University of Dacca a manuscript 1n which the 
Planck distribution function was derived onan entirely 
The particle nature of light quanta was given 

+ oral interpretation, as suggested dra- 
covered Compton effect, and 
f Gibbs and Boltzmann were employed 
most probable state of the quanta 
f fixed energy, but variable number. 
d distribution could be ob- 


Bose found tha! ta havi > ; 
tained only if it was ee TE RIE ciently low temperatures. Second, he pointed out that- <i 
: n SS ia 
specified wave number and P TA Engtein, Sitzungsber. Akad. Wiss. Berlin, p- 261 (1924), 
aa «1. 26, 178 (1924); 27, 384 (1924). p. 318 (1925). : kis 
1S. N. Bose, Z. Physik 26, JER 949 Oe 
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Fermi Statistics and Its Applications 


FREDERICK SEITZ 


physicists. He was unique also in the width of his con- 
tributions. He may have been one of the last physicists 
who knew almost all of physics and used it in his 
research. 

As we shall review Fermi’s life’s work we shall review 
the history of the last thirty years of physics. Wherever 
new and important frontiers were opened in physics, 
Fermi was there to lead the way and to show us how to 
make progress. Probably the first connection in which j 
the student will hear the name of Fermi is Fermi’s 
statistics. Professor Seitz will tell us about Fermi’s i 
statistics and what it means for modern physics. É 


completely indistinguishable in counting the number of 
complexions. 

Einstein, with characteristically brilliant insight, 
heralded Bose’s work and, in two papers stimulated by 
it, pointed out several highly significant implications. — 
First, he emphasized that Bose’s viewpoint made it 
possible to develop a strictly quantum treatment of the 
ideal gas in a manner which did not leave the entropy 
at absolute zero undetermined. He generalized Boses 
distribution function for the case of a fixed number of 
particles and pointed out that the gas would exhibit a E 
novel type of condensation of quantum origin at suffi- 
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Bose’s work showed the necessity of accepting the wave 
and particle descriptions of electromagnetic radiations 
on an equal, but complementary basis. In this connec- 
tion, he called the attention of colleagues to the stimu- 
lating thesis of de Broglie, with its implication that a 
wavelength A=//mv is to be associated with a free 
particle having mass m and velocity v and suggested 
that this work might well form the basis for a proper 
wave description of “material” particles to complement 
the customary particle viewpoint. 

At about the same time Pauli,’ stimulated by Stoner’s' 
analysis of the quantum states of the electrons in com- 
plex atoms, proposed the exclusion principle as a general 
rule governing the behavior of electrons in multi- 
electron systems. 

Fermi® entered on the scientific stage in his first 
major role at this point at the age of 23. He saw that 
the Pauli exclusion principle implied that electrons do 
not obey Bose’s counting rules when treated in a statis- 
tical assembly and he proposed a quantum treatment of 
the ideal gas on the basis of the alternate type of statis- 
tics, which now bears his name. At the time, he regarded 
this treatment as more appropriate for “material” par- 
ticles and hence as more appropriate than the Bose- 
Einstein formulation for entities other than light quanta. 
This point of view was generally accepted at the time, 
after receiving Pauli’s blessing. Nearly fifteen years 
passed before Pauli’ ventured an essential generalization 
by pointing out that there appears to be an invariable 
correlation between spin and statistics and that some 
fundamental particles other than light quanta do obey 
Bose statistics. In the meantime Dirac and Heisenberg? 
demonstrated that Bose and Fermi statistics had a 
unique and natural position in the formulation of wave 
mechanics through the existence of symmetric and anti- 
symmetric states. 


THE FERMI FUNCTION 


In the new statistics, the classical Maxwell-Boltz- 
mann distribution function 


f=g(e)A exp(—e/kT) (1) 


for the number of particles of energy e per unit energy 
at temperature T was replaced by the Fermi function 


3 g(c) 
exp (e— é)/kT +1 


Here g(e) is the density of states of the particles of 
energy ¢, and & is Boltzmann’s constant. A in (1) is a 


3 W. Pauli, Z. Physik 31, 765 (1925). 
4E, C. Stoner, Phil. Mag. 48, 719 (1924). 
5 5 Fermi, Lincei Rend. (6) 3, 145 (1926); Z. Physik 36, 902 
1926). 
i 6 W. Pauli, Z. Physik 41, 81 (1927). 
7W. Pauli, Phys. Rev. 58, 716 (1940). 
s P, A. M. Dirac, Proc. Roy. Soc. (London) A112, 661 (1926); 
W. Heisenberg, Z. Physik 38, 411 (1926). 
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temperature-dependent normalization constant deter- 
mined by the condition that the integral of f over all 
energies be equal to the number of particles, if particles 
are indeed conserved. The quantity e in (2), the Fermi 
potential or level, is a similar parameter. It is also 
equal to the chemical potential of a particle in the dis- 
tribution, that is to the energy gained by adding a 
particle. At sufficiently low temperatures, eds such that 
(e—¢)/kT in (2) can become large and negative for 
permissible values of e. f is essentially equal to g(e) over 
this range of e, below the Fermi energy, and the particles 
are packed to the maximum density permitted by the 
Pauli exclusion principle. The coefficient of g(e) in (2) 
decreases from unity to values substantially less when 
e becomes larger than e’ by an amount several times kT’. 
The distribution function approaches the classical form 
(1) in this range of e above the Fermi potential because 
the influence of the Pauli exclusion principle on particles 
of similar energy becomes negligible there. 


THE STATISTICAL ATOM MODEL 


Of the many applications of Fermi statics which fol- 
lowed its formulation, Fermi is responsible for only one 
which uses the distribution function as the primary 
tool, aside from the application to the ideal gas involved 
in the genesis of the theory. This second application is 
to what is commonly termed the Fermi-Thomas? model 
of the atom. Actually the uses of the statistics are so 
enormous that a lesser man might well have devoted his 
life to the subject. Characteristically, Fermi moved off 
into regions closer to the frontier of physics where his 
genius was more nearly indispensible. 

The Fermi-Thomas, or statistical, atom model, rests 
on the postulate that the electronic cloud outside the 
nucleus can be treated as if the electrons are free par- 
ticles in any region chosen so small that the potential 
field does not vary appreciably over its dimensions. In 
essence the cloud is regarded as a degenerate free electron 
gas which is not uniformly distributed because the poten- 
tial varies from the nucleus to the periphery but which 
possesses a common Fermi potential. The cloud does 
not collapse into the nucleus as would be the case for a 
classical distribution because the exclusion principle 
limits the occupation of states. The variation in electron 
density from one part of the atom to another resembles 
somewhat the variation in depth of a lake which has a 
variable bottom but a well-defined surface level. The 
analogy has limited applicability in the sense that the 
contour of the floor of the lake is determined only 
indirectly by the presence of the water, whereas the 
coulomb field of the electrons in the cloud has an im- 
portant effect on the potential distribution. 

The statistical atom model represents a somewhat 
crude approximation to the true state of affairs in the 
electronic shells of the atom. Nevertheless it has proved 


9E. Fermi, Z. Physik 48, 73 


1 i (1928); L. H. Thomas, Proc. 
Cambridge Phil. Soc. 23, 542 (1927) 


e CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


w 


rn ee 


! 
i 
i 
| 
i 


è 


a a = 


On a a a 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


FERMI 


to be exceedingly useful in cases in which there is need 
for equations which can be applied simply and generally 
to the entire periodic system. Fermi demonstrated in 
fact that the model gives a good description of some of 
the more subtle features of the period chart, such as the 
manner in which the shells of different angular momenta 
become occupied. 


° 
APPLICATIONS TO THE THEORY OF METALS 


Pauli” pointed out in 1927 that the weak temperature- 
independent paramagnetism of the metals, such as the 
alkalis, which have rare gas configurations inside the 
broken valence electron shells, may be interpreted semi- 
quantitatively by representing the conduction electrons 
as a Fermi gas of free particles. He demonstrated that 
the electron gas in a typical metal is highly degenerate 
in the sense that the Fermi potential e’ in (2) is several 
electron volts above the ground state of the conduction 
electrons, that is, by an amount large compared to $T. 
It follows that the introduction of Fermi statistics has, 
to first approximation, the effect of making the dis- 
tribution independent of temperature. In this way, the 
modern theory of metals was born as the result of a 
wedding between Fermi’s new idea and the proposal 
Drude" had made a quarter of a century earlier to the 
effect that conduction electrons in metals can be treated 
as free particles. 

The older theory of metals, based upon use of clas- 
sical statistics, had been developed to a fine point by 
Lorentz,’? but possessed many weaknesses and contra- 
dictions. Perhaps the most outstanding of these was the 
fact that the measured specific heats of the simpler 
metals possessing highest electrical conductivity did not 
display evidence for the contribution 3R/2 expected 
classically from each mol of free electrons; however, 
other difficulties, fully as serious, had been encountered. 

Pauli’s success in explaining the paramagnetism of the 
conduction electrons gave hope that the time was finally 
ripe for a new and more successful advance in the 
general theory of metals. Sommerfeld led the new 
period of investigation in a program that had many 
fruitful consequences. It became evident at once that a 
large number of the difficulties of the Drude-Lorentz 
theory vanish with the introduction of Fermi statistics. 
For example, the anomaly concerning the specific heat 
was resolved immediately, for only a small fraction of 
the electrons having an energy in a band of the order 
of kT about the Fermi potential are sufficiently un- 
encumbered to be influenced by temperature and con- 
tribute to the specific heat. Subsequent measurements 
near liquid helium temperatures have revealed the small 
electronic heat proportional to the first power of the 


1 W., Pauli, Z. Physik 41, 81 (1927). 

u P, Drude, Ann. Physik 1, 566 (1900). A 

2H, A. Lorentz, The Theory of Electrons (Teubner, Leipzig, 
1909). 

13 A. Sommerfeld, Z. Physik 47, 1, 33 (1928). 
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temperature predicted by the theory. In general it was 
found that most of the properties of the electron gas 
which vary with temperature and applied fields are 
determined by the small fraction of electrons having 
energy near the Fermi potential. 

It is interesting to note in passing that the introduc- 
tion of Fermi statistics brought about precisely those 
qualitative modifications in the free electron theory of 
metals which Wien" had insisted would be necessary in 
a critical review of the failures of the Drude-Lorentz 
theory in 1913. It may be added that Einstein proposed? 
in 1924 that the difficulties might be solved by postu- 
lating that most of the electrons in metals are in the 
lowest state of the Bose distribution. 

The successes which accompanied the first introduc- 
tion of Fermi statistics into the theory of metals were 
so great that remaining discrepancies and unresolved 
issues were given serious attention in the full light of 
wave mechanics which was now highly developed. Two 
topics became the focal points. Fermi statistics played a 
major role in the solution of both. One centered about 
the calculation of the mean free path of the conduction 
electrons from first principles. Houston'® pointed out 
that the electrons at the top of the Fermi distribution 
usually do not satisfy the Laue conditions for diffraction 
from the perfect lattice, so that they should move un- 
hindered through the crystal unless scattered by im- 
perfections. He concluded that the waves of lattice 
vibration are the agents primarily responsible for scat- 
tering in the purest metals and presented a rudimentary 
theory of the temperature dependence of resistivity on 
this basis. Bloch'® extended the theory and demon- 
strated that the restrictive influences of the Fermi 
distribution on the changes in electron energy play an 
exceedingly important role in determining the allowed 
scattering transitions and hence have a profound in- 
fluence on the mean free path. 

The second unresolved topic which drew major in- 
terest was the anomalous Hall effect observed in many 
metals. The sign of the transverse electromotive force 
developed in a current-carrying strip of metal when it 
is placed in a transverse magnetic field depends upon 
the sign of the carriers when the latter are perfectly 
free. The sign in most metals is what one would expect 
for negatively charged electrons, however, many metals 
behave as if the carriers are positively charged. This 
observation was explained by Peierls‘? in terms of the 
combined effect of the Fermi distribution function and 
the band theory of electron states in solids. In brief, 
some of the distributed, or free particle, states of elec- 
trons in solids, particularly those near the top of the 
allowed bands of energy, have the characteristic that 
they describe electronic motion corresponding to a nega- 
tive mass when the electron is subject to an external 


4 W. Wien, Sitzungsber. Akad. Wiss. peim p 184 (1913). 
15 W. V. Houston, Z. Physik 47, 33 (1928); 48, 449 (1928). 
18 F, Bloch, Z. Physik 52, 555 (1928) ; 59, 208 (1929). 

WR, Peierls, Z. Physik 53, 255 (1929). 
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force. In other words the group velocity of a wave 
packet constructed from closely neighboring states de- 
creases when the applied force is in the direction of the 
velocity” vector. All energy bands appear to possess 
states of this type. Whenever the forces are electro- 
magnetic, the negative effective mass can be viewed as 
if originating in a positive charge. Under proper con- 
ditions, which are far from uncommon, the Fermi 
potential lies in a region of the band where the effective 
mass is negative. The electrons near the Fermi level 
responsible for transport then behave as if positively 
charged. 


THE THEORY OF HOLES 


The Fermi level lies in the region between two well- 
separated bands of levels in the ideal insulator, so that 
the lower band is completely filled and the upper com- 
pletely empty when the temperature is close to absolute 
zero. The states at the top of the filled band are in- 
evitably of the type in which the electron behaves as if 
it possesses a negative mass. Thus the “holes” produced 
in this band by removing electrons from the top of the 
Fermi distribution in it behave!’ exactly like positive 
charges. Moreover the symmetry properties of the 
Fermi distribution function for the electrons are such 
that an equilibrium assembly of holes can be regarded as 
distributed in accordance with a Fermi function possess- 
ing the Fermi potential —e’, provided the energy of a 
hole in the electron level e is taken to be — e. 

It follows that the hole concept, which has achieved 
almost household status in discussions of insulators and 
semiconductors during the last twenty years, rests in a 
very intimate way on Fermi statistics and the closely 
associated Pauli exclusion principle. 


THEORY OF POSITRONS 


Dirac’s! discovery of the negative energy states of 
the electron in the development of a relativistically 
invariant theory of the electron created a baffling 
paradox. At first it appeared that the negative and 
positive energy states might form noncombining sys- 
tems, analogous to the symmetric and antisymmetric 
solutions of the Schrédinger equation. This hope proved 
forlorn and an alternate general principle was needed to 
explain why normal electrons do not use the opportuni- 
ties available in the theory to fall endlessly to lower 
energy levels. This difficulty was finally resolved” by 
postulating that in its normal state the universe is 
flooded with a Fermi distribution of electrons which 
completely fill the negative energy states. A volume in 
which the negative levels are exactly filled corresponds 
to the neutral vacuum which provides the background 
for the normal events of physics. The Fermi potential of 
the distribution actually extends to positive energy 


18 W, Heisenberg, Ann. Physik 10, 888 (1931). 

19 P, A. M. Dirac, Proc. Roy. Soc. (London) 117, 610 (1928) ; 
118, 351 (1928). 

2 P, A. M. Dirac, Proc. Roy. Soc. (London) 126, 360 (1931); 
133, 60 (1931) ; J. R. Oppenbeimer, Phys. Rev. 35, 562, 939 (1930). 
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states in regions where normal matter is present, thereby 
furnishing in stable form the normal electrons of ordi- 
nary matter. The removal of an electron from one of the 
negative energy states results in the creation of a posi- 
tron—a particle resembling closely one of the holes 
created by removing an electron from the top of a filled 
band in an insulating crystal. The positron derives its 
dynamical behavior as a positively charged: particle in 
an electromagnetic field from the fact that an electron 
possesses a negative effective mass in one of the states 
of negative energy. 

The introduction of the Fermi sea of electrons to res- 
cue the relativistic theory of the electron indicates that 
there must be a profound relation between Fermi 
statics, the relativistic aspects of the universe and the 
existence of fundamental particles of half-integer spin 
which is only vaguely appreciated at the present time. 
It remains significant that the one-particle theory can- 
not provide a consistent and realistic description of the 
behavior of an electron without employing relationships 
which we normally associate ‘with multiparticle phe- 
nomena. 


FERMI GAS NUCLEAR MODEL 


It is tempting to treat the nucleus, to a first approxi- 
mation, as a liquid droplet in which the nucleons are 
regarded as a Fermi gas of free particles which move in 
a uniform potential field determined by an appropriate 
average of the interactions between particles. This was 
first done by Majorana” and Weiszicker.” The model, 
which is somewhat reminiscent of the free electron 
theory of metals or the Fermi-Thomas model of the 
atom, obviously has strong disadvantages since it 
ignores the finer details of the shell structure of the 
nucleus. Nevertheless it appears to reflect certain as- 
pects of nuclear behavior, most importantly the over-all 
regularity of the sequence of nuclei extending from the 
light to the heavy elements and the fact that most nu- 
cleons appear to be constrained so that they are unable 
to make transitions involving energies which do not 
raise them above the Fermi level. This last property 
may well account, at least in part, for the transparency” 
of the nucleus to neutrons possessing energies in the 
range of 1 Mev. 


APPLICATION TO WAVE FIELDS | 


Fermi statistics finds its application to the theory of 
wave fields in the development introduced by Jordan 
and Wigner” on the basis of the field theory of Dirac, 
Pauli, Guttinger, Klein and Jordan which underlies 
present-day particle physics. They demonstrated that a 
wave field in which the amplitude operators at different 
points anticommute satisfies Fermi statistics in the 


21 E. Majorana, Z. Physik 82, 137 (1933). 

22 Ç, F. v. Weizsäcker, Z. Physik 96, 431 (1935). 

23M. Walt and H. H. Barschall, Phys. Rev. 93, 1062 (1954); 
H. Feshback and V. Weisskopf, Phys. Rev. 90, 166 (1953). 

^ P, Jordan and E. Wigtier, Z. Physik 47, 631 (1928). 
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sense that a given wave is limited to only two states. 
One state corresponds to the absence and the other to 
the presence of a particle having the wave number and 
polarization associated with the wave. This situation is 
in sharp contrast to the one which occurs when the am- 
plitude operators commute, for then the states associ- 
ated with a given wave are infinite in number, though 
discrete, corresponding to the possibility of an unlimited 
number of particles with specified wave number and 
polarization. This alternative scheme evidently repre- 
sents Bose statistics. 


Dr. Bethe: 


Thank you very much, Dr. Seitz, for this talk which 
so clearly illustrated the importance of Fermi’s statistics 
for the development of a*whole new field of physics, in 
fact of many new fields, and which also illustrated the 
essential unity of physics by showing how the dis- 
coveries of many people were combined into something 
new. Soon after the discovery of Fermi’s statistics, 
Fermi became professor in Rome, a very rare thing in 
Europe for such a young man. He assembled a group 
of very bright young physicists in his laboratory and 
revived physics in Italy which had been more or less 
dormant in that country since the days of Volta more 
than a hundred years earlier. 

Fermi’s first interest in those days was the application 
of his statistics and of other methods to the understand- 
ing of atoms. This was the primary concern of most 
theoretical physicists at the time. Many important 
things were discovered, many puzzles of previous days 
were resolved. In all this, Fermi combined an under- 
standing of the theory with a great sense for the essential 
physics of the problem which enabled him to make 
wide use of experimental results to simplify the theory 
and improve the accuracy of theoretical results. One 
example I may mention in this connection is the hyper- 
fine structure in atomic spectra for which he gave the 
first formula which is still in use today. Later on, he 
and his collaborators found expressions for the hyper- 
fine structure that are useful even in complicated atoms. 

One of the less-known achievements of this time is 
the explanation of the anomalous ratio of the intensities 
of the doublet components in the spectra of the alkali 
atoms. In his characteristic way he used extremely 
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It follows that the two types of statistics have a 
natural position in the theory of fields, expressed 
through the two naturally complementary commutation 
rules of amplitudes. In fact’ the relative simplicity with 
which the two types of statistics enter into the field 
formalism seems to carry two implications with it: first, 
the classification of particles into Fermions and Bosons 
is a natural one; second, the development of quantum 
physics into the theory of fields, where the subject 
resides at present, is a natural evolutionary step and 
not a blind digression. 


simple theoretical arguments and much physical in- 
tuition in this theory. Another was the explanation of 
the band spectrum of carbon dioxide which had puzzled 
physicists and chemists for many years. 

Soon the interest of physics and the interest of Fermi 
went from atomic physics to field theory. Many of you 
probably, like myself, have learned their first field 
theory from Fermi’s wonderful article in the Reviews of 
Modern Physics of 1932. It is an example of simplicity 
in a difficult field which I think is unsurpassed. It came 
after a number of quite complicated papers and before 
another set of quite complicated papers on the subject, 
and without Fermi’s enlightening simplicity I think 
many of us would never have been able to follow into 
the depths of field theory. I think I’m one of them. 

Around this time I had the privilege of first meeting 
Fermi by coming to Rome and I can say that I learned 
more from him than from anybody else in my life. I 
visited Rome in 1931 and in 1932. Between the two 
visits work in field theory had gone on and Fermi, like 
so many other of the great theorists, had tried to explain 
away the divergences of quantum electrodynamics. 
But on my second visit the interest of physics had 
shifted to something new. The discovery of the neutron 
had not yet happened, but was visible on the horizon: 
Bethe had made his experiments in which he saw strange 
radiations which then were unexplained. Fermi had 
become interested in these experiments, and generally 
in the field of nuclear physics. The second lecture which 
we shall hear today will deal with an important theo- 
retical contribution which Fermi made to the theory of 
nuclei, namely the theory of heta decay. It will be pre- 
sented to us by Professor Konopinski. 
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N his theory of B-decay, Enrico Fermi at once dis- 
played his characteristically simple directness in 

putting his finger on the essential point concerning a 
given process of nature, and, at the same time, his 
broad comprehension of how it must be fitted into the 
intricate theoretical structure of the physical universe 
which is steadily being developed. 

To bring out the essential simplicity of Fermi’s con- 
ception, consider the primary point of his B-theory: let 
Wx be the probability amplitude of the nucleus ready 
to decay (it may be the neutron itself); let Yr, Yo Ww 
stand for each of the product particle states, the forma- 
tion of which is to be examined; then, according to 
Fermi, the probability of decay into these states is 
simply proportional to the squares of all the probability 
amplitudes, all evaluated at the same point of space. 

The simple idea is that the probability of formation of 
each particle is proportional to the probability that it 
can be found at the site of the transforming nucleon, 
while in the state the formation of which is sought. 

After adopting this idea, Fermi had to fit it in with the 
rest of physical theory. He had to define a coupling 
energy density which is essentially 


h= gn WoW. 


The proportionality constant g is to measure the 
strength of the coupling, much as the electronic charge 
e measures the strength of electromagnetic interactions. 
As it stands here, the expression for / is only schematic. 
To describe each of the participating fermions properly, 
the field amplitudes y must be spinors, quantities pos- 
sessing components. Then there is a variety of ways in 
which the y’s may be multiplied together; one has a 
choice of five independent forms of / as a result, sym- 
bolized by 


Nts, hv, htt, Ita, Np. 


These are known as the scalar, vector, tensor, pseudo- 
vector (axial), and pseudoscalar couplings, respectively. 
To get each, suitable operators must be inserted into 
the above schematic for #. The choice between these 
possible forms was left by Fermi as an exercise for the 
students of @-decay. The current status of the problem 
will occupy much of this discussion. 

Before restricting this appreciation of Fermi’s work, 
one should at least mention the multiplicity of rami- 
fications which his theory has had altogether. It is 
amazing what varieties of observed phenomena (and 
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what thicknesses of The Physical Review!) have their 
roots in his one paper on the subject." 

At the time Fermi constructed his theory, the experi- 
mental knowledge concerning B-phenomena was very 
sketchy. Only the natural B-emitters were then known 
and these form only a small minority of many hundreds 
of cases studied by now. Positron decay was unknown at 
the time, yet Fermi’s theory contained its correct 
description implicitly. The theory predicted the exist- 
ence of orbital electron capture,” and in due time it was 
observed. Certain selection rules on transitions were 
laid down, and, after enough became known about 
nuclear spins, they were duly confirmed. Peculiar elec- 
tron energy distributions werę implied for certain types 
of forbidden transitions.’ For some years, experimenters 
were unable to find these’; they persisted in finding 
spectrum shapes consistent with mere statistical argu- 
ments. In the end, however, the peculiar spectra were 
found,® bearing out the conclusions from the theory. 
Numerous peculiarities have been observed among the 
many cases of 6-decay, and always Fermi’s theory was 
equal to the challenges they presented.‘ 

The ramifications of Fermi’s theory have not been 
restricted to -phenomena proper. Perhaps its most 
important consequence is that it enables the use of 
B-spectroscopy for exploring the internal mechanics of 
nuclei—in much the same way as ordinary spectroscopy 
revealed the behavior of the electrons in atoms. This is 
a vast development, which is helping give some of the 
finest details concerning nuclear structure. 

Perhaps the most far reaching implication of Fermi’s 
conception of the -interaction will come from the ex- 
ample it has set for interactions between fields. The 
simple proportionality (of /) to each of the interacting 
field amplitudes, all evaluated at a common point of 
space, is known as a “simple-contact” interaction, a 
type most directly implied by the ideas of relativity. 
There are indications that this type is best suited to 
represent the interaction of any fields.® For. example, 
the pion-nucleon interaction seems best formulated as 
just such a simple-contact interaction, proportional to 
the pion- and nucleon-field amplitudes. The electro- 
magnetic interactions themselves are earlier examples of 


1E. Fermi, Z. Physik 887161 (1934). 
co kawa and Sakata, Proc. Phys. Math. Soc. Japan 17,467 
3 Konopinski and Uhlenbeck, Phys. Rev. 60, 308 (1941). 
4 Konopinski and Langer, Ann. Rev. Nuc. Sci. II, 261 (1953). 
* Langer and Price, Phys. Rev. 75, 1109 (1949); Peacock and 
Mitchell, Phys. Rev. 75, 1273 (1949). 5 
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the type. Fermi’s example provided the first indication 
that the particular characteristic in question may have 
much wider validity. 


2 


Now the discussion will be restricted to develop- 
ments regarding the core of Fermi’s theory, the 
detailed form of the coupling itself. Any linear 
combination of the five forms Me he, he ha, hp 
would be consistent with the principles Fermi laid 
down. One might say that Fermi left five coupling 
constants gs, gv, Eu Za, 8p to be measured. g, may be 
considered a measure of how strong an invariant source 
for B-radiation a nucleon is. A nonvanishing g, will occur 
if a nucleonic transformation can be treated like a con- 
served current, which @-radiates. The g, corresponds to 
the transforming nucleon being treated as an “intrinsic 
dipole moment” source of B-radiation. Pseudovector and 
pseudoscalar types of sources, as would exist if ga and 
£p do not vanish, have become familiar from theories of 
m-radiation. These interpretations invite comparison of 
Fermi’s g’s, not only with electronic charge as done 
above, but also with intrinsic gyromagnetic ratios, such 
as occur among electromagnetic properties of particles. 

The analogy with gyromagnetic ratios may well be 
unwarranted ; however, it does serve to remind one that 
electromagnetic interactions also can take various 
forms. This may reconcile one to the finding that similar 
complexities, i.e., the existence of more than one form 
of h, arises in -interactions also. 

There have been attempts to adopt simple ratios of 
the five g’s which would give the entire coupling a simple 
symmetry. However, all such attempts have failed.’ 
The analogy to intrinsic gyromagnetic ratios, dependent 
on intrinsic properties of the particles involved, perhaps 


‘makes such failures unsurprising. For the present one 


must be content with simply measuring the five coupling 
constants, g. 

The measurements one relies on are of three main 
types: (1) relations between decay rate and energy re- 
lease ; (2) 8-spectrum shapes; (3) correlations among the 
directions of emission of the product particles. The re- 
sults of the measurements, after comparison with 
expectations from each type of coupling, have led to 
the conclusions’ : 


—both the /, and /, couplings exist, in about equal 
strength; 

—the coupling 1, may or may not exist, the evidence 
being obscure on this point. 


Certain points of the evidewce should be enlarged 
upon to give the above conclusions their proper weight. 
First, it seems well proved that more than one form 
of coupling exists. The proof rests primarily on the 
fact that transitions with unit spin change, as well as 


7H. Mahmond and E. Konopinski, Phys. Rey. 88, 1266 (1952). 
8 G. Gamow and E. Teller, Phys. Rev. 49, 895 (1936). 


O*—O* transitions,’ are just as rapid as other transi- 
tions without spin change. Hither /, or 4, must be 
present to generate the rapid unit spin changes; either 
h, or h, is needed to make O*—O* transitions’ equally 
rapid. Firm confirmatory evidence, based on comparing 
the decay rates of the simple mirror nuclei, has gradu- 
ally developed. The presence of /, or h, receives further 
strong support from certain characteristic spectrum 
shapes® associated with spin changes exceeding the de- 
gree of forbiddenness. 

For the conclusion that not both h, and Ae, and not 
both %, and ha can exist, one relies largely on the fact 
that allowed spectra are observed to have the so-called 
“statistical” shapes. Deviations from these should be- 
come observable if both h, and h, are present, since the 
allowed radiations they generate are able to interfere 
with each other.” The same is true of the pair Ay, Ma. 
Small admixtures (1 to 4% in g*) might have escaped 
observation.’ 

The arguments based on the absence of interference 
effects in the allowed spectra do not lead to the prefer- 
ence of h, over h,, and A over Aa That follows from fur- 
ther evidence. Originally,’ a rather unstraightforward 
analysis of once-forbidden spectra was relied upon; it 
still constitutes strong evidence that at least the com- 
binations (h,, ha) and (ha, hì) cannot exist. However, 
much more straightforward evidence has been provided 
by directional correlation"! measurements on He’ and 
Ne”. The first of these, He’, undergoes a rapid transition 
with unit spin change, hence must be generated by M 
or ta. The observed directional correlations are very 
definitely characteristic of Jy. The Ne” correlations” 
equally definitely prefer 4, over Mtu. 

There remains the question of why the detection of 
a possible #, coupling has been so elusive. Unless g, is 
orders of magnitude larger than g, and g,, its contribu- 
tions are almost always greitly outcompeted by the 
others, according to the theory. The single type of excep- 
tions is formed by cases of parity-change transitions with 
no spin change. Even here, , competes with 4, and A 
only on equal terms. There is some evidence" that parity 
change transitions with no spin change are somewhat 
more intense than those with unit spin change, and this 
might be accounted for by the added contribution of an 
hy coupling. One cannot be sure, however, that variations 
in nuclear -moments are not responsible instead; the 
evaluation of these, in parity change transitions, seems 
to depend essentially on little understood relativistic 
corrections to nuclear states. 


? Sherr, Muether, and White, Phys. Rev. 75, 282 (1949). 

DM. Fierz, Z: Physik 104, 533 (1937). 

D. R Hamilton, Phys Rev. 71, 456 (1947). 

2B. M. Rustad and Š. L. Ruby, Phys. Rev. 89, 880 (1953); 
J. S. Allen and W. K. Jentschke, Phys. Rev. 89, 902 (1953). 

13 Maxson, Allen, and Jentschke, Phys. Rev. 97, 109 (1955); 
W. P. Alford and D. R. Hamilton, Phys. Rev. 95, 1351 (1954). 

1 L, Nordheim, Indiana Conf. on Nuc. Spectroscopy, (1953) ; 
R. King and D. Peaslee, Phys. Rev. $4, 1284 (1954). 
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Jn the previously stated conclusion that the h, and 
I couplings both exist, the quantitative data was not 
given, Feenberg'®, Mayer, „Kofoed-Hansen, Sherr and 
many others have contributed toward establishing it. 
Apparently, about all one can now say is that 


ge/er=0.75 to 1.15 


If one supposes |g.|=|g:], each equals 1.7 (10) 
erg-cm’, This still reveals nothing concerning the rela- 
tive sign of g, and gı. 

The relative phases of /, and /; can only be deter- 
mined from an interference of their radiations. They are 
not able to interfere in the allowed transitions; one 
must go to the forbidden transitions and hope that the 
interference effect can be singled out from the super- 
position of direct radiations. Even if this is done, a 
difficult problem remains. Each type of coupling con- 
tributes through a different nuclear -moment (he 
through ga (r), Mu through g: (@Xr) and g: («)). Thus, 
before the signs of the g’s can be determined, the signs 
of the moments must be calculated. This is difficult to 
do with satisfactory reliability, as evidenced by the fact 
that various attempts made so far disagree with each 
other. 


3 


The ideas conceived by Fermi are pushing into new 
fields, beyond the domain of nuclear B-decay. 

An important discovery'® was that the rates of muon 
decay, and of muon capture by nucleons, are both con- 
sistent with the same, characteristic magnitude of the 
coupling constant g= (10) erg-cm’, as for 6-decay. 
This invites an extension of the Fermi theory to the 
u-phenomena. 

Besides the rates of the muon processes, there is one 
further quantitative datum which experiments have 
yielded. That is the energy distribution of the electrons 
emitted in y-decay. The distribution can be character- 
ized by a single parameter, p: The value p=0 corre- 
sponds to a spectrum with a vanishing intensity at its 
end point; the values p>0 correspond to increasing end- 
point intensities. The experiments still disagree on the 
correct value for p, but the best value!® seems to be 
about p=}. 

The application of a Fermi type of theory to -decay 
would, as for 8-decay, require the determination of five 
coupling constants: gs, ge, Zu Za, Zp. It has been the 
fashion to assume that these constants will have the 
same proportions in u-decay as for B-decay. This still 
leaves two types of ambiguity in the procedure of the 
application. One concerns the “order” in which the 
participating particles are represented in the interaction 
law (the correspondence between participants in the 

p- and B-processes). This ambiguity can be removed in 


15 See E. Feenberg, Phys. Rev. 97, 736 (1955) for the latest 
summary. 

16 Ọ, Klein, Nature 161, 897 (1948). 

17 L. Michel, Phys. Rev. 86, 814 (1952). 

18 J, Vilain and R. Williams, Phys. Rev. 94, 1011 (1954). 
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a way to be described below. The seconä type of am- 
biguity arises from the limited knowledge of the g’s 
from B-decay. That leaves at least the relative sign of 
gs and gı undetermined and leaves the value gp almost 
completely open. Using this freedom, one can fit 
both p= and the meson lifetime with the choices 
{p= 3g.= 3g, to a fair enough approximation. 

However, there is no compelling reason fo expect the 
proportions of the g’s to be the same in u- and f-decay. 
One may remind oneself of the analogy to intrinsic 
gyromagnetic ratios again. The muon is probably a 
much simpler particle than the nucleon and can just as 
well be expected to have a single form of coupling. 
Either scalar or pseudoscalar coupling alone would give 
p=} and a g-value about twice that of B-decay. There 
are not readily settled questions of normalization of 
u- vs B-decay which could account for the discrepancy 
in g. 

It is well worth noting at this point that Fermi made® 
a more direct, though consciously tentative, contribu- 
tion to the theory of the muon spectrum than through 
his part in inspiring the ideas described so far. He 
ignored his 6-theory in this connection and derived the 
spectrum one would expect on a purely statistical basis. 
He merely assumed that the probability for an electron 
to obtain a given share of the energy is proportional to 
the phase space volume for that fraction. The spectrum 
thus obtained is practically indistinguishable from one 
with p=15/28. This is in remarkable agreement with 
the presumably best observed value of p and, very 
characteristically, Fermi obtained it with a minimum of 
assumptions. 


The foregoing discussion of coupling forms for the 
muon interaction was relieved of a type of ambiguity, 
already mentioned, through considerations" to be dis- 
cussed now. These considerations may have an intrinsic 
importance quite apart from their usefulness in reducing 
the ambiguities of the theory. 

If the Fermi type of interaction takes place among 
such different quartets of fermions as (1) PN ev, (2) uvev 
and (3) NPuv, then such an interaction might be ex- 
pected to exist for any quartet of fermions. This is 
known” as the hypothesis of a “Universal Fermi 
Interaction.” The future of a hypothesis so named is 
particularly appropriate to consider here. ~ 

The “Universal Fermi Interaction” hypothesis leads 
us to expect the occurrence of every process, relating to 
quartets of fermions, which does not violate conserva- 
tion principles. With the four presently known types of 
fermions this can already lead to the prediction of 
processes which are known not to exist in nature. One 
can avoid this, however, by taking seriously two more 
conservation principles than the usual ones on charge 
and energy momentum. 


Ps E. Konopinski and H. Mahmoud, Phys. Rey. 92, 1045 (1953). 
C. Yang and J. Tiomno, Phys. Rev. 79, 495 (1950). 
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The first af the additional laws is known as the 
“conservation of heaviness”; it requires the appearance 
of a mass of nucleonic order for each one that disappears. 
Such a law is needed for understanding many elemen- 
tary particle interactions besides the interfermionic 
ones. 

The second new conservation law which seems to be 
needed is!*:sin interfermionic processes, a pair of ferm- 
tons must be created for every pair destroyed. Of course, 
the appearance of an antiparticle is to be understood as 
the disappearance of a normal one, from a negative 
energy state. With the help of this, one can prevent the 
occurrence of all the conceivable processes which are 
not observed. It is only necessary to presume the 
positive muon to be the normal particle, the negative 
muon the antiparticle. 


Dr. Bethe: 


Thank you very much Dr. Konopinski. 

The two lectures that you have heard so far have 
illustrated how two rather small papers of Fermi’s have 
had tremendous influence on the progress of physics. 
The three papers which you are going to hear now are 
of a somewhat different character in that they tell 
directly about the work of Fermi himself, in the last 
twenty-odd years, by some of the close collaborators 
in this work. The paper by Fermi that Dr. Konopinski 
talked about, the theory of beta disintegration, was 
the last paper which Fermi wrote as a pure theoretical 
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Without the hypotheses here described it would be 
difficult to understand the nonoccurrence of (a) 
P+p—P+e- when both (b) P+w—-N-+y and (c) 
P+e—+N-+pv do occur. When the hypothéses are 
adopted, (b) must be understood as P+y-—N-by! 
(/=antineutrino) and (a) would require the appear- 
ance of three normal fermions from the absorption of 
one, in violation of the last conservation law. The 
hypotheses also establish the unambiguous ‘order’ for 
the u-decay implied by u*-+e—>v-+y; the emitted neu- 
trinos must be identical, one cannot be an antineutrino. 

The considerations introduced here are highly specu- 
lative and probably do not warrant the attention given 
them here. However, they well illustrate how the ideas 
created by Enrico Fermi continue to ramify. They 
emphasize how much physicists owe to him. 


physicist. Around that time in the early 1930's it was 
clear to most nuclear theorists that no progress could 
be made in nuclear theory without very many more 
experimental data. Most of us just left it at that and 
left to the experimental physicists the difficult task of 
providing the experimental data. Fermi came to the 
logical conclusion, that he should become an experi- 
mental physicist himself, and thus he provided the 
data on which the theory could be built. The next talk 
that you will hear will tell of the work of the nuclear 
physics group in Rome in the middle 1930’s by one of 
Fermi’s closest collaborators, Professor Segrè. 


VOLUME 27, NUMBER 3 JULY, 1955 
Fermi and Neutron Physics 
EMILIO SEGRE i 


Department of Physics, University of California, Berkeley 4, California 


ELLOW members of the Physical Society, Ladies 
and Gentlemen. 

I have been requested to speak to you about the early 
work on neutrons performed in Rome in 1934 and 1935. 
There is obviously not very much that I could tell 
about it which would be of scientific interest, and even 
the official history of that period has in a certain way 
been written by Fermi himself in the Nobel lecture of 
1938.1 What I will try to do is td recapture, as much as 
possible, some of the atmosphere and the spirit that pre- 
vailed at that time among the people involved in the 
work. For this reason these recollections will be some- 
what personal and I beg to be excused for it. 


1E. Fermi, Les prix Nobel (1938). ° 


The main activity of Fermi until 1934 had been 
theoretical, and it had been very successful having al- 
ready accomplished, among other things, the discovery 
of the statistical laws governing the antisymmetrical 
particles, that is, the famous Fermi statistics, and the 
beta-ray theory. But, in spite of his theoretical activity, 
Fermi liked to make experiments, once in a while, and 
in this he had been associated mainly with Rasetti. 
The principal topics of experimental research were 
spectroscopic and optical because these were the fields 
which were at the forefront of the attention of physicists 
in the late 1920’s and early 1930's. = 

After the formation of the school in Rome, and in 
view of the fact that several of us were interested, not 
only in theory, but prevalently in experimental physics, 
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Fic. 1. Physics building in Rome where the neutron work 
was done. 


we often had discussions on what experimental fields 
looked most promising. Although we had good spectro- 
scopic equipment and a good knowledge of spectro- 
scopic techniques, it had for several years been the feel- 
ing, mainly of Fermi, that we should branch out and go 
into nuclear physics because that was the field which 
promised to become most interesting in the future. (See 
Fig. 1.) 

However, not being quite sure about this, we went to 
several laboratories to learn techniques. Among the 
laboratories visited at length were those of Millikan in 
Pasadena, Stern in Hamburg, Debye in Leipzig, and 
Lise Meitner in Berlin. Several of us spent a year or 
more in these laboratories and finally we all went home 
importing techniques and discussing what we had seen. 
After a somewhat long and heated debate in which dif- 
ferent opinions were maintained (I remember vividly 
a scene in a locker room of a tennis court) with plenty 
of vigor, it was decided, mainly under the impulse of 
Fermi, that we should really branch into the nuclear 
field. For this Rasetti, who had learned techniques at 
Dahlem from Lise Meitner, started to prepare quite a 
few pieces of apparatus such as a cloud chamber and a 
gamma-ray spectrograph. However, while the equip- 
ment was being prepared and we were trying to decide 
what type of work to actually start, Amaldi and I con- 
tinued spectroscopic work. 

The real decision occured in 1934 when we read in 
the Comptes Rendus? and in Nature of the discovery of 
artificial radioactivity by Curie and Joliot. It became 
apparent to Fermi that this was just the occasion for 
which we had been waiting and immediately he saw the 
possibility of expanding the work tremendously by using 
neutrons as projectiles. The start of the experiments was 
facilitated by the very lucky circumstance that Pro- 
fessor Trabacchi had on hand a gram of radium and a 
plant with which it was possible to extract emanation 
and prepare radon-beryllium sources. We did not know 
at that time how dangerous it was to work with beryl- 
lium and we have been fortunate that up to now none 
of us has shown serious signs of berylliosis, although at 


21. Curie and F. Joliot, Compt. rend. 198, 254, 561 (1934). 


e 


that time we probably gave the disease many chances 
to develop. It was in the spring of 1934, during Easter 
vacation, that Fermi decided to put his idea to the 
experimental test immediately; he built with his own 
hands some primitive Geiger-Müeller counters of alu- 
minum (see Fig. 2) which looked very ugly but worked 
adequately for the purpose and then started to bombard 
with radon plus beryllium neutrons (50 millicuries of ra- 
don wasall that he had at the beginning) all the substances 
he could get. Being a systematic man, as most of you 
probably know, he tried in order hydrogen (helium was 
not available), lithium, beryllium, boron, carbon, nitro- 
gen, oxygen, all with negative results; but since he 
knew for sure that the phenomenon was going to happen, 
he kept on trying and finally fluorine gave the expected 
result. 

This was on March 25, 1934 and a letter announcing 
this result was promptly sent to the Ricerca Scientifica.’ 
The Ricerca Scientifica is a general science journal 
similar to Nature or Science and the editors of it would 
give us reprints of the articles within a very few days 
after receiving the manuscript. These reprints were the 
bulletins of our work which were sent at that time to a 
mailing list comprised of most of the nuclear labora- 
tories in Europe and America. The first paper was signed 
by Fermi alone and contained the initial discovery. 

Having seen that he had struck gold, Fermi, who 
wanted to proceed as fast as possible and was quite 
unselfish with respect to the work, asked Amaldi and 
me to help him carry out the experiments. Rasetti was 
in Morocco getting decorated by the Sultan. D’Agostino 
was not originally in our group; he was a chemist in the 
laboratory of the Sanità and at the time had a fellow- 
ship in Paris at the Laboratory of Madame Curie. We 
wired Rasetti, asking him to come back and in the 
meantime Fermi, Amaldi, and I tried to push the work 
as fast as we could. We organized our activities in this 
way: Fermi would do a good part of the experiments 
and the calculations, Amaldi would’takefcarefofjwhat we 


Fic. 2. The Geiger-Miiller counter and lead houses used 
in the first year of the neutron work. 


3E. Fermi,.Ricerca sci. 5 (1) 283 (1934). 
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would now call the electronics, and I would secure the 
substances to be irradiated, the sources, etc. Now, of 
course, this division of labor was by no means rigid and 
we all participated in all phases of the work, but we had 
a certain division of responsibility along these lines and 
we proceeded at great speed. We needed all the help we 
could get and we even enlisted the help of a younger 
brother of éne of our students (probably 12 years old) 
persuading him that it was most interesting and im- 
portant that he should prepare some neat paper cylin- 
ders in which we irradiated our stuff. It was also ap- 
parent very soon that chemistry was an essential tool 
for the work and we were debating on how to get the 
help of a professional chemist when the door opened 
and Dr. D’Agostino entered. We barely knew him at 
the time, but soon we became excellent friends. He had a 
return ticket which was extended three times and finally 
expired (European railroad tickets are not reimburs- 
able) and he did not return to finish his fellowship in 
Paris. 

The first thing we décided to do was the obvious: 
irradiate all the substances we could lay our hands on. 
For this we needed a little money. A phone call by 
Fermi to the National Research Council got us about 
$1000 with no strings attached (of any kind). I became 
the cashier without any obligation of keeping books. 
With some of this money in my pocket and a basket I 
went as a procuring agent to an old chemical shop in 
Rome which I knew. I found there a gentleman by the 
name of Troccoli who would speak Latin because he had 
been educated in a seminary for priests in the province 
of Rome and I explained the situation to him. I was very 
pleased and almost moved to see how enthusiastically 
he collaborated to the extent of even giving to me some 
of the chemicals which he had kept on his shelves for 
twenty or thirty years. He commented that nobody had 
ever asked for them and he was glad to see them used. 
Another firm which was selling precious metals would 
give us quite freely on loan any amount of gold (once 
10 Kg) or rhodium or platinum that they could supply, 
and with a display of confidence and real enthusiasm for 
this scientific work which had started in the University 
of Rome which we deeply appreciated. 

The irradiations went on quite regularly and soon we 
identified what in present-day language are called (1,p) 
(n,a) reactions. We also found that in many cases a 
neutron would produce a radioactive isotope of the 
target and there was great doubt among us whether this 
would be the result of an (n,2n) or (n,y) reaction. It 
must be remembered that at that time we still had the 
thought that the more energetic the neutrons the more 
effective they would be in producing reactions. At the 
beginning of May, while all this was going on, Rasetti 
had come back from Morocco. He joined us and he 
proposed to irradiate uranium in order to see what 


-would happen. This was accomplished with results that į 


puzzled us very much at first and which remained un- 
explained until the discovery of‘ission. Our main result 
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were that irradiated uranium produced elements which 
were of an atomic number not found between that of 
lead and that of uranium, In this we were perfectly 
correct and we had tested our experiments carefully 
with isotopic tracers. Our further conclusion was that 
the substances thus produced were transuranic. In this 
we were only to a small extent correct because we were 
completely blind to the possibility of fission, although, 
remarkably this was called specifically to our attention 
by Ida Noddack who sent to us an article in which she 
clearly predicted the possibility of fission. 

With all this work the sholastic year 1934 came to an 
end. I had been keeping the first isotope chart of my 
life (a much simpler job then than now) and could 
proudly point to about 40 red spots—our new radio- 
active isotopes. Senator Corbino, the director of the 
physics laboratory and also our close friend and adviser, 
took care that the results be properly advertised at the 
Italian Accademia dei Lincei in the solemn royal session. 

I have reread that speech® because it is a good re- 
flection of our thoughts at the time, expressed in a 
beautiful literary style. 

After having discussed the transmutation reactions 
he recalls a previous speech of his given to an Inter- 
national Nuclear Conference in Rome in 1931 in which 
he pointed out that in the stars, matter which is old and 
as it were, dead on our planet, is instead still young and 
in rapid evolution. He then considered, in 1931, the 
possibility of “rejuvenating” matter on the earth, as 
he put it. 

In 1934 he sees this dream realized although on a 
microscopic scale, but he underrates the technical pos- 
sibilities of the future. He says, “The scale of the trans- 
mutation is too insignificant yet to have any practical 
application.” (This was in 1934, in 1935 he had already 
changed his mind, as you shall see.) 

And finally he closes on a somber note: “It is perhaps 
not in vain that providence has limited these phe- 
nomena. Mankind is not yet worthy of having such ter- 
rific sources of power and destruction” and he ¢ontinues 
“whoever has any humanitarian senses cannot think 
with indifference as to the character that war would 
acquire, if nuclear weapons became possible.” 

The Royal Meeting of the Accademia dei Lincei was 
the official end of the season in Rome. As you know the 
summer there is hot and at the end of July we all left 
for a summer vacation. Fermi and his wife went to 
South America, Amaldi and I went to Cambridge, 
England, but I do not remember whert Rasetti and 
D’Agostino went (see Fig. 3). To Cambridge we brought 
the manuscript of a paper summarizing all our results 
which we wanted to have published in the Proceedings of 
the Royal Society,’ and I delivered it to the hands of Lord 
Rutherford in person. He had already shown great 


+I. Noddack, Angew. Chem. 47, 653 (1934). 

50. M. Corbino, Ricerca sci. 5 (1) 609 (1934). 

ê Fermi, Amaldi, D’Agostine, Rasetti, and Segrè, Prot. Roy. 
. (London) 146, 483 (1934). . 
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interest in our results, and he corrected our English. 
When I expressed our wish to have a speedy publication 
he said, “What do you think, I am President of the 
Royal Society for?” and indeed, the paper came out 
within about six weeks. 

In Cambridge during that summer Bjerge and 
Westcott,” with our help, tackled the question of the 
(nyy) versus (n,2n) reaction and found an example which 
we thought was very clear cut of an (n,y) reaction. 
(Indeed the result is correct, but it is based on an erro- 
neous assignment of the period of Na*.) 

At the end of the summer, Amaldi and I went back 
to Rome and Fermi, on his way back from South 
America, went to England to a conference on nuclear 
physics that was held at that time.8 While Fermi was 
at the conference Amaldi and I, always trying to settle 
this question of (x,y) versus (n,2n) reactions, irradiated 
other substances besides the one used by Bjerge and 
Westcott in order to find more examples of the (1,7) 
reaction and we thought we had found a nice example 
in aluminum which we duly communicated to Fermi. 
He referred to this at the meeting in England. Soon 
thereafter I caught a cold and could not go to the labora- 
tory for a few days. Amaldi tried to verify our results 
and found a different decay period for irradiated alumi- 
num which showed that our so-called (1,7) reaction did 
not occur. This was hurriedly communicated to Fermi 
and the result was that we got a real strong reprimand 
for having induced him to say things that were not true 
at the famous London conference. This whole business 
was in the process of making serious trouble because I 
was sure of my results, so was Amaldi but we could 
hardly reproduce them consistently. Fermi thought that 
none of us were reliable. At that time we were joined by 
our fresh Ph.D., B. Pontecorvo. It was now the begin- 
ning of the scholastic seasons 1934-1935 and we again 
started our systematic irradiations. Doing things a little 
more quantitatively it became quite apparent that the 
intensity of the radioactivity that we could obtain from 
various substances depended in a strange way on the 
conditions of irradiation and that there were unexplain- 
able irregularities in the intensity. In particular there 
was a table near a spectroscope which had miraculous 
properties in as much as silver (there were still silver 
coins in Italy at the time) irradiated on that table 
gained more activity than when irradiated in other 
parts of the same room. The explanation was that the 
table was of wood and that the source on the wood gave 
slow neutrons whereas in pther parts of the room the 
tables were of marble and the source on marble did not 
give slow neutrons. But all this was then unknown to 
us and these puzzling results baffled us for several days, 
or perhaps a week or two, until we decided to try to 
filter the radiation that produced the artificial radio- 
activity. By that time we were not even quite sure they 


1T. Bjerge and H. C, Westcott, Nature 134, 286 (1934). 
8 Papers and Discussions at the International Conference in 
Physics (Cambridge University Press, London, 1934), Vol. I, p. 75. 
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were neutrons anymore. In filtering thiseradiation we 
decided to filter with light elements or heavy elements 
and the first filter tried was paraffin. I remember that 
I was giving an examination and all of a sudden every- 
body from the group came rushing into the room telling 
me to run upstairs to the counters to see what had 
happened. I entered the room and my first reaction was 
that a counter had gone bad, as frequently kappened at 
that time (that is, it was discharging continously) and 
that there was nothing to be very excited about. But, 
I soon changed my mind when it was explained to me 
that the counter was not bad but simply was acted on 
by some substance that had been irradiated with filtered 
neutrons. This was around noon and we tried a few 
more substances and saw that the filtration, the power- 
ful filtration, occurred only with paraffin. We went to 
lunch extremely puzzled by this fact and came back 
after a siesta as usual around three o’clock to find that 
Fermi, in the meantime, had had a hunch that what 
possibly could make this strange behavior of the neu- 
trons was the fact that they could be slowed down by 
collisions and become more effective. If this were the 
case . . . we all started to shout with our loud Italian 
voices listing possible consequences and how to test them 
by experiments. I jumped to my old favorite, the (7,7) 
versus (n,2n) reaction, because I was still burned up by 
the alleged wrongness of our results and immediately 
tried to see whether by filtering the neutrons with par- 
afn the reactions (n,y) or (n,p) or (7,a) could be 
effected differently. In about half an hour we had the 
explanation of the disagreement between Amaldi and 
myself. Both were vindicated and by now we even knew 
the explanation of our previous results and could pro- 
duce at will short or long periods by bombarding 
aluminum. Furthermore, we had no need to correct the 
minutes of the London conference where’ Fermi had 
said certainly more than half of the truth and no lies. 

We also tested that the radiation was emitted by 
radium plus beryllium and not by radium alone, and 
that the effect of the paraffin was characteristic of 
hydrogen and not of carbon. Finally, we went all the 
way to think that the neutrons could really be ther- 
malized and instituted in the same day an experiment 
(unsuccessfull at the time) to demonstrate this fact 
by slowing the neutrons in hot water instead of cold 
water. All this happened on October 22 of 1934 and by 
the evening of that day a short letter to the Ricerca 
Scientifica telling all of these miraculous effects was on 
its way. Actually in order to write this letter we had 
to break our habits and write it after dinner. This was 
done at Amaldi’s house® This discovery obviously 
opened a host of problems and the first thing that we 
did was to measure for many substances what we called 
the coefficient of acquaticity,” namely, how much the 
Immersion in water would increase the activity. This 


gave us a confirmation that the (,y) reactions were the- 
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only ones seAsitive to hydrogenated substances and by 
November 7 we were fairly convinced of the correctness 
of the explanation.” Attention turned then more to the 
study on the slow neutrons themselves and we tried 
again to see whether slowing down in a hot or cold 
medium would change the properties of the neutrons. 
We had strong suspicions that the neutrons were 
effectively*thermalized and although we could not, at 
first, show any positive effect (this was shown for the 
first time by Moon and Tillman" in England), we kept 
trying this theme. We also found very soon that some 
substances, for instance, cadmium, absorbed slow neu- 
trons very strongly and we measured crude cross sec- 
tions for this effect. We detected the capture gamma 
rays and we also started crude measurements of the 
density of the slow neutrons in a hydrogenous medium 
as a function of the distance from the source. Finally, 
we tried to slow neutrons down by collisions with 
substances different from hydrogen and found some 
slight effects of inelastic collisions. All this work was 
initiated and had given significant results by December 
6 of 1934, that is about six weeks after the discovery of 
the slow neutrons themselves. Approximately at this 
time, Corbino came to us with the suggestion that dis- 
coveries or inventions of the type we had made by now 
might have important practical applications and that 
we should take a patent. This we did, and it is this 
patent that two years ago was acquired by the A.E.C., 
after a long and complicated story. 

The boron reaction was studied immediately after- 
wards and interpreted correctly as (,a). More sys- 
tematic work followed in line with the previous problems 
and with considerable emphasis on the uranium irradia- 
tion, on which, however, we did not make great progress. 
The main results of this work were summarized in 
a paper for the Royal Society" written in February, 
1935. In it there are the seeds of most of the im- 
portant ideas and facts of neutron physics. One 
serious difficulty was the apparent lack of correlation 
between the scattering and capture cross section in 
cases as e.g. Cd. What we did miss was Bohr’s idea of 
the compound nucleus. The concept of scattering length 
and the possibility of resonances were present in Fermi’s 
mind. Actually the scattering length argument with its 
typical diagram was stock in trade for him since 1933 
having,invented it in order to explain some spectro- 
scopic facts (pressure shift of spectral lines) which 
Amaldi and I had discovered experimentally. At this 
time, Fermi also made a number of calculations on the 
behavior of neutrons in hydrogenous media by what we 
would call today the Monte Carlo Method; he did not 
publish them and I learned of it only much later, after 
the war. 

10 Fermi, Pontecorvo, and Rasetti, Ricerca sci. 5 (2) 380 (1934); 
Amaldi, D'Agostino, and Segrè, Ricerca sci. 5 (2) 381 (1934). 
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Fic. 3, A group taken in early summer 1934. From left to right, 
D’Agostino, Segrè, Amaldi, Rasetti, Fermi. 


Finally, in the spring of 1935 we devised a mechanical 
experiment in which one could compare the velocity 
attained by the neutrons with the mechanical velocity of 
a wheel, and soon after the experiments of Moon and 
Tillman showing the influence of the temperature, we 
could not only confirm their result but also measure by 
a mechanical experiment some data relevant to the 
velocity of the neutrons. With this we had reached the 
summer of 1935 and as usual we stopped the work and 
scattered pretty much all over the world. (See Fig. 4.) 
I do not remember exactly where Fermi wentat that time. 
I came to the United States and spent some time at 
Columbia University perfecting the velocity selector and 
some more mechanical experiments together with the staff 
of Columbia and Rasetti. However, the political develop- 
ments of the time, namely the Ethiopian War and the 
very grave deterioration of the situation in Europe had 
practically affected very seriously our work even in the 
last months of 1935. There was a famous atlas in the 
library of the physics department in Rome which 
automatically opened on the page of Ethiopia because 
everybody was poring over it and the worries connected 
with that unhappy campaign and with the growth of 
Nazism affected the peace of mind even of the imper- 
turbable Fermi. 5 

When the fall of 1935 came and we should have gone 
back to Rome, Rasetti did not want to re-enter Italy. 
I had been appointed director of the physics laboratory 
at Palermo, Pontecorvo had left Italy for France, 
D’Agostino had taken employment in another scientific 
institute, and only Amaldi and Fermi were left in, Rome 
with neutron sources. From Palermo I went quite often 
to Rome in 1936 to see my old friends and work a little 
there. The spirit of the previous years had really*some- 
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Fic. 4. Ionization chamber and electroscope used in Rome in the 
second year of the neutron work when the hydrogen effect made 
stronger activities available. Several chambers similar to this were 
then built in U.S.A. laboratories which we visited at the time and 
were jocosely called “The Roman Sign.” 


what disappeared because the discoveries were not so 
frequent anymore and moreover the physical conditions 
of the work were very bad. I remember Amaldi and Fermi 
locked in rooms from which no light should filter out be- 
cause of the anti-air-raid regulations of the time. They 
were working very hard on what has become the famous 
study of the motion of the neutrons in hydrogenated 
substances.!*4 The so-called groups of neutrons were 
found and a beautiful piece of theoretical work was 
developed by Fermi, probably leaning on the Monte 
Carlo calculations mentioned previously, to find his way 
to the correct approximations. This laid the foundation 
of the slowing-down theory and the subsequent develop- 
ment of slow neutron diffusion theory. I think it was at 
this time that Fermi acquired, by the combined means of 
empirical experience, Monte Carlo calculation, and 
more formal theory, that extraordinary feeling for the 
behavior of slow neutrons which marked him for the 
development of the pile. He told me that he did this 
theoretical work at home from four o’clock to eight 
o’clock a.m. before coming to the laboratory. 

I have had occasion to reread these papers and also 
some of the wartime reports preceding the pile. It is 

13 B, Fermi, Ricerca sci. 7 (2) 13 (1936). 

44 Es Fermi, Ricerca sci. 7 (2) 13 (1936); E. Amaldi and E. 
Fermi, Phys. Rev. 50, 899 (1936). 
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still stunning how he could always find the correct and 
simple way to everything, guessing the essential points. 
At the end he obviously had a knowledge of the be- 
havior of slow neutrons which has probably not been 
equalled. 

Tt is difficult at such a distance of time to recapture 
the exact spirit prevailing. We were quite sure that the 
discoveries we had made were of great impertance and 
I remember also that we discussed repeatedly, in a 
half-serious vein, the possibility of a nuclear explosion. 
In the case of uranium, in particular, we thought that 
there might be enough of an (7,27) reaction to start a 
chain. For reasons which are not very clear even today, 
we did not consider seriously the possibility of fission. 
Fermi, with whom I occasionally discussed this subject, 
always maintained that he had wrong ideas about the 
mass defects of the nuclei. We followed very closely the 
work of Meitner and Hahn and we were well aware of 
the existence of a very important problem in the ura- 
nium irradiations. On the other hand we had wrong 
ideas on the chemical propertiés of elements 93 and 94. 

Among the experiments performed in Rome I might 
mention one in which we looked for the “alpha par- 
ticles” emitted by uranium under neutron bombard- 
ment. However we covered our uranium with enough 
aluminum to stop its natural alpha particles and hence 
also missed the fission fragments. But who can say 
whether even Fermi would have recognized them for 
what they were if he had seen them? 

The proportions that nuclear transmutations were to 
attain in a few years were completely out of our dreams. 
When we mentioned the possibility of having one curie 
of an artificial radioactive substance it was only in a 
jocose mode. 

However when the large neutrons’ sources furnished 
by the pile came along, Fermi was ready to use them for 
the furtherance of pure physics, and his interest in and 
knowledge of the solid state became very handy. The 
scattering length measurements, the neutron-electron 
interaction, the specular reflection, and other features 
of the diffraction of the neutrons were post-war appli- 
cations of neutron physics. The speed of these develop- 
ments staggers the imagination and it seems incredible 
that only eight years should have elapsed between the 
discovery of the slow neutrons and the criticality of 
the pile. 

But already at Los Alamos Fermi had the feeling 
that his next phase of activity would not be in neutrons 
but in something new and he reminded me that just as 
in 1934 he discarded all his investments in spectroscopy 
to go to nuclear work, so now he would leave the slow 
neutrons in order to proceed to new conquests in the 
field of high-energy physics. In a half-joking mode he 
quoted Mussolini: “Rinnovarsi o perire’—‘‘to renew 
oneself or to perish.” 
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Dr. Bethe: ; 


Thank you very much Dr. Segré. I think I can see 
from the applause that you all enjoyed the personal 
flavor of this talk as much as I did. 

Segrè has mentioned the puzzle that was posed by 
the activities induced in uranium by neutrons, and you 
all know that this puzzle found its solution in the dis- 
covery of fission by Hahn and Strassmann in late 1938. 
You also know that the political situation which Segré 
mentioned and which looked bad in 1935 became in- 
creasingly bad in the ensuing years; Italy came under 
the domination of Nazi Germany and Fermi, like Segrè 


REVIEWS OF MODERN PHYSICS 


VOLUME 27, NUMBER 3 


before him, decided to leave Italy for a more hospitable 
country. You know that Fermi received the Nobel prize 
of 1938 for the research in neutron physics which you 
have just heard, and you know that having received the 
Nobel prize in Sweden, he then took the wrong boat— 
instead of the boat to Italy he took that to America. 
We were most fortunate to have him come and work 
with us here in this country in early 1939, and much of 
history would have been different if he had not come. 
Just at the same time that Fermi came to this country, 
came the news of fission and this news led to very 
spectacular developments about which you will now hear 
from Dr. Zinn of the Argonne National Laboratory. 


JULY, 1955 


Fermi and Atomic Energy 


WALTER H. ZINN 


Argonne National Laboratory, Lemont, Illinois 


E are assembled here today to honor the memory 
of a great scientist, and a cherished friend. This 
tribute would be paid to him even if nuclear physics had 
not brought about the discoveries and events of the 
past 16 years which have bequeathed to the world the 
amazing collection of endeavors now included in the 
term “atomic energy.” The discovery of the fission of 
uranium happened to coincide with the arrival of Enrico 
Fermi in the United States. The rapid exploitation of 
this discovery in this country by means of a whole 
series of brilliant theories and experiments was in a 
large part inspired by him and, in a large part was the 
product of the work of his brain and of his hands. He 
inspired others, and he did so by example. 

In relating his contribution to atomic energy, three 
periods of time may be recognized. The first extended 
from January, 1939 to May, 1942, which was the period 
during which Fermi’s work was almost entirely at 
Columbia University. In this period, the group sur- 
rounding him was small, never exceeding nine or ten, 
and the organization for carrying out the work was un- 
complicated, perhaps almost nonexistent. The second 
period extended from May, 1942, to the winter of 
1943-1944. In May, 1942, Fermi and the group working 
with him at Columbia transferred to the Metallurgical 
Laboratory, at the University of Chicago, which had 
been specifically created to exploit the discoveries in 
uranium fission made up to that time. This second 
period marked the time during which Fermi’s work was 
mainly directed toward bringing about the self-sustain- 
ing chain reaction and the construction and operation 
of the production piles at the Hanford Engineer Works 
of the Manhattan District. In May of 1944, his work 


was transferred entirely to the Los Alamos Laboratory 
in New Mexico. In July and August of 1945, the atomic 
bombs were detonated, thus bringing to a close Fermi’s 
immediate participation in the development of weapons. 
For several years after the war, having returned to 
Chicago, he made some investigations using piles as 
the source of neutrons but his main interest rapidly 
shifted to other fields 

As has been related in the Smyth report, that part of 
the Manhattan District which was called the Plutonium 
Project involved many people in a sizable number of 
organizations. In the time available, it is impossible to 
relate all the ways in which the work of Fermi and the 
suggestions he made and the inspiration he gave to 
others contributed to the rapid progress of the project. 
Neither can the help received by Fermi from a devoted 
group of colleagues and assistants be recognized prop- 
erly. It is enough to note that Fermi always was gener- 
ous in acknowledging the contributions of others and 
the help he received from his assistants. 

The list of reports written by Fermi and his collabo- 
rators is long and only a few can be selected for mention 
here. These will emphasize results which applied directly 
to the objective of the project and are chosen with the 
full knowledge that mùch splendid physics research 
thereby is neglected. 

In a conversation I had with him early in the month 
of January, 1939, he expressed the opinion that prompt 
neutrons should be expected to be emitted in the fission 
process; that the way to a chain reaction would be 
opened if it could be shown that the number emitted 
was greater than the number absorbed. An experiment 
to find such prompt neutrons, if they existed, was 
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immediately started and in a short time a favorable 
result was obtained. Other groups in this country and 
abroad also obtained and published this result, although 
there was by no means agreêment as to the number of 
prompt neutrons emitted per fission in uranium. With 
the knowledge of this result it can be said, however, 
that the Plutonium Project was born. Through the 
efforts of Fermi and others associated with him, the 
Government of the United States was approached in a 
serious way and was made aware of the consequences 
which could follow. The story of the approach to Presi- 
dent Roosevelt has been told many times and will not 
be repeated here. Meanwhile, Fermi pushed on with 
experiments and theoretical analyses. The question of 
the production of neutrons in uranium when bombarded 
by neutrons was the subject of several further experi- 
ments which were reported in 1939 in The Physical 
Review. In these experiments his collaborators were 
Anderson, Szilard, and Hanstein. Meanwhile, with 
Booth, Dunning, Glasoe, and Slack, Fermi reported 
experiments on the properties of the fission fragments 
and evidence was presented to confirm the prediction 
that U25 was the so-called fissionable isotope of ura- 
nium. At this point in following Fermi’s work a compre- 
hensive picture no longer can be obtained from the 
published literature. The reports became classified and 
some have remained so although they no longer contain 
material which requires classification. In parallel with 
these experiments in the period 1939 and 1940, Fermi 
began to work òn a chain-reacting system based on 
uranium and graphite. Sometimes alone and sometimes 
in consultation and collaboration with Leo Szilard, 
Eugene Wigner, and others, the theory of the chain 
reaction in a lattice of uranium and graphite was 
evolved. The theory of the slowing down of neutrons in 
a moderator was refined and the concept of “Fermi 
Age” introduced. The advantage of lumping the ura- 
nium and the role of resonance absorption was out- 
lined. In a characteristic manner, Fermi identified the 
various factors upon which information would have 
to be obtained in order to assess the feasibility of a 
chain-reacting unit. Some of the experiments were ini- 
tiated directly by Fermi at Columbia and some, con- 
cerning resonance absorption in lumps or spheres of 
uranium, were undertaken at Princeton University 
under the direction of E. P. Wigner. Details of all the 
experiments which were undertaken in this period can- 
not be related, but it is significant to note that by mid- 
summer of 1940 experiments on the slowing down of 
neutrons by graphite and the absorption of neutrons 
by graphite were underway at Columbia. By the spring 
of 1941, it is my recollection that Fermi gave Jectures 
to the group working with him in which the essential 
aspects of the chain-reaction theory, as we know it now, 
were outlined. The term “reproduction factor” was used 
and the formula k= enpf had been derived. 
The summer of 1941 marked the time when the first 
large-scale experiment was attempted. Money had been 
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made available by the Federal Government and the 
purchase of graphite and uranium of a quality suitable 
for nuclear work, or what was thought to be suitable 
for nuclear work, had been accomplished. This inter- 
mediate pile experiment, as it was called, was the first 
of what later became known as the exponential experi- 
ment. The principal component was a rectangular prism 
of graphite, 8 feet on a side and 11 feet tall. ight tons 
of uranium were to be incorporated in the structure in 
lattice form. Various practical problems had to be faced 
and a closer look at the way in which the work was 
carried on will give an insight into the manner in which 
Enrico Fermi set about an important measurement and 
the manner in which he inspired his co-workers to stren- 
uous effort. Fermi entered into all parts of the work 
with enthusiasm. His meticulous attention to the de- 
tails of the experiment was always coupled with a com- 
plete awareness of the factors which might affect the 
ultimate result. At that time in 1941, the situation 
with respect to the cross sections for neutron absorption 
in uranium and graphite was swch that the real values 
were always in danger of being obscured by the absorp- 
tion due to unknown impurities. Fermi, especially, was 
concerned about the effect of entrained moisture and 
the effect on the result of the experiment of whatever 
provision was made to contain the powdered U8, which 
was the only form in which a sizable amount of uranium 
could be obtained at that time. Once the materials were 
on hand, he spent long days in the mechanical task of 
erecting the exponential structure. The uranium oxide 
was heated to several hundred degrees centigrade and 
then placed in thin sheet steel cubical boxes, 288 of 
them 8 inches on the side, and while still hot the 
containers were soldered to guarantee the exclusion of 
moisture. Fermi insisted on doing his share of the filling 
and sealing of the boxes. In the erection of the graphite 
column, slots had to be cut for the subsequent insertion 
of foils for the measurement of neutron intensity. Fermi 
did not hesitate to push graphite blocks through a 
bench saw while saw and man disappeared in a cloud 
of graphite dust. As is well known, the exponential 
experiment requires a high-precision measurement of 
neutron flux throughout the lattice arrangement. A 
program of measurement of the activation of indium 
foils was developed which used a number of Geiger 
counters. By suitable standardization and careful con- 
trol of the measuring techniques, high-precision results 
were obtained despite the well-merited poor reputation 
of the Geiger counter as a precision instrument. This 
first exponential experiment gave the result that the 
reproduction factor for the particular structure was 
0.87. This result was disappointing and the conclusion 
quickly was reached that this result was not consistent 
with the cross sections as they were then known. Experi- 
ments were set underway to find the source of the im- 


purities which caused the difference between the results . 


observed and what was expected from the cross sections. 
The iron sheet of which the containing boxes were made 
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was tested hy neutron absorption. Experiments on the 
resonance neutron distribution in the neighborhood of 
a single cell of the lattice were undertaken, The purity 
of the uranium was questioned. Fermi took the lead in 
separating the impurities by an ether solvent extraction 
method. The intention was to collect sufficient of the 
impurity material to make a determination by the 
absorption of neutrons of the poisoning effect. This was 
done and Fermi did not fail to do his share of the time- 
consuming and laborious work involved in dissolving 
many pounds of uranium oxide and in a painstaking way 
using separating funnels to isolate the impurity. The 
conclusion was that the uranium did contain significant 
impurities which had not been taken into account in the 
calculations. Incidentally, “natural neutrons” first were 
observed in this experiment. They were, of course, the 
neutrons from the spontaneous fission of U”%, 

As the result of discoveries at Berkeley, a new mili- 
tary objective was set for the graphite-uranium chain 
reaction. Fermi and his associates had always been con- 
vinced that a chain-reaeting system would find military 
uses either as a power source for ships, for instance, or 
for the manufacture on a large scale of radioactive 
isotopes. The discovery of plutonium gave a new and 
definite goal for the project. 

Hence, in the fall of 1941, the Metallurgical Project 
was organized at the University of Chicago under the 
leadership of Arthur H. Compton. Fermi was invited 
to join and continue the work which he had been 
directing at Columbia. He and his assistants made the 
transfer to Chicago by May, 1942. Fermi resumed his 
work with the exponential pile method on an expanded 
scale. With the full power of the Government behind the 
program and with the urgency of a war, both uranium 
and graphite flowed to the Laboratory in ever increasing 
amounts. The exponential pile method was applied with 
great success to analyzing successive batches of mate- 
rial. Experiments were carried out in which the dimen- 
sions of the lattice and the amount of uranium in each 
cell of the lattice were varied. In that summer of 1942 
the planning of such experiments, their execution, and 
the analysis of the results occupied a good fraction of 
Fermi’s time, but not all. Almost 30 separate exponen- 
tial experiments were done. An exponential pile would 
be constructed in one or two days working time. In one 
or two days more, the neutron flux within it would be 
mapped. The results of the measurements would be 
given to Fermi, who in a few hours would announce the 
new value for the reproduction constant. This seeming 
miracle in calculation was possible because sometime 
during the period of construction of the exponential pile 
Fermi would find the time to work out all the correction 
factors and to prepare the equations so that the fewest 
mathematical operations would be required with the 
actual data to find the answer. Much of this labor of 


_ mathematical manipulation was done in the early 


morning hours, since it was his custom to be the first 
to work in the morning. Because the results were im- 


portant for the planning of the next experiments, Fermi 
felt he had to have them at the earliest moment. Offers 
to assist in this drudgery were gently rejected because 
Fermi knew that he could get the answer sooner and 
without error by his own methods. In visiting him in his 
office, it was not necessary to knock on the door to 
discover if he was in. If you didn’t hear the clacking of 
the desk calculating machine while walking down the 
corridor, it was a better bet to try the nearest laboratory. 

In June of 1942, the exponential experiments revealed 
a result, namely $ greater than one, for which it could be 
said that a sufficiently large lattice of uranium and 
graphite would be self-sustaining. At that time, plans 
were laid for the pile which first became critical on 
December 2, 1942. Nevertheless, there was such un- 
certainty in the value of the measured reproduction 
constant that Fermi gave his opinion that there was less 
than a 50% chance that the materials expected to be on 
hand by December would be sufficient to reach a self- 
sustaining reaction. Therefore, he urged that provision 
be made to make the pile as large as the physical space 
would permit so that as additional materials were 
received they could be added to the structure. Also, he 
insisted that provision be made so that the absorbing 
nitrogen of the atmosphere could be removed if needed. 
To permit this, the pile was shrouded in an enormous 
balloon cloth tent. The manufacturers of the balloon 
cloth understandably were perturbed when they noted 
that the order called for a balloon of rectangular cross- 
section. In the days just preceding December 2, 1942, 
Fermi steadily made measurements of the neutron 
intensity within the structure and several days before 
criticality actually was achieved, he was satisfied that 
the critical point easily would be reached with the mate- 
rials at hand. Perhaps this helps to explain why on that 
memorable day while Fermi called to George Weil to 
withdraw the control rod ever farther out of the pile, 
the calmest man, the most self-possessed man, was the 
leader who, in his own mind, several days prior had 
passed the point of doubt about the success of the 
experiment. 

During the months that the materials for the first 
pile were being accumulated, Fermi was actively engaged 
in two other endeavors. A group of physicists, headed by 
Herbert Anderson, continued the work of measuring 
the basic cross sections for neutron reactions of interest 
to the project. Of particular interest was the absolute 
determination of the number of neutrons emitted by a 
given source and the techniques for calibrating detection 
foils and Geiger counters. Although the exponential pile 
method just related gave a way of getting results in an 
integral fashion, persistent efforts were made to obtain 
ever more precise values for the basic constants. A 
standardized neutron flux was essential for bringing 
about this situation and Fermi pursued this goal with 
great determination, 

While the Laboratory entrusted with the al 
sibility of the nuclear phases of the chain reactioh was 
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pursuing its objective, the Project as a whole was pro- 
ceeding with plans for the production of plutonium. 
Fermi’s notebooks, which often were brightly colored 
paper tovered college examination books, show that 
early in 1942 and continuously he made calculations on 
heat transfer, critical size, temperature coefficients, con- 
trol rod effectiveness, and other matters for a variety of 
production pile designs. Two kinds of production seemed 
to be required: A small amount of plutonium, gm 
amounts, to be obtained at the earliest possible moment 
so that the chemistry and nuclear properties of this 
strange clement could be studied; and secondly, a 
large amount for weapon use. Fermi made calcula- 
tions on a surface-cooled graphite pile for the first 
application and for a helium-cooled, a water-cooled, 
and a bismuth-cooled pile for the second. He was a 
member of the Policy Committee of the Laboratory 
and the record shows that he entered enthusiastically 
into the discussion of the engineering problems and the 
probable success for the various types of piles suggested. 
In reading the reports it is interesting to note, however, 
that when the purely engineering considerations began 
to demand ever larger quantities of structural materials 
in order to improve performance, it was Fermi who 
called the matter into question by indicating that the 
reproduction factor had to remain above one or the 
whole thing would not work. Repeatedly he was asked 
for his opinion of the effect of a certain structural change 
or of temperature on the operation, and repeatedly 
Fermi indicated that only recourse to experiment could 
give an answer. He knew that the project was pro- 
ceeding without the detailed knowledge of fundamental 
constants which would have permitted all decisions to 
be made by calculation. 

The demonstration of the chain reaction in December, 
1942, focused attention on the next step, the design of 
production piles. To this task Fermi gave first priority. 
Early in 1943, it became clear that the light water- 
cooled version of the graphite pile evolved in the theo- 
retical section under E. P. Wigner’s direction had the 
best chance of success. Fermi accepted the design and 
put all his experimental resources to work to produce 
the information which the DuPont Company, which 
had accepted responsibility for the construction of the 
production units, demanded in order to be able to build 
these very large and costly machines. Let me cite 
several examples. A reactor operating at high power re- 
quires a massive shield. With many hundreds of cooling 
channels to be provided for, the penetrations of this 
shield become a matter ot crucial importance in the 
design and construction. Which materials to use and 
in what thickness were the questions asked? Information 
on shielding for the mixture of slow neutrons, fast 
neutrons, and gamma rays emitted by the pile just did 
not exist. Fermi proposed to get the information with 

experiments. Time was extremely short and the only 
tool available was the first pile itself. The idea that the 


shield should be composed of a combination of a heavy 
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element and hydrogenous material was reqdily appar- 
ent. Inquiry soon showed that the only practical heavy 
element in the quantities required was steel. Twenty- 
five tons of steel and suitable quantities of paraffin wax 
were used to mock up various compositions of shields 
in a five foot square hole fashioned in the top shield of 
the pile, which meanwhile had been moved to the 
Argonne Forest Laboratory. A period of twp weeks of 
furious measurement transpired. Shielding material was 
stacked in the hole in the pile shield with detectors for 
neutrons and gamma rays inserted at various depths. 
As the shield became thicker, the pile was operated at 
ever higher power. Whereas the original structure was 
assumed to be limited to a total power output of 10 
kilowatts, it was soon operating at 100 kilowatts and 
above. Gamma-ray intensity was measured with a 
Lauritzen electroscope, neutron intensity with stand- 
ardized foils. By no means was this a clean experiment 
since the changing hardness of the radiation as it 
penetrated the shield had a pronounced effect on the 
efficiency of the detectors. Fernai, however, calling upon 
all his knowledge of neutron physics and of nuclear 
reactions did experiments to obtain the necessary cor- 
rections and ended up with a table of attenuation 
factors for neutrons and gamma rays for a certain mix- 
ture of iron and paraffin. Today that table of relaxation 
factors is still acceptable for the radiation emitted from 
a graphite pile. It was an example of brilliant interpreta- 
tion of experimental data where time did not permit the 
careful analysis of detector response which would have 
been a normal procedure. Intuition and mathematical 
skill were substituted, with the result that a useful 
shield for the production piles was designed and at the 
required time. 

Somewhat later, the question of the stability of the 
production piles came into question. Apparently it 
could not be calculated unequivocally that the tempera- 
ture coefficient would have a negative sign which was 
thought to be a prerequisite for safe operation. Despair- 
ing of resolving the question by calculation, Fermi 
suggested removing the central portion of the first 
graphite pile and replacing it with a facsimile of what 
the production machine was to be. By heating the fac- 
simile with electricity and observing the effect on the 
reactivity of the pile, he came to the conclusion that the 
temperature coefficient would be negative, but only 
slightly so. It was not the kind of experiment that was 
his preference. If time had permitted, he would have 
studied the effect of temperature on the basic effects 
involved. Fermi knew when short cuts were needed and 
when it was necessary to produce a concrete result as 
an encouragement to progress for the main project. 

To pursue the plutonium production pile story fur- 
ther, we find Fermi at the Hanford Engineer Works in 
the State of Washington in the fall of 1944 assisting in 


the startup of the first large pile. It is customary even. 


today to take an extended period of time, perhaps three 
months, to bring a large reactor to operating power. 
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FERMI AND 
This would have been clearly inconsistent with the pur- 
pose of the war-time project and it was greatly due to 
Fermi’s advice, work and presence that the production 
reactors were brought to operating power in a brief span 
of time. This is not to say that the startup was un- 
eventful. The first attempts at extended operation at 
design power encountered the serious poisoning by X, 
This was an unexpected phenomenon, but the team of 
physicists assigned to watch the startup soon had the 
answer. The reactivity was being affected by a factor 
which had all the properties of a radioactive substance. 
By studying the behavior of the reactor and combining 
this with the period of radioactive decay and yield of 
fission products, the culprit was identified and once 
identified could be countered. 

Meanwhile, at the Argonne Forest Laboratory, Fermi 
was giving his attention more and more to the use of the 
first reactor as a tool in research. Characteristically, the 
first use was a practical one as far as the project was 
concerned. The reactivity of the pile could be used to 
measure the neutron absorption of samples introduced 
into it. An early example was the fact that the control 
rod calibration had to be corrected for atmospheric 
pressure, Under Fermi’s direction this quickly evolved 
into a method for testing uranium and graphite, and 
became known as the danger coefficient method. It was 
then used in a routine way to analyze for purity, samples 
of all the uranium and graphite which were used in the 
production piles. This was a practical step but the same 
technique, refined and standardized, was used to meas- 
ure the absorption cross sections of many of the ele- 
ments of the periodic table. In fact, the collection and 
tabulation of these constants received a great accelera- 
tion because of the power of this pile method. 

In this period, Fermi initiated the work with a beam 
of neutrons emitted from the pile. Neutrons in the 
resonance energy region, as well as thermal neutrons, 
were investigated. The technique of obtaining very 
slow neutrons, so-called “cold neutrons,” by filtering 
through a long column of graphite was evolved. The 
transmission of neutrons through microcrystalline mate- 
rials was studied and the effect of diffraction noted. 
These early diffraction effects were the direct ante- 
cedents of the crystal diffraction techniques which have 
developed into one of the most powerful methods for 
the study of crystal structures. Leona and John Marshall 
and H. L. Anderson were his principal collaborators in 
this work. 

The nuetron chopper, which has been so successful 
as a tool for exploration of neutron reactions at energies 
in the 10- and 100-volt regions, was a Fermi invention. 
The prototype of the first chopper which Fermi made 
with his own hands was merely a stack of alternate 
aluminum sheet and steel shim stock held together with 
laboratory C-clamps. In a few hours, by measurement, 
Fermi had determined the transmission of a collimated 
beam of neutrons for the stack as a function of the angle 
of the plane of the sheets of tht stack with the neutron 
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beam. A calculation gave the energy resolution to be 
expected if the chopper of aluminum and steel plates 
were to be rotated in the,beam at high speed. A short 
interval for shop work, and an operating chopper was 
in existence. It was not applied first to substances 
having resonances which might give exciting-looking 
graphs, but rather was used to study the cross section 
of boron, for after all it was boron which was used as the 
standard for all absorption cross section work. 
A few titles of papers or reports by Fermi and his 
co-workers will suffice to indicate the range of problems 
made possible by the intense and steady source of neu- 
trons provided by the pile: 
Production of Low Energy Neutrons by Filtering through 
Graphite, 

Phase of Neutron Scattering, 

Transmission of Slow Neutrons through Microcrystalline 
Materials, 

Interference Phenomena of Slow Neutrons, 

Spin Dependence of Scattering of Slaw Neutrons by Be, Al, and 
Bi on the Interaction between Neutrons and Electrons, 


and so on. The list is long and it is abundantly clear 
that the new intensity was being exploited by a master 
of theory and experiment. 

An example of Fermi’s method of starting the in- 
vestigation of a new effect may give a better insight 
into his methods and will illustrate why it was not only 
rewarding to work with him but a source of great 
pleasure as well. 

When the first heavy-water reactor of the Argonne 
Laboratory had reached steady operation, I began to 
experiment with the collimation of the neutrons from 
the thermal column. By means of cadmium slits, a beam 
several inches high, but having an angular spread of 
only one or two minutes, was produced. The intensity 
as measured by a BF; counter was high, a few thousand 
counts per minute. Fermi was intrigued. He followed 
the beam with a counter away from the reactor, a 
distance of 100 ft or so, and noted how it was weakened 
by air scattering. He roughly checked the scattering 
cross section for nitrogen with his observations. We 
discussed the possible use of the beam in a scattering 
experiment and then he left for the day. 

Several days later, he returned with the proposal that 
the highly collimated beam be used to look for the total 
reflection of neutrons. He had a piece of paper with the 
formula for the index of refraction and calculated 
values for the index for aluminum, carbon, and beryl- 
lium. It seemed that we needed a mirrorabout one foot 
wide made of one of these substances. It would take 
considerable shop work to fabricate one of aluminum. 
Beryllium, we didn’t have. Why not graphite, there 
was plenty around? While I found a divided circle from 
a student spectrometer, Fermi worked away at making 
a flat block of graphite by lapping it against æ steel 
surface plate. Soon we were searching for the totally 
reflected beam of neutrons and found it. Thus began a 
technique which has proved te be of great usefulness in 
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the study of the phase of neutron scattering and in the 
production of polarized neutrons. Refinements in ap- 
paratus and technique have followed and in the hands 
of D. J. Hughes particularly, has produced results in 
neutron physics of classical stature. 
Although by the summer of 1943 the principal de- 
cisions concerning plutonium production were made, 
there still was the problem of providing alternate 
methods if it were finally proved that the first choice, 
the graphite pile, should fail. Due to the urging of 
Harold Urey, plants for the production of heavy water 
had already been built. A recurring question was how 
much heavy water would be needed for a heavy water 
moderated pile. The situation here was quite contrary 
to that of the case of graphite. Graphite had always 
been available in quantity. It needed improvement as 
to its contained impurities. Heavy water, however, was 
practically nonexistent and experiments to determine 
the amounts needed for reactor construction had to be 
undertaken with very limited amounts. Fermi saw the 
opportunity for doing this using the first graphite pile. 
Deuterium could be bombarded in the pile. The flux of 
neutrons was known fairly reliably. The amount of 
tritium formed could be measured and thus the capture 
cross section for deuterium determined. Alex Langsdorf 
undertook this important task. No information existed 
for the absorption cross section of oxygen. This was 
determined by the so-called danger coefficient method ; 
namely, a sample of graphite was compared with a 
sample of liquid carbon dioxide in the pile for its effect 
on the reactivity of the pile. With these results, Fermi 
was able to specify that six tons of heavy water would 
be adequate for a pile fueled with natural uranium. 
Fermi took an intense interest in the design of this 
pile, especially as it incorporated openings in the shield 
which would make possible better experiments with 
neutron beams. While this unit was under construction 
during the winter of 1943-44, Fermi left the Chicago 
Laboratory for Los Alamos. Just before doing so, he 


Dr. Bethe: 


Thank you very much Dr. Zinn for this talk which 
had the one fault that the name of Walter Zinn did not 
appear sufficiently. 

You have heard how the researches of Fermi and his 
collaborators led to the successful chain reaction and 
then to the production of plutonium which in turn formed 
the basis of the Los Alamos effort. You have heard also 
that Fermi himself joined Los Alamos in early 1944. 
Long before that time he had been coming to Los 

Alamos to advise us on our program of theoretical and 
experimental research. At Los Alamos we all loved him 
and relied on him to give us the simplicity of viewpoint 
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carried out a number of measurements, using the 
graphite pile, which were of vital significance for the 
work of the weapons laboratory. The first gram of 
enriched U% produced at the Oak Ridge plant was 
delivered to him. By ingenious techniques, combining 
beam and danger coefficient experiments, he determined 
the fission cross section, the capture cross section, and 
the number of neutrons emitted per neutrofi absorbed 
for this highly important substance. The techniques 
used had long been in preparation and in a surprisingly 
short time he had the essential data. Later on, when 
the first gram of plutonium was produced by the Clinton 
Engineer Works, or X”, as it was called, it too was sent 
to Fermi and he repeated the measurements for this 
material. 

Iam not able to report on the contributions made by 
Fermi to the work of the weapons laboratory. I saw him 
only once during the war on a visit to Los Alamos. I 
went to his laboratory and the situation appeared 
normal. There was a pile, the water boiler. Fermi sat at 
a table on which the recorders from counters were 
making their familiar noise. A yellow pencil was be- 
tween his lips and a slide rule in his hand. Another 
experiment was underway. 

Fermi frequently was asked what he thought would 
be the most important result of the developments in 
atomic energy. At least on one occasion, I heard him 
express the point of view that it would take a consider- 
able time before the generation of electricity or the use 
of the pile in research, including research with pile- 
made isotopes could be judged as the more beneficial 
to mankind. His own hunch was that the research uses 
would win out. 

His work in atomic energy clearly shows the enor- 
mous power possessed by an investigator who is skilled 
both in theory and in experiment. As such a man, 
Fermi was supreme. We cannot expect to see his equal 
for a very long time. 


and the masterful combination of experimental and 
theoretical insight which he possessed. After the war he 
often returned to the state of New Mexico which he had 
come to love. He made great contributions to the 
development of weapons, including the H Bomb, after 
the war just as he had during the war. 

At the end of the war, Fermi returned to Chicago. 
You have heard of some of his researches on slow 
neutrons. You have also heard that his main interest 
soon shifted to new fietds at the newest frontier of 
physics. You will now hear about this phase from Dr. 
Anderson who was associated with Fermi in his re- 
searches on meson physics. 
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if THOUGHT you might like to hear more about 
what it was like to work with Enrico Fermi, in an- 
other setting. Let me tell you about the time we decided 
to work together with the pi mesons. 

It was just about four years ago. At Chicago, we had 
just completed a large synchrocyclotron which could 
accelerate protons to 450 Mev. Fairly intense meson 
beams, rather well defined in energy, could be obtained 
from this machine. 

There was reason to believe that the pi mesons played 
a rather fundamental role in the nuclear interaction. 
Of the four interactions which are known in nature this 
was the strongest and yet the least understood. 

Fermi’s idea was that we should do scattering meas- 
urements. This was the classical way to study nuclear 
interactions, since Rutherford’s time. Even if the 
method was not very novel, it had the merit of being 
simple and direct. From Earl Long, who had constructed 
a liquid-hydrogen plant, we could have ample quantities 
of liquid hydrogen. So we could do directly, the most 
fundamental of scattering experiments, pions on 
protons. 

It was Fermi’s style always to add something quite 
individual and original to every enterprise he entered 
into. For the cyclotron he volunteered to make a trolley 
(Fig. 1). This was an exceedingly useful device on 
which the cyclotron target was mounted. The trolley 
could be moved about the rim of the magnet pole by 
remote control and the target could be raised out of, or 
lowered into the proton beam. 

These motions were powered by sending currents 
through appropriate coils, taking advantage of the 
torque which the magnetic field of the cyclotron would 
then exert upon them. The target was connected to a 
heat reservoir through a calibrated heat leak. Thus, by 
measuring, with thermocouples, the temperature drop 
across this, it was possible to record the power delivered 
by the beam to the target. This served as a useful 
monitor for our experiments. 

Fermi’s trolley was not a thing of beauty, but he made 
the whole thing himself in one weekend and it worked 
very well for several years. He was proud to point out 
how much more quickly he had finished it, than if it 
had been made in the shop. 

No one could accuse Fermi, of contributing less than 
twice his share of the sweat and labor that goes into 
every experiment. For although we were joined at this 
time by Darragh Nagle and Ronald Martin, two of the 
hardest working physicists I ever expect to meet, we 
must still give Fermi the major credit for laying out the 
meson beams, designing and ealibrating the deflection 


magnet, assembling scintillation counters and even 
wiring some of the electronic circuits we had to use. He 
traced the orbits of the pions so relentlessly that before 
long we felt we knew the number of every one. 


TRANSMISSION MEASUREMENTS 


The transmission measurement is very simple in 
principle. The pion beam is made to pass first through 
two scintillators (Fig. 2) then through the hydrogen 
target, and finally through two somewhat larger scintil- 
lators. The number D of double coincidences of the 
pulses from the first two scintillators establishes the 
number of incident particles, the number Q of quadruple 
coincidences measured simultaneously gives the number 
transmitted by the hydrogen. Except for a number of 
small corrections which must be made, the total cross 
section is obtained from the relation 


(O/D)5/(O/D) NoH = exp(—aNV) 


where N is the difference in the number of hydrogen 
atoms per cm? in the target when filled with the liquid 
(H) or with the gas (NoH). 

We learned a great deal about pions from these ex- 
periments.! Notice, first of all, (Fig. 3) that at low 
energies ¢~E*. This is typical of a strong P-wave 
interaction. The cross section becomes sizeable around 
one unit of angular momentum. Thus, 


rX p= (h/uc)- ph. 


This seemed to fit rather nicely with the idea that the 


P 


Fic. 1. Trolley for cyclotron built by Professor Fermi. 


1 Anderson, Fermi, Long, Martin, and Nagle, Phys. Rev. 85, 934 
(1952); Anderson, Fermi, Long, and Nagle, Phys. Rev™g5, 936 
(1952). - 
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Fic. 2. Schematic diagram of scintillation apparatus. 


pion was a pseudoscalar particle. It ruled out many of 
the possible meson theories and concentrated attention 
on those in which the coupling was pseudoscalar or 
pseudovector. 

Notice, second of all, that for m~ the cross section 
levels off around the geometrical value 


o=7(h/uc)?=67mb. 


This fits in with the idea that the interaction is strong 
and of range short and of the order of the pion Compton 
wavelength. This was a clear sign that pions must play 
a central role in nuclear behavior. Confirming Yukawa’s 
idea, the process of emission and absorption of pions 
must be the major contributor to the nuclear forces. 

A third feature arises from a comparison of the cross 
sections for r+ and m~. Notice o(x+)=2-3 times e(r”). 
This seemed strange in view of the greater mutiplicity 
of reactions with m~. For m~ we have 


a + P+ P elastic scattering, 


m + P—nr+ N charge exchange scattering, 


m+ P—y-+-N radiative capture, 


o 50 100 150 200 UEV. 


iic. 3. Scattering cross section for pions on protons. 
(Squares are for m~, crosses for x*.) 
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while for x only 
at-+-P—1t-+ P elastic scattering 


seems possible. 

This anomaly puzzled Fermi very much. I recall the 
day we were measuring this cross section. Fermi was 
running the counters. He had a stopwatch in one hand, 
a slide rule in the other, the desk calculator was clatter- 
ing away, and his eyes were keeping a close watch on the 
flashing neon indicators to detect any possible mis- 
behavior. After he recorded each count, he would cal- 
calculate the cross section. He kept shaking his head 
because it kept coming out so high. 

There was so little for me to do that I just sat back 
and began to go through my mail. On this day there was 
a preprint of a paper by Keith Brueckner* on meson 
nuclear scattering. “Enrico,” I said after glancing at one 
of the curves, “‘here’s a guy who seems to think the rt 
cross section should be higher than the m7.” Fermi was 
disparaging in his retort. “How should he know any- 
thing about it?”. 

“But Enrico,” I persisted, now taking the trouble to 
scale off Brueckner’s curve, “this fellow Brueckner says 
we ought to be getting about 120 millibarns for the cross 
section.” “Were getting even more than that” admitted 
Fermi. “Let me have a look at that paper.” Then, “Will 
you take over for 20 minutes while I go up to my office?” 

I suppose he consulted his “Artificial Memory” for 
he was back in 20 minutes with a broad grin. “The cross 
sections will be in the ratio 9:2:1 for m*+:2°:a~ scatter- 
ing,” he announced. This would be the case if the domi- 
nant interaction which took place was for the state of 
isotopic spin 3. 

It was necessary to carry out, forthwith, a com- 
parison’ of the charge exchange cross section with the 
ordinary scattering for m~. This confirmed the predictions 
of a 2:1 ratio in the cross section for these processes. 

The importance of charge independence in nuclear 
processes appeared very evident in these experiments. 
Fermi was so taken by the simple ratios that he made 
them the theme song of his Richtmyer Lecture at the 
following New York Meeting of the American Associa- 
tion of Physics Teachers. 

The result spurred a revival of interest in the role of 
the isotopic spin in the more classical variety of nuclear 
reactions and provided an important key to the under- 
standing of all nuclear processes. 


ANGULAR DISTRIBUTION MEASUREMENTS! 


The next task was to obtain more detailed informa- 
tion about the scattering process. This could be done by 
making angular distribution measurements. In this case, 
one tries to detect the scattered particles. This is in con- 
trast to the transmission measurement which deter- 


2 K. A. Brueckner, Phys. Rev. 86, 106 (1952). 


* Fermi, Anderson, Lundby, Nagle, and Yodh, Phys. Rev. 85, - 


934 (1952). 


4 Anderson, Fermi, Marti: 
(1953). rmi, Martin, and Nagle, Phys. Rev. 91, 115 
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mines the namber of particles lost. There is a check since 


do 
or= | —dQ. 
dQ 


At low energies we could expect s- as well as p-waves 
to be important so that neglecting d-waves, the angular 
distributidn would be of the form 


do 
—=a+b cos-+c cos’h. 
dQ 


Measurements at three angles for each of the processes 
tint, T —r’, r —r, would be sufficient to deter- 
mine the a, b, c’s for each, uniquely. The process m —y 
is weak and could be estimated by detailed balance 
from its inverse, the photomeson production process. 

Taking spin zero for the pion, this led to three angular 
momentum states, 54, p, and p;. These could exist in 
either of two isotopic spin states T=} and T=3; six 
different states in all. Thus, six phase shifts should be 
sufficient to describe the scattering as in Table I. 

Nine experimental data were available to determine 
six parameters, a situation which allowed some opportu- 
nity to check the assumptions. 


PHASE-SHIFT ANALYSIS 


The problem of finding the phase shifts for the pion- 
proton scattering began to occupy the center of the 
stage. A least-squares type of fit was called for to make 
maximum use of the data. Since the relation between the 
phase shifts and the cross sections is not a simple one, 
this looked like an ideal problem for the electronic 
computer. 

Fermi liked to spend his summers at Los Alamos. In 
the summer of 1952 Nick Metropolis had his wonderful 
computer, the MANIAC, in operation. Nothing could 
have worked out better. The MANIAC fascinated 
Fermi and he had been itching to get his hands on it. 
Now he had a problem to put to the machine. Metropo- 
lis would teach him how, and he could feed in his ques- 
tions, watch the machine digest them, and gather the 
answers as they rolled out. 

The machine found the phase shifts all right. The 
trouble was it found too many.® It found several sets of 
phase*shifts which fit the experimental data equally 
well. The puzzle then was, which solution was the cor- 
rect one? 

I’m sorry that Fermi didn’t wait around to see in 
what an interesting way this puzzle has come to be 
resolved. The key turned out to be the Principle of 
Causality. 


CAUSALITY 


The statement of the Principle is simple enough. It 
depends on the fact that signals cannot propagate 


5 Fermi, Metropolis, and Alei, Phys. Rev. 95, 1581 (1954). 


Tape T: Phase shifts for the pion-proton se 
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faster than the speed of light. Because of this, the 
causality condition is that the scattered wave should 
remain zero until the incident wave reaches the scatterer. 

A well-known consequence of this condition is the 
Kramers-Kronig*” dispersion formula which gives a 
relation between the real and imaginary parts of the 
forward scattering amplitude for the scattering of light 
by atoms. 

Recently, Goldberger has been able to extend the 
application of such a causality condition so as to apply 
to particles with mass. Goldberger’s derivation uses the 
formalism of quantum-field theory. The demand that 
waves do not propagate faster than the velocity of light 
is expressed by requiring that measurements of two 
observable quantitities made at space-like points should 
not interfere. Thus, the causality condition is imposed 
by setting the commutator of two Heisenberg operators 
for the pion field equal to zero when they are taken at 
space-like points.? 

The dispersion relation which is obtained thereby? 
enabled us to calculate the real part of the forward 
scattering amplitude from a knowledge of the total 


A j z33 83515 
SINZ ocg e SINZA ISN (exert) 
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®H. A. Kramers, Atti congr intern. fisici. como. 2, 545 (1927). 

7R. Kronig, J. Opt. Soc. Am. 12, 547 (1926). 5 

8 M. L. Goldberger, Phys. Rev. (to be published). 

9 Goldberger, Miyazawa, and Oehme, Phys. Rev. (to be pub- 
lished). See also Karplus and Ruderman, Fifth Annual ‘Rochester 
Conference (1955). . 
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scattering cross sections for negative as well as positive 
pions by protons. These data now extend up to 1.9 
Bevo" 

It is a striking result of this calcualtion” that for the 
isotopic spin $ state the real part of the forward scatter- 
ing amplitude changes sign abruptly at about 180 Mev. 
In Fig. 4 we have plotted 


2k 
——D3=sin2a3+2 sin2a33+sin2a31+ +- 
k 


as a function of pion energy (in the laboratory system). 
Here k, and k are the wave numbers of the pions in the 
center of mass and laboratory systems, respectively. 
The dispersion relations give the real part of the for- 
ward scattering amplitude (in the laboratory system) 
for the isotopic spin $ state D; positive and in good 
agreement with experimentally determined phase shifts 
below 180 Mev. Above 180 Mev, D becomes ab- 
ruptly negative in agreement with the solution recently 


10 Ashkin, Blaser, Feiner, Gorman, and Stein, Phys. Rev. 96» 
1104 (1954). 

NS. J. Lindenbaum and L. C. Yuan, Fifth Annual Rochester 
Conference (1955); Cool, Clark, and Piccioni, Fifth Annual 
Rochester Conference (1955). 

12 Anderson, Davidon and Kruse, Phys. Rev. (to be published). 


Dr. Bethe: 


Thank you Dr. Anderson. 

You have heard the story of thirty years of physics 
and of thirty years of research of Fermi on physics. 
You have heard the story of Fermi’s influence on 
physicists, how, by his example and by his knowledge, 
he inspired his collaborators and many generations of 
students. The main quality in Fermi was his simplicity, 
the simplicity of his person and the simplicity of his 
approach to the problems of physics. It was his ability 
to take a complicated problem, to take it apart, resolve 
it, and then having taken it apart, find the solution to 
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found at 189 Mev® in which a33=98.5°, w= —11.3° 
and a31= —11.6°. Alternative solutions with a33= 81.5°, 
as=-+11.3°, and a23=+11.6° or one with az= —44,5° 
give the wrong sign for D3. A fourth solution at 
189 Mev which is of the Yang type has a;=—11.6, 
33= 126°, and a31:= 236°. This is in agreement with 
causality but must be compared with the analogous 
solution at 165 Mev" in which as=—20°y a33= 34°, 
and a3:=94°. This seems unlikely in view of the large 
jump in phase shift which is called for. For the T=4 
state the experiments give Dı, close to zero, (Fig. 5) 
and so do the dispersion relations provided proper ac- 
count is taken of the bound state which is the neutron. 

This choice of phase shifts coincides with that taken 
by de Hoffmann, Metropolis, Alei, and Bethe" on less 
convincing evidence. 

Fermi had recognized that a good way to decide the 
choice of sign of the forward scattering amplitude was 
to study the Coulomb interference with it. In this way, 
one of Fermi’s students Jay Orear, established the 
positive sign of by such a study’at 113 Mev." Just the 
other day, another of his students, Horace Taft,” was 
able to demonstrate the change in sign of D by de- 
tecting an augmentation of the small angle scattering 
at 216 Mev due to Coulomb interference. 

Looking back now at all the wonderful experiences we 
had in this adventure with the pi mesons, with their 
pseudoscalar nature, their strong coupling, their short 
range interaction, their charge independence, and their 
causal behavior, I wish there were a better way to say 
“Thanks Enrico, for taking me along!” 


13 Anderson, Davidon, Glicksman, and Kruse, Phys. Rev. (to 
be published). 

“H, L. Anderson and M. Glicksman, Phys. Rev. (to be 
published). 

18 De Hoffmann, Metropolis, Alei, and Bethe, Phys. Rev. 95, 
1586 (1954). 

16 J. Orear, Phys. Rev. 96, 1417 (1954). 

1 H. Taft (to be published). 


the parts and find it elegantly. Many times, many of us 
had problems that we were too stupid to solve and we 
came to Fermi for advice. He was always ready to give 
it. We would tell him our problems, and, having listened 
attentively and carefully for a few minutes, he would 
relate our problem in a completely different form. Either 
he gave us the solution, or more often he had made the 
problem so clear that it was then possible for us to solve 
it, and at the same time have the fun of doing it our- 
selves. There is no other physicist like Fermi. His life 
was a full one. His work will live on and so will his 
spirit, but we shall always miss Enrico Fermi. 
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From Professor Fermi’s Notebooks | - 


HE photographs on the following pages are sample 

pages from one of Professor Fermi’s numerous 
notebooks which are being preserved in the Harper 
Memorial Library of the University of Chicago. The 
pages shown are the final ones of one of the books and 
include the beginning of the Index, which shows how 
carefully he organized his written material and also on 
what a wide range of subjects he was willing to make 
calculations. 

We see that he had studied a design for a double- 
focusing deflecting magnet as well as the more abstruse 
question of the pseudoscalar nature of the charged pions. 

The problem which he was investigating on May 16, 
1952, concerned the relative probability of two modes 
of decay of the 7’ meson: one mode being the predomi- 
nant one of the production of two gamma rays and the 
other the production of one gamma ray accompanied 
by an electron pair. Fermi had probably heard that this 
problem had already been worked out by Dalitz.! 


! R. H. Dalitz, Proc. Phys. Soc. (London) A64, 667 (1951). 


It was typical of him that he preferred to work out the 
solution for himself rather than read the paper in which 
this had been done. He needed only to know what the 
idea was. He could then usually work out his own solu- 
tion in less time than he could follow that given by 
someone else. 

He continues the problem into May 17, 1952, where 
he evaluates a complicated integral to calculate the 
total probability of pair production from an expression 
giving the angular distribution. It is furthermore typi- 
cal that when confronted with this integral he plunged 
ahead and evaluated it numerically rather than seeking 
elegant mathematical transformations which would 
have reduced it to elementary forms. 

Finally we see the answer—namely, that pairs should 
be produced in one out of 137 cases. The calculation has 
never been published which indicates again that prob- 
ably he knew that it had already been done by some 
one else. 

H. L. ANDERSON 
SaMuUEL K. ALLISON 
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Magnetic Hyperfine Structure Due to Rotation 


© 


in © Molecules* 


Roperr L. Wiitet 


Columbia University 


INTRODUCTION 


HE ground state of an overwhelming majority of 
gaseous molecules is the '}) electronic state, 
that is, a state in which the total electronic orbital and 
spin angular momenta are both separately zero. The 
net lack of angular momentum is caused by a pairing 
of electrons from the different atoms of the molecule 
in such a manner as to cancel the individual electronic 
angular momenta which are not in general zero. There 
exists a small number of molecules containing an odd 
number of electrons making total cancellation not 
possible, and a smaller number of molecules containing 
an even number of electrons in which cancellation does 
not occur. These kinds of molecules are discussed else- 
where, and will not be treated here. 

In a $, molecule the magnetic dipole-dipole inter- 
actions between any nuclear magnetic moment and the 
electronic spins of the molecule will sum to zero. The 
internuclear magnetic dipole interaction will be small, 
usually less than a kilocycle,! and in most instances too 
small for experimental detection. This type of nuclear 
spin-spin interaction is in any case fairly straight- 
forward and will not be considered here. If the total 
orbital angular momentum of the !)> molecule were 
rigorously zero, the magnetic field arising from elec- 
tronic angular momentum would also be expected to be 
zero, and no interaction of the nuclear magnetic mo- 
ments with this field could be observed. 

Actually, however, the electronic orbital angular 
momentum is zero only for the nonrotating molecule 
and departs slightly but significantly from zero for a 
rotating molecule. The associated magnetic field inter- 
acts with each nuclear magnetic moment to produce an 
energy which for certain common cases has the form 
c:(1;-J), where I; is the spin of the ith nucleus, J is the 
angular momentum of molecular rotation, and c; the 
magnetic coupling constant. This energy is of the order 
of a few kilocycles for most molecules, but may be over 
one hundred kilocycles for certain very light molecules. 
Such an interaction in 4}> molecules was first observed 
by Rabi eż al? “while investigating the hydrogen mole- 

* Work supported jointly by the Office of Naval Research, the 
U. S. Army Signal Corps, and the Air Force Research and De- 
velopment Command. 

t Present address: Hughes Research and Development Labora- 
tories, Culver City, California. 

1 Energy values, iv, where / is Planck’s Constant and » is the 
frequency in cycles per sec, are abbreviated in this article by giving 
only the number of cycles per sec. 


2 Kellogg, Rabi, Ramsey, and Zacharias, Phys. Rev. 56, 728 
(193  Peloge, Ramsey, Rabi, and Zacharias, Phys. Rev. 57, 677 
93 a 


0) 


cule using a molecular beam technique. A similar tech- 
nique was later extended to the alkali halides, detecting 
an I-J interaction in many instances.’ The first meas- 
urement of such a magnetic interaction by microwave 
absorption was made by Gilbert, Roberts, and Griswold‘ 
in CIF. Recently the spectra of a number of molecules 
have been sufficiently accurately measured by improved 
microwave techniques to reveal this magnetic inter- 
action. The molecules so investigated have been DI, 
NH, OCS, OCSe, HCN, CICN, CH;Cl, SiH3Cl, and 
GeH;Cl.° The purpose of this paper is to assemble the 
results already published, present those not previously 
published, and to give an integrated discussion of the 
available data. 


THEORY 


The theory of magnetic interactions in 1$, molecules 
has been treated in considerable detail for the case of the 
diatomic molecule, stimulated by the very accurate 
experimental data obtained by molecular beams meth- 
ods, particularly by Ramsey® on hydrogen. Henderson’ 
and Gunther-Mohr et al. have discussed a more general 
case in reference to the ammonia molecule, as have 
Townes and Schawlow’ in a general discussion of second- 
order interactions in gaseous molecules. Since the data 
to be interpreted in this paper embrace a variety of 
molecular types, it seems profitable to present here a 
unified and general description of the relevant quantities. 

If one assumes in the conventional manner that the 
nuclei of a molecule form a rigid frame, and separates 
out the coordinates of translational motion, one can 
write the Hamiltonian, exclusive of internal interac- 
tions other than monopole Coulombic attraction as 


IM ak 
H=; += 2, 2 UDA V (a,9,2), (1) 


lo 2 a 


where M, is the instantaneous angular momentum of 


3 The references of Tables IT and III and reference (d) of Table 
IV give the instances in which an I-J energy was detected by 
molecular beam techniques. 

A Gilbert, Roberts, and Griswold, Phys. Rev. 76, 1723 (1949). 
a on these molecules are given in Tables IV, V, 

EN. F. Ramsey, Phys. Rev. 90, 232 (1953); Phys. Rev. 87, 
1075 952; Phys. Revi b>) s ae Phys. Rev. 78, 699 (1950). 

. S. Henderson h s ; t : 
Rev. 74 62¢ eee ys. Rev. 74, 104 (1948); erratum Phys 
*Gunther-Mohr, White, Schawlow, Good, and Coles, Phys. 


Rev. 94, 1184 (1954); Gunther-Mohr, Townes, and Van Vleck, _ 


Phys. Rev. 94, 1191 (1954). 


°C. H. Townes and A. L. Schawlow, Microwave S eclroscopy of 
Gasses (McGraw-Hill Book Company, Inc., New York, 1955). 
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the nuclear ,frame referred to the principal axes of 
inertia g=a, b, c fixed in the molecule; A, is a principal 
moment of inertia of the nuclear frame; (P,); is the 
instantaneous linear momentum of the jth electron 
referred to the principal axis of inertia g; m is the elec- 
tronic mass, and V is the potential function for elec- 
tronic motion and involves only the relative coordinates 
of the particle. 

The total angular momentum, J, of the molecule 
exclusive of spin angular momenta is composed of two 
components, one nuclear and one electronic in origin. 
Thus, 


J=N+L, (2) 
[=x r;XP;. (3) 


where 


Here r; is the radius vector from the center of gravity 
of the molecule to the jth electron. Inserting (2) into 
(1), one obtains 


Dg eh eee 
H=} E-I "4 


7A, «@ A, 


EE (PHV ays). (4) 


1 


If the rotating molecule were in a true '!>° state, L, 
would be zero to all orders of approximation. However, 
measurements on molecular magnetic moments, di- 
electric susceptibilities, and in the present case internal 
magnetic interactions all indicate that L,, though 
small, is definitely nonzero. One is led to assume that 
the pure !> state exists only for the nonrotating mole- 


cule, and that for the rotating molecule, the term 


Wit 


=% (5) 


A, 

acts as a perturbation which causes excited electronic 
states of nonzero Lg to exist. Since L,«<J,, one sub- 
divides the Hamiltonian into an unperturbed Hamil- 
tonian H? and two perturbations H! and H? of descend- 
ing order of magnitude. Thus, 


iat | +E A (P)z+V2)| 


A, Imo i 


diojn IS 

+H-z=hhz l © 
g Ag g Ay 

Denoting the electronic state of the rotating molecule 
by a subscript, one has to a first approximation 


3 To! (h| L10) 
VI=W+D E ——_— 8 


— y. (7) 
A A 


where the summation in 7 is over the excited molecular 
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electronic states. Using this wave function one obtains 


(0|£,|0)'= (0| £10)? 


agp eoio 
n g Ena— Eo 
Te 
es ~-thigher-order terms (8) 
with eS 


(0| L,{0)°=0. (9) 


Consider now the energy of interaction of the mag- 
netic moment of the ith nucleus with the surrounding 
magnetic field. The magnetic moment of the nucleus 
is given by 


u:= gion, (10) 


where gz is the gyromagnetic ratio of the nucleus under 
consideration, yoy is the nuclear magneton, and I; the 
spin angular momentum. The field at the ith nucleus is 


qil vi XTi) 
Hen ae (11) 


wir, 


where q; is the algebraic charge on the jth particle of 
the molecule, v;, its velocity, and r;; the radius vector 
from the ith nucleus to this particle. For purposes of 
analysis we will divide the summation over j into two 
sections: the sum over other nuclei, producing Ha; and 
the sum over all electrons, producing H.. 

The energy of interaction with H will be 


qi 


Wy=—p-H=porg. > YY 
7 


& pf of ori} 


(r; ) S a A Ls r, 2 
x| a9 J. g~ g ae tahe (12) 


Ad! Ag. 


The expression for Wy is obtained by expanding the 
vector product (v;Xr;;) after expressing vz in terms of 
the molecular angular momenta and moments of inertia. 
For the general case this does not reduce. For a nucleus 
on the symmetry axis, z, of a symmetric top molecule — 
the expression simplifies to 


qi rier 
Wy=porgr 2 : [= 


i ori? A, 


Cie ree (rdre K” 
[EE], 
A, A, J (J+1). 


For a linear molecule 


qi 
Wy=—povgr © i, 


7 ijz 
In this expression J has been used in o 
is the quantity required. Since N, differs 
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Taste I. Hydrogen. 


WHITE 


gI (ejh? 
pilolestronls) pts 
SponponBgr m 
Be (ai On) (2e 
Cy (bare (elec- p a Exy-Es er A n  En—Eo 
of y i ons J g 
Moer Soit au Ci (Ke) nude) tRy gm cm! (mol) frame) (electrons) K; gm cm? 
Eo a BU —203 89.1 265x10  +0.8787 +1.0000 =0.1213 1.54X107 
+0.030 +0.0070 à 
HD b H — 87.00 —152 65.0 2.60X 10 +0.6601 +0.8333 —0.1732 152° eee 
£0.85 +0.0050 
D — 12.6 —23.3 10.7 2.77X 10°" 
+0,3 
D: a D —8445 — 15.58 (ks 2.77X10™ +0.4406 +-0.5000 — 0.0594 1.52 1035 
=+0.056 +0.0030 


a Harrick, Barnes, Bray, and Ramsey, Phys. Rev. 90, 260 (1953). 
db Kellogg, Ramsey, Rabi, and Zacharias, Phys. Rev. 57, 677 (1939). 
© Referred to center of charge of molecule (see text). 


only a few parts in a thousand for > molecules, the 
error introduced into the energy calculation by sub- 
stitution will usually be too small to be detected ex- 
perimentally. On the other hand, though the electrons 
contribute but little to the total mechanical angular 
momentum, the electronic gyromagnetic ratio is nearly 
two thousand times greater than that of nucleons, so 
the electronic participation in magnetic interactions 
associated with angular momentum is at least the same 
order as that of the nuclear frame. 

The energy of interaction of the nucleus with He is 


ViXTij 


e 
W.= —u-H.=-novgil- 2 
(A 7 ri; 
e P;X Tij 
=——pongni- 2 
mc A GR 


r;jXP; 
= Qponpongrli: 2 ) 
7 


ri; 


(15) 


where poz is the Bohr magneton. To reduce this expres- 
sion to a more useful and tractable form we must call 
in some detail upon knowledge of molecular structure. 
It is convenient to divide the electrons into two classes: 
(1) Tightly bound electrons in closed shells which may 
be assigned to a specific nucleus and (2) valence elec- 
trons which are not in closed shells and which may be 
associated with more than one nucleus. Closed-shell 
electrons belonging to nucleus 7 will cancel one another 
out in the summation over j since their total spatial 
distribution is spherically symmetric about the nucleus. 
Closed-shell electrons on another nucleus can possess 
no angular momentum about that nucleus and will 
have angular momentum relative to nucleus 7 due only 
to the motion of their center of mass. In the rotating 
molecule these shells may be likened to the chairs on a 
ferris wheel, preserving their orientation while the 
frame -rotates. Since the effective Z for closed inner 
e's is relatively high, such electron shells may to a 

= 
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first approximation be considered to have shrunk down 
onto their respective nuclei, decreasing the effective 
nuclear charge.” Their effect may be handled formally 
by omitting them from the summation in j of Eq. (15) 
but reducing the nuclear charge q;, in expressions (11) 
through (14) to an effective change q;’ equal to the 
nuclear change minus the number of inner shell elec- 
trons. We are left then with only the valence electrons 
to consider. 

We know that for a true ‘> molecule $; rojX P; 
=L=0, referred to the molecular center of mass. 
Though >>; (ro; P;/r0#) is not readily calculable, it 
might be expected that the cancellation characteristic 
of the numerator would prevail, and the sum be small 
for a 1$ molecule. Further, just as a small amplitude of 
excited electronic states of nonzero L provide the only 
nonzero contribution to >>; ro; P;, so also would these 
same excited states dominate J; (ro; P;/70;3). We 
have shown earlier that the molecular rotation serves 
to introduce excited electronic states, so making the 
additional assumption that only one electron at a time 
will enter an excited state, and that this electron will 
be a valence electron, we approximate the contribution 
of the valence electrons to (15) by 


rijxXP; 


2uosnovgrli- D = Quozpongr(1/r)w1i- L. (16) 
1 


Tij 


Where 2, j is restricted to the valence electrons, 
(1/17)a0 is averaged over the excited electronic wave 
function, and L is given by expression (8). Thus we have 


W e= 2nospongr{l/1) 2 2 2 
x (0| L,| n) (a| Lg |9)+ (O| LZ; | n) (n| L, | 0) 
E,—Eo 


JT 


g 


A more formal derivation of this result i i 
P rivat , starting with the 
equivalent of Eq. (15), is giren in Chap. 8, of reference 9. 


(17) 
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$ TaBe II. Alkali halides. Data from electron resonancé molecular beams. 


ci lelectror 
Spermo BiT j 
( 1 10! Lei mi” 
çi PEN o e 
pe oe ti ; (valence GC). a Ba ~= Eo ) 
Molecule Reference Nucleus (Ke) ear et aie (gm cm~) (gm om?) 
Tips a T 3730,3 -23 36 oxi" 18x0“ 
Ar A +0.5 
LiF” b FY 32.9:£0.1 phi 35.0 0.420% 10-48 187K 10-" 
iA Li? +2.2+0.6 -+0.9 £13 0.039% 10-1 
KE o c p» 0£10 0-10 0-4:0.65% 10-14 04:2.63% 10-4 
oie d ru 1143 —0.24 11.2 0.95% 10 * 3.85% 10-# 
RDF d p» 1444 0.23 14.2 1.20% 10-4 4.90% 10-4 
CsF¥ e FY 1642 0.2 16.2 


6.05% 10" 


a 
b 

° L. Grabner and V. Hughes, Phys. Rev. 79, 819 (1950). 
dy, Hughes and L. Grabner, Phys. Rev. 79, 314 (1950). 
e J. W. Trischka, Phys. Rev. 74, 718 (1948). 


For a nucleus on the axis of a symmetric top, expression 
(17) reduces to 


a a [| OlLZeln)|? 
We=Aponmongr(1/r*)s, 2 [A 
—— ; | K? 
A, A, J(J+1) 
I-J ' 
x . (18 
En— Eo ) 
For a linear molecule (17) becomes 
; | (0|L.|%)|? 
W .=4uosnorgr(1/r) >, —————I J. (19) 
n inma 10 


The total energy of magnetic interaction will be 
given by the sum of expressions (12) and (17) with the 
q; of (12) replaced by q,’, as previously noted. Attention 
is called to the fact that for a nucleus on the axis of a 
symmetric top the magnetic interaction has the form 
cl-J, where c is the sum of two terms of opposite sign, 
e.g., for the linear molecule 


W _ gf 
Pry ps e Nhe 
E Daez = = | 


“n — Lo i CP ij. 
(20) 


Hongr 
W= 


Ag 


Attention is also called to the similarity in form of 
the electronic contribution to the magnetic energy and 
the electronic contribution to the molecular g-factor. 
The latter may be written™ 


s * mog 
[gs (electronic) ],=—L,, (21) 
Hon 


„and L, can then be obtained from (8), and one has, 


uJ, R. Eshbach and M. W. P. Strandberg, Phys. Rev. 85, 24 
(1952). 


e.g. for the linear molecule, 


3 2uor | (0| Lz|n)|? 
gz (electronic) =—- Z —-—,_ (22) 
HonA, ^ En— Eo 


Since c; and g; are known, one therefore can calculate 
(1/P)«, a fact which will be used in later sections of this 
paper as a check on the assumptions to be made con- 
cerning (1/75),,. 


EXPERIMENTAL EVIDENCE 


As was indicated in the introduction, the experi- 
mental evidence on magnetic interaction in ')> mole- 
cules comes from two sources; molecular beam experi- 
ments and measurement of microwave absorption 
spectra. The data vary widely in quality, uncertainty 
in the constant in hydrogen being only about 0.03% as 
compared to 50% for some of the heavier molecules. All 
experimental information currently available is pre- 
sented in Tables I through VI. 

The tables contain a number of quantities ir? addition 
to the interaction constants themselves, and are not 
entirely self-explanatory. Results are presented in the 
order of ascending complexity of the molecule involved, 
starting with hydrogen and proceeding through the 
other diatomic molecules to linear polyatomic and 
finally symmetric top molecules. In each case the inter- 
action constant or constants are presented and the 
derived quantity 7 


c;(electronic)h? | (Oj L,|m)|? 


=(1/Pw E 23) 


SponHonBgr n— Lio 


listed, where c; is the interaction constant in cycles per 
second and B, the rotational constant, is equal to 
h(8m2 A). The equality sign holds only if certain ap- 
proximations given in the theory section and discussed 
further in the interpretation section below, are Valid. 
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Tapi: IIT. Alkali Kalides, Data from magnetic resonance molecular beams. 


Ci (measured) 
te) 


Ci (rigid 


frame) (Kc) Comments 


Molecule Nucleus Transition used and reference ALS 
Nal Na (J3/2) > (J 1/2) |~1| 


(reference a) 


For most of the molecules an approximation to (1/7°)s 
is also made, and the further approximate quantity 
En (| |L,|n)|?/Z.—Eo) listed. The quantity ci 
(electronic) is obtained from c; (total) by subtracting 
out c; (rigid frame) calculated for the nuclear frame 
plus closed-shell electrons. In illustration of the rigid- 
frame calculation, a net charge of +e would be con- 
sidered to reside at the Zi nucleus and a net charge of 
—e to reside at the F-nucleus. For some molecules the 


NaCl Na (J:3/2) (Je 1/2) |~1| +0.045 
(reference a) {0.37 t Ki i aj 
Data of reference a See text for comments on relerence b 
Interpreted in reference b 
(J&1/2) (JF 1/2) —0.16 
NaBr Na (J3/2) (J1/2) |0.67| +0.030 
(reference b) 3 cus i A 
(J1/2) (J#1/2) Fit of existing theory to observed line not 
(reference b) possible. 
Nal Na (J: 1/2) (JF 1/2) —0.23 +0.022 
(reference b) 
(J3/2) > (J1/2) 10.69] 
(reference c) 
i < Br® J+1/2)9 (J¥F1/2 -+1.21 —0.010 The eqQ reported in this reference is consider- 
i = Bri ‘ UEN b) p +1.30 ably different from that of Fabricand, Carlson, 
f and Lee.4 Latter detect considerable change in 
| egQ with v. 
3 
f RbF Rb* (J+3/2) (J1/2) [1.1|Æ0.1 
f (reference e) 
RbCl Rb** (J+3/2)>(J+1/2) |1.0] 0.1 
(reference e) 
LiF F» Data of reference c 120] —2.3 C;=32.9 Kc from electric resonance molecular 
Interpreted in reference f beams.£ 
CsF Fe Data of reference c |7.2| —0.2 C:=16 Ke from electric resonance molecular 
Interpreted in reference h beams." 
a H. J. Zeiger and D. I. Bolef, Phys. Rev. 85, 788 (1952). e D. I. Bolef and H. J. Zeiger, Phys. Rev. 85, 799 (1952). 
b R, E. Cote’ and P. Kusch, Phys. Rev. 90, 103 (1953). t R. Braunstein—private communication. 
e W. A. Nierenberg and N, F. Ramsey, Phys. Rev. 72, 1075 (1947). e R. Braunstein and J. W. Trischka, Phys. Rev. 90, 348 (1953). 
d Fabricand, Carlson, Lee and Ravi, Phys. Rev. 91, 1403 (1953). b J. W. Trischka, Phys. Rev. 74, 718 (1948). 


molecular g-factors are available. When this is the case 
the g-factor is given as well as the derived quantity 


gz(electronic)h? | (O|L,|7) |? 
i BE 


(24) 


? 


where m and M are the electron and proton masses, 


Taste IV. Diatomic molecules: DI, CIF, CS, TIC. 


ige: Burrus and W. Gordy, Phys. Rev. 92, 1437 (1953). 
Rak 


IL 
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ci (electronic) hi? 
SyonvpoBBgr 
o b ‘ Ci TS 5 Se z | @|Lol»)|* 
C: Gadna eS SANTEN Ep E RENTE 
Molecule Reference Nucleus (Ke) Kc) (Kc) (gm cm7!) (gm cm?) 
a ya +140 —1.03 141 3.88 10718 10-4 
b crs +2243 —0.8 22.8 8.00X 10718 S9 10 
Ci +1843 —0.8 18.8 8.15 10718 3.33X 107* 
c s3 +1915 —0.6 19.6 6.95X 10718 4.06X 10-* 
d T7 732 +0.6 72.4 25.7 X10718 6.35 10-* 
cs 1.40.1 —0.10 1.5 2.65X 10718 1.0810 


© R. C. Mockler and G. Bird, Phys. Rev. i 
d Carlson, Lee, and Fabrica eyes eaves sa tS 


ur 


Ar 


3e 


4) 


ee 
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TABLE V. Toci i 


Bpsnyos Bar 


E 


101 a z! Lain)|? (2 OlLafna 
Zn — Ea " EnEn fi - Butte 


Mole- Nu- Refer- Ci efer- e f| 

cule claus ence EKo : gm em” gm cm? Bi wees j {cid | elec | gm em? 
orcuse OF a -4041.5 270x10" 275X10) —0.025 b +0025 0.050 190X10" 
overs fe S? F C +2 +1 2.03X 10718 1.78x10} -0.002 
OMC4Se” Se d —3.2+1.0 8.25% 10-14 3.75% 10-8 —0.019 e +0.021 0.040 2.27K 1073 

-£0.002 

HCN“ Nu f aaa LOITA T seid eraiveriel alae! dedas ne nend h onne CA 
DCN“ N" f +8 43 1.64% 10 2.92X 10-8 
CHICNi Nu g +2.5-+0.8 3.12% 10-18 5.55X10-* No informa- 
CPECNM Cis h +3.01.0 2.78X 10718 1.70% 10-8 
ClS CNR )E i +3.5+0.6 3.34% 10718 2.04K 10-* tion available 

* Table XIII, 7 pri M Ba 

b J, R. Eshba ich and M, W. P. Strandberg, Phys. Rev. 85, 24 (1952). 
e Table VIII. 
d Table IX. 


© Hardy, Silvey, OE Burke, Strandberg, Parker, and Cohen, Phys. Rev. 92, 1532 (1953). 


t J, A. Klein and A. 
g Table X. 
b Table XI. 

Table XII, 


. Nethercot, Quarterly Report, CU Radiation L ab., 


respectively. The expression is exact, but approxima- 
tions are usually necessary in obtaining g, (electronic), 
the electronic contribution to the molecular g-factor, 
from the total molecular g-factor gy. Only the total 
g-factor, gy is experimentally observed. 

Table I presents excellent experimental data on 
hydrogen in its various isotopic forms. Tables II and III 
give the results on the alkali halides. The results of 
electric resonance molecular beam experiments and 
magnetic resonance molecular beams experiments are 
given separately since there is strong evidence, dis- 
cussed later, that values of c; so far obtained by the 
latter technique are not reliable in many instances. 
The magnetic interaction data on diatomic molecules 
other than hydrogen and the alkali halides are tabu- 
lated in Table IV. Since the microwave spectrum of 
CIF was originally measured some years ago it was 
deemed desirable to remeasure this spectrum using 
the more refined techniques now available. This was 
done and the remeasured spectrum from which the 
magnetic constant of Table IV was calculated is given 
in Table VII. No gy values are available for any of the 
diatomic molecules other than hydrogen. 

Table V represents the magnetic constants, all cal- 
culated from microwave absorption spectra, and the 
molecular g-factors when available for linear polyatomic 
molecules. The spectra of OYCS, HCN, and DCN have 
previously been published.” The spectra of the re- 
maining molecules were measured for the purposes of 
this paper on a high-resolution bridge-type spectrom- 
eter, and the resulting determinations are given in 


2 Geschwind, Gunther-Mohr, and Silvey, Phys. Rev. 85, 474 
(195 

; B SP. A. Klein and A. H. Nethercot, Columbia University Radia- 
tion Laboratory Progress Report, Octəber 30, 1953. 


October 30, 1953. 


Tables VITI through XII. Details of the spectrometer: 
and the techniques of measurement using it have been 
published elsewhere.’"® The interpretation of the O'7CS 
spectrum as originally published did not include con- 
sideration of the magnetic interaction. The improve- 
ment in fit of the theoretical spectrum to the ex- 
perimental spectrum upon inclusion of the magnetic 
interaction is detailed in Table XIII. 

A similar situation concerning sources of information 
prevails for the symmetric top molecules tabulated in 
Table VI. All the magnetic interaction constants are 
derived from microwave spectra measured on the afore- 
mentioned spectrometer. The results on NH; have 
been previously published? as part of an extensive treat- 
ment of the energy levels of that molecule. The micro- 
wave data from which the magnetic constants of the 
other molecules were obtained are given in Tables XIV 
through XVIII of the appendix. 


INTERPRETATION OF RESULTS 


For the linear molecule, of which the diatomic mole- 
cule is a special case, Eq. (20) of the theory section 
applies and we have subject to the genui 
made in derivation: 


ON, 0 Lal z j 
wae =| snlt/ > IOLA e z] (25) 
Az a Ea—Eo Í Crij 


For purposes of interpretation it is desirable to obtain 
En (| (0| L-|n)|?/En—Eo) explicitly to compare with 
the same quantity calculated from the molecular 


1S. Geschwind, thesis, Columbia Kovea 1951 Gani 


o see also S. Geschwind, Ann. N. Y. Acad. Sci. 55, 751 
(1952. . 
1 R, L. White, J. Chem. Phys. 23,,249 (1955). ` 
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Sere g-factor. Therefore some approximation for (1/1°),, for the 
el 9? excited electronic states is needed. If it is assumed that 
Sich Sik 5 sE the dominant state in the sum over excited states is the 
SISE ej™ g on r state, the lowest lying state of nonzero orbital angular 
5 Ta y SS momentum, one would expect the excited-state elec- 
= tron to behave essentially as a p-electron in the vicinity 
oe of a nucleus. We will therefore in many instances make A 
EE R7 the approximation : 
x 1 for excited ag 1 ( 1 ) i bee Ce 
= = meee co , 
EE Zs ( D molecule NA PS wl p-orbital 
o where V is the number of atoms in the molecule. The 
PE Aa z factor 1/N must be included to express the fact that 
om the molecular electron divides its time among several 
z & atoms. A convenient tabulation of (1/7*)a, for atomic | 
x È p-orbitals has recently been prepared by Smith and 
Barnes.!® The validity of the (1/7) assumptions may 
al? be tested by comparing Ln (| 0| Lz| 1) |?/En—LEo) cal- 
ils 3! culated from one nucleus with that calculated from other 
aN 2 nuclei in the same molecule, or with the identical quan- 
2 tity calculated from the molecular g-factor. Both 
a s z LIETE methods will be used in subsequent parts of this 
g 3|: SILL SRK section. 
a ej & LESSEE Only for hydrogen has a serious attempt been made 
E AG RS eS to correlate the magnetic interaction with other molecu- 
5 — lar properties. The magnetic coupling constants and 
2 zla molecular g-factors of the various isotopic species of 
E JS q Ra Se hydrogen have been measured with an accuracy orders 
E 3 RST; Seosee of magnitude better than for any other molecule, and 
E Sli a E XXAAAE the theoretical treatment pushed to a corresponding 
6 Hamy SSS Sais degree of refinement. The effort here will not be toward 
E DA further refinement of the interpretation. Since a com- 
; aaa parative treatment of a number of molecules is one of 
£ ge Cea) ee 2 the aims of this paper, hydrogen will suffer in that it 
BE|prowsa $ will be given a rather coarse-grained interpretation 
of z commensurate with that possible for other molecules. im 
` Š The most recent values of c; and g; for Ha, HD, and | 
=a 2 #2 D; are presented in Table I. Note that the c; are all | 
r Be SS S555 $5 negative,” indicating that the magnetic field due to the i 
ig NOISES “a rotating nuclei exceeds that of the electrons. This is 
28 true in only two instances of all those reported in this 
| T PES og paper; for hydrogen and for the protons in ammonia. ` 
| Sxi 11 a1 H 8a In both instances the nucleus involved is hydrogen, 
flee zE which because of its small Z is ineffective in producing 
; ae a large (1/73), for the valence electrons, and in both 
| Gs PSSS as RE instances the molecule involved exhibits large elec- 
| TARRA H 8 g g& tronic slippage. By this is meant that an appreciable 
f egg Meee 54 2 2% fraction of the total molecular electronic wave function 
f T $ $e is spherically symmetric about the molecular center of 
Sa oonan ec Ga ESS. hence does not rotate with the molecular fae 
E PA PREFE Si That such slippage occyrs in large measure for Hə an 
Bil eena Eeok222° NH; is evidenced by their large diamagnetic suscep- 
E OOGOOZ EE tibility.! When slippage occurs the electronic contribu- 
y TOPADI LT e 
f AAE EEA ES 18 R. G. Barnes and W. V. Smith, Phys. Rev. 93, 95 (1954). 
| F OO26 S EE 17 Ramsey gives c as positive, reflecting only the fact that he po 
: Ši Fe AOS OS OE has taken the Hamiltonian term to be —c(I-J) rather than 
| 10n00 +c(I-J) as is done here. 


a ij 
4 
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tion to magnetic fields associated with the molecular 
rotation would of course be expected to be reduced. 
Moreover, since the field produced by nuclei is propor- 
tional to the reciprocal of the internuclear separation 
the small internuclear distances in H; and NH, help 
enhance this field. ; 

Confining the discussion specifically to hydrogen now 
one finds closely related to the aforementioned proper- 
ties a concomitant small degree of excitation of elec- 
tronic states of nonzero orbital angular momentum. 
The quantity 3°, (|(0|Lz|n)|?/En— Eo) hereinafter 
abbreviated as £ „(L»n) is taken to be a measure of this 
property. Since good values of the molecular g-factor 
are available, it is possible in the case of hydrogen to 
calculate}, (Ln) without any assumption about 
(1/r*)x. As will be seen in subsequent comparisons the 
din (Ln) obtained here is exceptionally small. It should 
be remarked that in Table I, >, (Ln) for HD has 
been calculated relative to the center of charge, rather 
than about the center of mass, in order to render com- 
parison with Hs and Dy immediate. 

Knowing } n (Ln) from gy, and (1/15) Don (Ln) 
from the electronic contribution to the magnetic 
interaction, one can calculate (1/r*),, for the excited 
electronic states. The value is 


(1/3) = 1.72 10> cm7, 


or in a form convenient for comparison with Barnes 
and Smith!® 


((do/r)*) = 2.51, 
where dp is the Bohr radius. This is to be compared with 
{(do/7)*) a= 0.04 


TABLE VII. Observed spectra and calculated molecular 
parameters of CIF and CI*F.* 


Transition Observed transition frequencies 
df F CIV CIF 


0—1 3/2-3/2 30228.344+0.004 Mc 30 807.366-+0.003 Mc 
3/25/2 30 257.135=:0.003 Mc 30 843.875-+0.004 Mc 
3/2—1/2 30 280.056:+0.003 Mc 30 872.9630.004 Mc 


a (eqQ)cys7 = 114.977 0.025 Mc, (eqQ)cyss = —145.837 +0.028 Mc. 
Ci (Cl?) = +1843 Ke, and Ci(Cl*) = +22 +3 Ke. 


Taste VIII. Observed spectrum of OCS! compared 
with theoretical spectrum.* 


Frequencies (Mc) 


a S eo Observed Theoretical 
P=1/23/2 24.012.3380.002 24 012.330 
5/2=>5/2 24 012.939£0.003 24 012.960 
1/2—>1/2 24 019.618+0.002 24 019.617 
EA 24 020.249-+0.002 24 020.247 
3/23/2 24 025.446+0.002 24 020.444 
3/2>1/2 24 032.719£0.015 24 032.724 


a Constants used in calculating the theoretical spectrum: Bo =6004.905 
0.001 Mc, (eq) şu = —29.130+0.008 Mg, and Ci= 2x1 Ke. 


Taste IX. Observed spectrum of OCSe” compared to 
theoretical spectrum," 


Transition 4 Frequencies (Me) 
AS nt "3 Observed Theoretical 
pa /2 "7/2 24 153.201:+0.004 24 153.201 
T 24 159.928-+0.004 24 159.932 
11/2—»13/2 24 170.1790.003 24 170.177 
7/2—9/2 24 190.772+-0.004 24 190.776 
5/297 /2 24 234.352 -+-0.005 24 234.350 

a Constants used in cal ing the theoretical spectrum: Boe 4040.548 

+0.001 Me, (ed0)cers = 752.210 40.020 Me, and Ci = —3,2-£1.5 Ke. 


TABLE X. Comparison of observed spectrum of CCN" with 
theoretical spectrum, showing determination of interaction con- 
stants for N's 


Transition 


Frequencies (Me) 


Fr Observed Theoretical 
5/2 23 863.416-+0.002 23 863.416 
5/2 23 863.724 0.003 23 863.724 
3/2 23 863.959-+0.003 23 863.958 
7/2 23 864.138 = 0.003 23 864.138 
5/2 23 864.479 -4-0.003 23 864.478 
7/2 23 864.888+0.003 23 864.887 
3/2 23 865.015 0.003 23 865.015 


a Constants used in this calculation: v5 /7 5 /2 =23 864.232 Me, (e40) Nu 
= —3,620-+0.010 Mc, and Cn = +2.521.0 Ke. 


for a pure hydrogen 2p-orbital, indicating that cer- 
tainly for hydrogen and probably for other light ele- 
ments, representing the excited state as a p-electron is 
questionable. 

The experimental data available on the alkali halides 
is very much less satisfactory indeed than it is for 
hydrogen. The evidence is entirely from molecular 
beam experiments, but experiments using two different 
techniques, viz., electric resonance and magnetic 
resonance. A salient difference in the two techniques 
for our purposes is that the electric resonance method 
selects a state of specific J and v whereas the magnetic 
resonance method averages properties over, a Max- 
wellian distribution of J and v states. In several in- 
stances when the same molecule has been investigated 
by both techniques the results, specifically for c are 
at considerable variance. 

The full family of alkali flourides has been investi- 
gated by the electric resonance method. The results of 
these measurements and the derived quantities ob- 
tained therefrom may be found in Table II. The 
electric resonance method does not give,the sign of ci, 
only its absolute value. Since, however, the measured 
magnitude of c; greatly exceeds in each case the c; 
which would be produced by the rotating rigid frame, 
exclusive of valence electrons, and since according to 
the theory presented in the aforementioned, the rigid — 
frame c; defines the maximum negative value possible, 
it must be assumed that the measured coupling con- 
stants are all positive, indicating domination of the 
electronic contribution to the ¢; Upon obtajning 
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TABLE XI. Comparison-of observed spectrum of Cl*CN"™ with theoretical spectrum, 


showing determination of interaction constants for Cl." 


Theoretical relative 
, , intensity r 
(© intensities =100) 


Transition 
J =1—2 
Fi Fi sF Fi 


Observed 
transition frequency 


Frequencies (Mc) 
Observed group frequency» 
aan 


emo dy 


Theoretical 
group frequency 


1/2 3/2-3/2 1/2 


0.16 
1/2 3/2-3/2 5/2 4.16 vat 
1/2 1/2-»3/2 1/2 1.23 23 862.409-40.003 
1/2 3/2-3/2 3/2 1.23 23 862.464-+0.004 23 862.461 
1/2 1/2—3/2 3/2 1.54 23 862.487 0.003 
5/2 3/2-5/2 5/2 0.31 23 863.416+0.002 
5/2 7/25/2 5/2 0.32 23 863.7244-0.003 
5/2 3/2-5/2 3/2 1.68 23 863.959-0.003 
5/2 7/2-5/2 7/2 3.68 23 864.138-+0.003 23 864,232-L0.002 23 864.233 
5/2 5/2-5/2 5/2 2.35 23 864.479-+0.003 
5/2 5/2—5/2 7/2 0.32 23 BOTER EOON 
5/2 5/2—5/2 3/2 0.31 23 .015-£0.003 
5/2 3/2-3/2 1/2 biter z 
5/2 3/2—=3/2 3/2 0.18 23 878.795=0.004 
5/2 3/2-3/2 5/2 0.02 c 23 879.117 0.004 23 879.112 
5/2 7/2-3/2 5/2 0.44 23 878.930=+0.003 
Wa Sa a5 Nea 6 
5/2-3/2 5/2 0.53 
1/2 3/2-1/2 3/2 3.08 group unresolved 
1/2 3/2-1/2 1/2 2.46 lower half power point 
1/2 1/2—»1/2 3/2 one n 883.210 0.003 23 883.270=-0.006 23 833.265 
=> peak 
23 883.246+-0.002 
1/2 1/2->1/2 1/2 0.31 upper half power point 
23 883.294-+-0.003 
5/2 7/237/2 7/2 1,09 neo, A 
3/2 3/25 /2 5/2 clea) 23 884.197-+0.003 
3/2 3/2—5/2 3/2 1.12 23 884.7350.003 
3/2 5/2—=5/2 5/2 1.12 
23 884.891 0.002 3/2 6/2 = 23 885.055 0.004 2 5.05 
52 5/2-1/2 7/2 12.23 ¥3/2 6/ 5.05 3 885.058 
—7/2 9/2 16. 
He pal ap so 23 885.142 0.002 ¥s/2 7/2= 23 885.058+0.004 23 885.063 
3/2 25/2 7/2 9.33 23 885.298-+0.002 
3/2 1/25/2 3/2 3.48 23 885.988 £0,002 
5/2 5/2—7/2 5/2 1.08 23 886.210-0.002 
3/2 3/2-3/2 1/2 0.79 maxima at; 
minor 
3/2 3/2-3/2 5/2 0.86 23 899.3624-0.003 
Mp Saab a5 tee m33.899.421-4.0,003 
2 50 
3/2 5/2=3/2 3/2 0:36 23 900.090-:0.002 23 899,938+0.003 23 899,937 
2 1/2-3/2 1/2 f Y 
3/2 1/232 3/2 0.79 23 900-695:£0.002 
3/2 3/2-1/2 3/2 0.25 23 920.2. 
3/2 3/21/2 1/2 0.31 \ prea : 
We R T oa 23 920.912+-0.002 23 920.742 +0.003 23 920.742 
3/2 1/2=>1/2 1/2 0.25 23 921.520-+-0.004 n 


a Constants used. in calculating the theoretical spectrum: vo =23 883.275 -+0.003 Mc, (eq) Cj:s = —83.285 +0.020 Mc (e40) Nis = —3.620 +0.010 Mc, 
» D Ve O , 


Cci= +31.5 Ke, and Cx = -42.5 41.0 Ke. 


b By observed group frequency is meant the observed frequency with nitrogen-interaction subtracted out (see text). 


© Not measured. 


c; (electronic) from c; by subtracting out the readily 
calculable and usually small c; (rigid frame), one can 
calculate (1/r*)s Din (Ln). This latter quantity is 
observed to increase monotonically from LiF through 
CsF, j.e., with molecular size. One would expect 
(1/13), on (Ln) to increase roughly as r, where 7, is 


ae 


the equilibrium internuclear distance of the molecule, 
since Xn (Ln) contains in its denominator E ,—Eo, 
the spacing between ground and lowest excited energy 
levels. On a simple particle-in-a-box model one calcu- 
lates energy levels spaced as the inverse square of the 
linear dimensions of tha box. Though insufficient points 
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TABLE XII. Observed spectrum of CCN compared 
with theoretical spectrum.* » 


Comp. Transition Frequencies (Mc) 
No. si J =1—2 Observed Theoretical 
1 F=5/2-5/2 22 973.205+0.003 22 973.208 
7 1/2—1/2 22 992.235-+0.003 22 992.235 
S/a}  22994.0380.002 22 994.038 
4 : 3/2—3/2 23 008.896-+-0.003 23 008.899 
5 3/2—1/2 23 029.709+0.003 23 029.709 
a Interval 2—3, which is the most critical in detena ci aa je. 
ured to be 1.793 +0.002 Mc. 


v Constants used in calculating the theoretical spectrum: rs =22 992,242 
+0.005 Mc, (eqQ) Cis = ~83.280+0.020 Mc, and Cci = +3.5-20.6 Ke, 


are available to establish any curve unambiguously, 
the graph in Fig. 1 shows that >>,(Z,) does indeed 
increase at least as fast as 72 for the diatomic halides. 
In obtaining 2°, (Za) approximation (26) was made 
for (1/1) py. 

The zero value for ci reported for KF need not be 
considered an importantsexception in view of the large 
experimental error involved. Furthermore KCI" and 
KBr'* have also been investigated by the same tech- 
nique, and the I-J energy reported as too small to be 
detected. This is not surprising since these molecules 
have a larger moment of inertia than KF, and since 
the nuclei Cl and Br both have smaller gr than does F. 

The c; measured in a number of alkali halides by the 
magnetic resonance molecular beams method are listed 
in Table III. In the interests of completeness, all avail- 
able literature values have been included. Before an 
interpretation of these values is made, however, some 
inspection of their probable validity is in order. 

The difficulty in obtaining reliable values of c; from 
magnetic resonance molecular beams data has several 
sources. 

First, the resonance is observed in molecules having 
a Maxwellian distribution of excited rotational and 
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Fic. 1. Plot of 2,[| (0|L,|n)|?/£,—Eo] os r, for the alkali halides. 


vibrational states determined by the source oven tem- 
perature, A most probable J of 50 is, for instance, 
typical. In calculating the theoretical line shape of the 
resultant absorption, an average of the electric and 
magnetic interactions over the populated states must 
be made. This calculation is not simple and does not 
lead to strictly unambiguous determination of the 
interaction constants. In particular, when a nucleus has 
both an electric quadrupole interaction and a magnetic 
dipole interaction, the latter, being much smaller in 
13, molecules, will be poorly determined. Of all the 
atoms involved in Table III, only fluorine has no quad- 
rupole moment and is free from this trouble. Further- 
more, the assumption is made in averaging over excited 
rotational and vibrational states that the interaction 
constants are not functions of J or v. Fabricand, Carl- 
son, and Lee” detect a considerable change of electric 
quadrupole interaction energy, with v in KBr, and the 
same variation probably occurs to some extent in all 
the molecules observed. 

Second, the equilibrium interatomic spacing enters 


TABLE XIII. Observed spectrum of OCS compared with theoretical spectrum. r, 
Line Transition Frequencies (Mcj 
No. J=1-2 Observed: Theoretical (A) A Theoretical (B) å 
1 F=3/2—5/2 23 534.101+0.014 23 534.106 —0.005 23 534.102 —0.001 
2 7/237/2 23 534.164£0.012 23 534.159 +0.005 23 534.163 +0.001 
3 3/293 /2 23 534.308+0.012 23 534.295 +0.013 23 534.305 +0.003 
4 1/29/2 23 534.422 23 534.422 0 23 534.421 +0.001 
. 5/2—7/2 
5 Pa 23 534.481 +0.014 23 534.489 —0.008 23 534.493 —0.011 
—5/ 
e 
Intervals in Ke bc 
1-2 2-3 34 4—5 
Experimental 6349 1446 114-12 59-14 
Theoretical (A) 53 136 127 67 
Theoretical (B) 61 142 116 72 


Theoretical (A) =constants of reference a: egQ=—1.32 Mc and C;=0 Ke. 
Theoretical (B) =constants as follows: egQ=1.38 Mc and C;—4.0 Ke. 


. a Geschwind, Gunther-Mohr, and Silvey, Phys. Rev. 85, 474 (1952). 


18 Lee, Fabricand, Carlson, and Rabi, Phys. Rev. 91, 1395 (1953). i 


19 Fabricand, Carlson, Lee, and Rabi, Phys. Rev. 91, 1403 (1953). 
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Tante XIV. Observed spectrum of CHsCl compared 
with theoretical spectrum.* 


Isotopic ` Transition Frequencies (Mc) A 
species J=0-41 Observed Theoretical 
CH;CH F=3/2-1/2 26 191.004-+-0.002 26 191.002 
3/2—5/2 26 179.2194-0.002 26 179.220 
3/2—3/2 26 164.490+0.002 26 164.489 
CH,C1* 3/21/2 26 604.365::0.002 26 604.366 
3/2-5/2 26 589.421-0.002 26 589.420 
3/23/2 26 570.734+0.002 26 570.735 


a The following are constants used in calculating the theoretical spectra. 
For CH:CI": ro =26 176.273 40.002 Mc, (e7Q)ci= —58.921 +0.006 Mc, 
and Cci = +1.24:1.0 Ke. For CHCI; vo =26 585.679 4:0.008 Mc, (e40) ci 
= —74,740 4-0.006 Mc, and Cci = +1.441,.2 Ke. 


TABLE XV. Observed spectrum of SiH;CI*7 compared 
with theoretical spectrum." 


Frequencies (Mc) 


ee Disa Observed Theoretical 
1 F=1/2-3/2 26 041.590+0.005 26 041.588 
2 5/2—5/2 26 042.257=0.005 26 042.255 
3 1/2—1/2 26 049.414+0.010 26 049.414 
: Saal 26 050.091+0.010 26 050.091 
5 3/23/2 26 055.681=-0.007 26 055.682 
6 3/2—1/2 26 063.513+0.010 26 063.508 

Interval in Mc 
1—2 2-3 3—4 
Experimental 0.667-£0.002 13.424+0.004 0.677 0.004 
Theoretical 0.667 13.424 0.679 


a Constants used in calculating theoretical spectrum: vo =26 049.419 
0,010 Me, (egQ) ci = —31.323 40.010 Mc, Coi = +2.5 +1.0 Ke. 


explicitly into the calculation of c and the radius is 
in some cases uncertain by about 10%. 

Third, the presence of dimers in the beam might be 
expected to alter the absorption line.” Since the mag- 
netic interaction constants are inferred from line shapes, 
the determination of c; might therefore be strongly 
affected by dimerization. 

The aforementioned difficulties pertain to all mag- 
netic resonance molecular beam experiments quoted in 
Table III. In addition, the measurements of a low- 
frequency transition (J=-+3—J=+}) reported by Coté 
and Kusch? suffer from difficulties peculiar to it. A 
satisfactory quantitative theory of the low-frequency 
transition cannot be said to exist, and the values of ci 
from reference 21 might more properly be considered 
suggested values rather than measured values. Since 
these suggested values are in two instances consider- 
ably more negative than the negative lower bound set 
for c; (total) by c; (rigid frame), it is not likely that 
they are correct. 

In view of the foregoing set of vitiating factors, it 
seems unprofitable to attempt a detailed interpretation 
of the results quoted in Table III. Though some of the 
determinations are probably valid, sufficient doubt 


surrounds the c; values obtained from the magnetic 
— A 4 < 

Ochs, Coté, and Kusch, J. Chem. Phys. 21, 459 (1953). 

21 R. L. Coté and P. Kusch, Phys. Rev. 90, 103 (1953). 
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resonance molecular beams experimenis to make 
detailed analysis of them unprofitable. 

There remain four diatomic molecules other than 
hydrogen and the alkali halides for which c; has been 
measured by either electric resonance molecular beam 
or microwave techniques. One of them, TICI, is chemi- 
cally very like an alkali halide. For this molecule c; 
has been measured for both nuclei, and >>, (Zn) can be 
calculated from either using the approximation for 
(1/r*)%, discussed earlier in this section. As one sees 
from Table IV, Xn (Zn) as calculated from the Tl 
interaction is nearly six times as large as that calcu- 
lated from the Cl interaction. Furthermore 5°, (Ln) 
as calculated from the Cl data fits well the variation 
in Ð» (Ln) with re as calculated from the fluorine 
interaction in the alkali fluorides and plotted in Fig. 1, 
whereas >, (Ln) as calculated from the TI data would 
be far off scale. The conclusion is that (1/r°)s depends 
strongly on the chemical environment of the nucleus 
in question. The approximation (26) is useful for com- 
paring the magnetic interaction of different nuclei in 
similar chemical circumstances, but may generate dis- 
crepancies of as large as a factor of five if used to com- 


TABLE XVI. Observed intervals in spectrum of SiD;Cl?* 
compared with theoretical intervals." 


Frequency (Mc) 


Interval Measured Theoretical 
(1/2-3/2) (5/2—5/2) 16.969 +0.004 16.969 
(5/2-5/2) (3/2—3/2) 0.84720.0012 0.8472 


a Constants used in calculating theoretical intervals: (¢qQ)c1 =39.595 
+0.010 Mc and Cci = +41.0 +0.7 Ke. 


TaBLeE XVII. Observed spectrum of GeH;Cl** compared 
with theoretical spectrum.* 


Frequencies (Mc) 


Transition 
J=2-3 Observed Theoretical 
F= patel 25 988.908 0.002 25 988.908 
—. 

123/2 25 997.749+0.002 25 997.749 

5/2—7/2 E 
7/2-39/2 26 000.658 0.002 26 000.658 
5/2—5/2 26 006.145 0.002 26 006.145 


a Constants used in calculating the theoretical spectrum: vo =26 000.099 
+£0.002 Mc, (egQ) cl = —47.013 +0.004 Mc, and Ci =0.5 +0.7 Ke. 


TABLE XVIII. Comparison of observed spectrum and theoretical 
spectrum for CICN?§ in the excited bending vibration state v= 1. 
All lines listed belong to the lower frequency /-doublet group.* 


Transition 5 
K=1 Frequencies (Mc) 


J=1-2 Observed Theoretical 
F=3/2-5/2 23 026.264+-0.003 23 026.268 
3/23 /2 23 033.745-+0.002 23 033.744 
5/27/2 23 046.962+0.003 23 046.963 
1/2-3/2 23 052.177=0.004 23 052.173 
1/2—1/2 23 062.643=-0.002 23 062.644 


a Constants used in calculating theoretical spectrum: vo =23 041.917 
0.003 Mc, Eeo E482 2:30 Kc, (eqQ)ci=—82.825=0.015 Mc, 
qı =6.966 +0.004 Mc, (Czz)c1= +3.5 £0.6 Ke, and (Cz:)01= F8 +5 Ke. 
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pare nuclei in distinctly different chemical situations. 
In the present instance we have no molec ular g-factor 
from which to calculate Xn (L,,) more direc tly. For the 
linear polyatomic molecules to which we will proceed 
very shortly, both ci and gy are available in some 
instances, and more insight into the validity of approxi- 
mation (26) will be gained. 

Both C$ and CIF are covalent molecules of similar 
dimensions. One would expect the (1/13), approxima- 
tion to have about the same validity in each case, and 
the resultant >>, (Ln) to be nearly equal, as is indeed 
the case. The Xn (L,) for DI is considerably smaller 
than for the other molecules of Table IV, but even so is 
surprisingly large. The total molecular electronic wave 
function closely resembles that of an iodine ion, that 
is to say, may be represented by spherically symmetric 
shells about the iodine nucleus. Since the molecular 
center of mass nearly coincides with the iodine nucleus, 
the electronic wave function should be little perturbed 
by molecular rotation. Large electron slippage and small 
Din (Ln) would be expected to result. Possibly the 
(1/r*)a, approximation is poor for very heavy atoms 
(cf. TI in TICI), leading to a spuriously large ©, (La). 
It is interesting to note that DBr,? for which a simi- 
larly sized interaction might be expected, has been 
investigated by virtually identical techniques and no 
magnetic interaction detected. 

Two groups of related linear triatomic molecules 
have been investigated as can be seen from inspection 
of Table V. For OCS and OCSe both c; and gy are 
available; furthermore c; has been measured at more 
than one nucleus in the molecules OCS and CICN. 
Looking first at the carbonyls one sees that >, (La) is 
larger for the heavier molecules, both as calculated from 
ci and from gy. The oxygen and the sulfur of OCS are 
in similar chemical situations, so we may use approxi- 
mation (26) to reduce (1/75), on (Ln) to Don (Ln) for 
comparison. In this instance we find >>, (La) as de- 
rived from the two interactions nearly the same, in fact 
equal within the fairly large experimental error. More- 
over the values of >>, (Ln) calculated from the two c; 
agree well with the values calculated from gy, indicat- 
ing that here the approximation to (1/75), has con- 
siderable validity. Again for OCS there is reasonably 
good agreement between )/, (Zn) as calculated from 
gy and as calculated from c; using approximation (26). 

We find in the data on the various isotopic forms of 
HCN and CICN continued manifestation of the trends 
which we have seen to characterize the interaction. 
Nitrogen occupies a similar chemical position in HCN 
and CICN and >, (Ln) calculated from its magnetic 
interaction nearly doubles in ‘going from the light to 
the heavy molecule; chlorine and nitrogen occupy 
chemically different sites, and >”, (Z,) calculated from 
the two c; of the same molecule using approximation 


(26) differ by a factor of over two. 


2 W. Gordy and C. A. Burrus, Phys. Rev. 93, 419 (1954). 
d CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 2 
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A linear polyatomic molecule ina bending vibrational 
mode may be treated to a good approximation as a very 
prolate slightly asymmetri¢ top. As such it is capable of 
acquiring rotational angular momentum parallel to the 
symmetry axis, and the nuclear magnetic moments 
will couple to this angular momentum. The theoretical 
expression for the interaction energy for a nucleus on 
the axis of a symmetric top molecule is the sum of 
expressions (12) and (17) of the theory section, and 
may be written 


W=[Coct (Ca Cz) Oy GENERI Jos (27) 
J(J-+1) 
where 
_ Hon gr '(O\L,\n)* 
Czz= [hontis ye, peers > - 
‘As n Za — Eo 
R — i)e 
meee | 
HEG (ri)? 
and 
_ Bawge KOVAT 
C::= Spon 1 /T)u ‘a =-= S 
A, n E,—Eo 


qi (r) — (ri Ja 
DE seta is | (29) 


If A; is small, as is the case for the bent linear molecule, 
one might expect a large c: Actually the measured 
cz: for CICN in the bending vibrational mode proved 
to be quite small, leading to a very small value of 
En |(0|L.|)|?/E,—E». The inference is that the 
amplitude of bending is so small that the nuclear frame 
acquires angular momentum about the z-axis without 
very much affecting the electronic wave functions. 
Ammonia, on the other hand, offers an instance 
where A, and A, are nearly equal. We are fortunate in 
this case in having good experimental data both on the 
magnetic interaction and on the molecular g-factor. 
The agreement in $n (Lan) as calculated from the two 
sources is good. Furthermore $» | (0) L.|n)|*/E,—Eo 
is predicted to be slightly larger than X» | (0| L./m)/?/ 
E,—Eo by both sources, and the magnitudes in each 
instance are in reasonably good agreement, indicating 
that here again the approximation to (1/r) is justified 
and more than just a proportionality factor. The ¢oup- 
ling constants of the off-axial protons have also been 
measured,’ and as pointed out previously at least one 
component of the magnetic field at their site is domi- 
nated by the nuclear frame rather than the electrons. 
However, both since the interpretation of the off-axial 
interaction is complicated and since the constants are 
not very precisely known, no analysis of them further 
than the comments already made in the hydrogen sec- 
tion will be attempted. Ammonia also-affords a case 
where the functional form of fhe magnetic interaction 
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has been tested over a range of values of J and K from 
1 to 6. Good agreement with expression (27) was 
obtained. 3 

An intermediate type of symmetric top molecule, one 
with 4, considerably smaller than + but not as much 
smaller as is the case for a bent linear molecule, is pro- 
vided by methyl chloride, chloro-silane, or chloro- 
germane. Unfortunately only cz: has been determined 
for these molecules, and the experimental error, es- 
pecially for chloro-germane is relatively large. It can 
be seen, however, that the approximate magnitude of 
Xn (La) is the same as it has been for other molecules 
of similar dimension, and that the progression toward 
larger Xn (Ln) with increasing molecular size also 
appears to occur. 


SUMMARY AND CONCLUSIONS 


A number of conclusions can be drawn from the fore- 
going experimental evidence. 


1. The theory of magnetic interactions in 13) mole- 
cules, as summarized in the theory section of this 
paper, is consonant with all existing experimental 
evidence on magnetic coupling constants except that 
obtained from measurement of line shapes in magnetic 
resonance molecular beam experiments. However, 
there appears to be convincing evidence that the values 
of ci so far obtained from these line shapes are unre- 
liable. Eliminating the suspect data and introducing 
additional data obtained specifically for the purposes of 
this paper allows a considerable degree of order to be 
recognized in magnetic interactions of the type discussed. 

2. For all nuclei in > molecules except hydrogen the 
contribution of the valence electrons to the magnetic 
field at any given nucleus exceeds considerably that due 
to the rotating rigid frame composed of the nuclei 
plus firmly bound closed-shell electrons. This result is 
undoubtedly a consequence of the fact that the valence 
electrons may penetrate near to the nucleus in question 
and thussproduce a large effective (1/7°),,. 

3. The magnetic interaction constants of a molecule 
can be approximately predicted if the molecular param- 
eters >>, |(0|LZ,|)|?/E,—Eo and (1/r5)s, for the va- 
lence electrons are known. Also, X n | (0|L,| 1) |/En—Eo 
can be calculated from the molecular g-factor if known, 
or can be estimated in many cases by interpolation or 
extrapolation from the data on known similar molecules 
when, gy is not available. For a given series of struc- 
turally similar molecules, this quantity increases with 
molecular size. An approximation to (1/75), has been 

presented and discussed. The available evidence indi- 
cates that for medium weight atoms, 6<Z<30, the 
approximation will not be in error by more than a 
factor of two for any chemical environment. A more 
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detailed application of the theory than has been pre- 
sented, or a more thorough understanding of the ap- 
proximations allowable in applying it, must await 
the arrival of a considerable quantity of more precise 
experimental data on this type of interaction. 


ACKNOWLEDGMENTS 


The author wishes to express his gratitude to Pro- 
fessor Charles H. Townes for the continued guidance 
and assistance he has given during the foregoing pro- 
gram of investigation and interpretation. The author is 
also indebted to Dr. Gene A. Silvey for the preparation 
of the several compounds whose spectra were measured 
for the above study, and for advice in the handling of 
them. He is further indebted to Paul Fletcher for as- 
sistance in the calculations and upon occasion with the 
measurements. 


APPENDIX 


The microwave spectra presented in this appendix 
(Tables XIV-XVIII) were all measured or remeasured 
in order to determine the nuclear magnetic interaction 
with the fields due to molecular rotation. All measure- 
ments except those on CIF were performed on a high- 
resolution, bridge-type, K-band spectrometer. The 
energy expression from which the theoretical spectrum 
was calculated for molecules containing only one nu- 
cleus possessing an electric quadrupole moment was”.?4 


W=B(J)(J+1 ees 
OUDH«O| = | 
[Pe n 
2(2J-+3)(2J—1)I(2IT—1) 


J+ W (mag), 


where 
C=F(F4+1)—I(I+1)—J(J+1) 
F=J+I, J+I-1---|J—1|. 


Here (egQ) is the electric quadrupole coupling as de- 
fined by Townes and Bardeen,?® and W (mag) is the 
expression for the magnetic energy appropriate to the 
molecular type under consideration obtained from the 
theory section of this paper. For.Cl3®CN™, in which two 
nuclei have electric quadrupole moments, the theory 
of Townes and Bardeen? derived for this case was 
used. The relatively complex nature of the theory 
prevents its being presented here, and the reader is 
referred for such information to the article cited. 


3H. B. G. Casimir, On the Interaction between Atomic Nuclei 
and Electrons (DeErven F. Bohn, Haarlem, 1936). 

% J. H. Van Vleck, Phys. Rev. 71, 468 (1947). 

2 C. H. Townes and J. Bardeen, Phys. Rev. 73, 97 (1948). 
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INTRODUCTION 
E recent years there have been many attempts 
(44K, 46B, 46G, 46K, 47B1, 47B2, 48B1, 48B2, 
49B1, 49B2, 49B3, 49K1, 49K2, 52K, 52R, 52S2)! 
to justify the use of statistical mechanics in describing 
physical systems. As far as classical statistical mechan- 


* Part of this paper was written véhile the author was on leave 
of absence at Purdue University and he would like to express his 
thanks to Dr. Lark-Horovitz for his hospitality. He would also 
like to express his thanks to Dr. N. G. van Kampen for some help- 
ful criticism. 

1 References are placed at the end of the paper in chronological 


‘order. Each reference number consists of the year of publication 


and the first letter of the first author's surname; e.g., M. J. Klein, 
Phys. Rey. 87, 111 (1952) is referredsto as 52K. 


ics, that is, statistical mechanics before the advent of 
quantum mechanics, is concerned, the Ehrenfests 
(11E2) in their well-known survey article in the Enzy- 
hlopaedie der mathematischen Wissenschaften have dis- 
cussed the foundations of statistical mechanics and, as 
we shall see, their conclusions are still valid in more or 
less the same form as stated by them.2 However, no 
such account of the state of the foundations of statis- 
tical mechanics including quantum mechanics is avail- 
able, and it is the purpose of the present paper to at- 
tempt to give such an account. The present author has 
discussed in a recently published monograph (54H1)} 
some of the problems to be covered here, but, on the 
one hand, since that monograph was written a number 
of new developments have arisen, and, on the other 
hand, the emphasis in the present paper will be more on 
the present status of the foundations of statistical 
mechanics that on the historical development of these 
foundations as was the case in the earlier discussion. 
The present discussion is thus to a large extent com- 
plementary to the discussion in ESM. This does, of 
course, not mean that now we are to neglect completely 
the historical aspects of the development of statistical 
mechanics. 

One might argue that the proof of the pudding is in 
the eating, and that the fact that statistical mechanics 
has been able to predict accurately and successfully the 
behavior of physical systems under equilibrium condi- 
tions—and even under certain circumstances in non- 
equilibrium conditions‘—should be a sufficient justifica- 
tion for the methods used. This is especially the case 
nowadays since the introduction of quantum mechanics 
has cleared up such seemingly unsurmountable difficul- 
ties as the paradox of the specific heats or Gibbs’ 
paradox. However, it is only logical that many physi- 
cists have not felt satisfied by this argument and have 
therefore attempted to show that the statistical formal- 
ism follows in a straightforward manner from classical 
or quantum mechanics. In the present survey we shall 
discuss in how far these authors have succeeded. Even 
if success has been limited, these attempts have been 
extremely important and very helpful in,showing both 
the possibilities and the limitations of the statistical 
approach. 


? R, Kurth (55K) has also considered the foundations of classical 
thermostatistics from a more axiomatic point of view. I should 
like to express to him my thanks for showing me his paper before 
publication. 

3 We shall use, as far as feasible, the notation of this publi¢ation, 
and we refer to it as ESM. 

‘ For a discussion of the statistical mechanics of nonequilibrium 

processes we refer to a recent review article by Montroll and 
(Seek (55M) and its bibliography. e 
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Before we can start the discussion of the various 
papers on the fundamental ideas of statistical mechan- 
ics, we must remind ourselves that we shall mainly be 
concerned with the following two questions. 


(A) Why is it possible to describe the behavior of 
almost all physical systems by considering only equi- 
librium situations? 


(B) Why is it possible to describe the behavior of an 
actual physical system by considering a large number 
of identical systems (in ensemble theory) and identifying 
the average behavior of this group of systems with the 
behavior of the physical system in which we are 
interested? 

In the following discussion it will become clear that 
these two questions are intimately related. The first 
one arises not only in statistical mechanics, but also 
in thermodynamics and kinetic theory, and a related 
problem is the question, 


(A1) How can one define an equilibrium situation? 

The second question is of a truly statistical nature, 
and is related to the question, 

(B1) How can we construct an ensemble such that it 
will “represent” an actual, given physical system? 

This question of representative ensembles which will 
play an important role in our discussion was especially 
stressed by Tolman (38T, 40T). 


At this point we must introduce a definition of 
statistical mechanics, and we shall do this by using the 
definition given by Kramers in his presidential address 
to the International Conference on Statistical Me- 
chanics at Florence in 1949 (49K3): “For a physicist 
nowadays Statistical Mechanics is just that branch of 
physics which deals with the atomistic interpretation 
of the thermal properties of matter and radiation; for 
this reason one might also call it ‘thermostatistics’.” 

In our discussion we shall not be concerned explicitly 
with radiation, but restrict ourselves to systems con- 
taining particles. However, in discussing transitions 
from one state to another, the existence of a radiation 
field, which in many cases will be the agent enabling 
the transitions to take place, will tacitly be assumed. 

Statistical mechanics has its origin in the kinetic 
theory of gases which was developed in the nineteenth 
century. It is probably justifiable to call Krénig the 
father of the kinetic theory, even though more than a 
century earlier Bernoulli (38B) had related the proper- 
ties of a gas to the properties of individual particles. 
Kronig (56K) draws attention to the fact that although 
the orbit of any atom in a gas will be so irregular that 

it is impossible to follow the atom, one can use the 
theory of probability to reduce this completely chaotic 
behavior to the ordered behavior of a gas. The kinetic 
theory of gases was further developed by Clausius 
(57C15 57C2, 58C, 62C, 70C1, 70C2), Maxwell (60M, 
67M, 68M1, 68M2), and Boltzmann (68B, 72B, 96B, 
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98B). In his Gastheorie Boltzmann was especially 
concerned with question (A) and he had introduced in 
1872 (72B) his famous -theorem in order to show that 
any nonequilibrium situation would develop in such 
a way that it would approach an equilibrium situa- 
tion. The answer to question (A1) is in this case that the 
equilibrium situation is the most probable situation, 
that is, the most probable situation compatible with a 
few restricting conditions. In its original unrestricted 
form the H-theorem, if correct, proved that any system 
will tend toward equilibrium, if equilibrium does not 
initially exist.© This would mean that provided one 
waited a sufficiently long time one would find the system 
in an equilibrium situation and, moreover, this equi- 
librium situation would persist for ever after. From this 
it would follow that equilibrium would be the rule and 
nonequilibrium the exception so that question (A) is 
answered. 

However, it was soon realized that the H-theorem in 
its original unrestricted form was not an absolute proof, 
but was based on certain assumptions about the number 
of collisions which a given particle will undergo during 
a specified time interval. Moreover, Loschmidt (76L, 
77L) and Zermelo (96Z) showed conclusively that the 
H-theorem in its original form could not be true. In 
his later papers Boltzmann was therefore careful to 
stress the statistical aspects of the H-theorem. This 
means that the H-theorem is a statement about the 
most probable behavior of a system. Fluctuations 
around the equilibrium situation are, however, no 
longer prohibited. The statistical aspects of the 
H-theorem were especially stressed by the Ehrenfests 
(07E, 11E2) both in general and by considering simple 
models.” 

In the first part of the present article we shall discuss 
this stage of the discussion of the foundations of sta- 
tistical mechanics. In Sec. A(1) we shall discuss the 
unrestricted H-theorem. The arguments of Loschmidt 
and Zermelo against the H-theorem in its original form 
will be discussed in Sec. A(2), while we discuss in 
Sec. A(3) the statistical aspects of the H-theorem. 

Apart from introducing the H-theorem to prove (A) 
Boltzmann also tried to prove that the average behavior 
of a system is the same as its equilibrium behavior. 
The contents of this statement are that the time aver- 
age taken over an infinite period of any phase function, 
that is, a function depending on the values of the coordi- 
nates and velocities which completely determine the 
physical situation, or phase, of the system under con- 
sideration, should be equal to the value of this phase 
function at equilibrium. It can easily be seen that this 


5 We may refer here to the Bibliographical notes at the end of 
chapters I and II and at the end of Appendix I of ESM for more 
historical details. 

“We exclude clearly hypothetical systems such as completely 
perfect gases within idealized walls. In such a system there would 


be no mechanism to alter the distribution function, and a non- 


equilibrium situation would persist. 
‘The importance of the statistical nature of the H-theorem is 
not properly taken into account by Sartre (54S) in a recent note. 
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second way of attacking (A) is equivalent to the first 
one. Firstly, if the H-theorem is correct, it follows that 
any nonequilibrium situation will develop into an 
equilibrium situation which will then persist. In taking 
a time average of a phase function over an infinite 
period we thus obtain the value of this phase function 
at equilibrium. Secondly, if the time average is equal 
to the valut at equilibrium, it means that the system 
must be in an equilibrium state during most of the time 
and hence return to equilibrium from any nonequilib- 
rium state.® In order to prove that average behavior and 
behavior at equilibrium are the same, it is necessary to 
calculate time averages and one then meets with serious 
difficulties. Boltzmann tried to get over these difficul- 
ties by assuming that most physical systems are ergodic.’ 
An ergodic system is one which has a representative 
point in I’ space" which passes through every point of 
the energy surface corresponding to the energy of the 
system. The evaluation of the time average is in this 
case the same as taking the average over the energy 
surface, that is, taking the average over a collection of 
systems which possess all the same energy. This par- 
ticular collection of systems, or ensemble, to use the 
term coined by Gibbs, is a so-called microcanonical 
ensemble. Since according to our assumptions the 
representative point of any system in the ensemble will 
pass through every point on the energy surface, the 
orbits described by the representative points of the 
systems in the ensemble will all be identical, the only 
difference between the systems being the exact moment 
at which a point of the orbit is passed." It follows then 
that the time average which is the average taken over 
the one and only orbit on the energy surface where 
different points are passed at different times will give 
the same result as the average taken over the micro- 
canonical ensemble which is the average taken over the 
same orbit, but now considering all points at the same 
time. 

In considering averages over the energy surface, or 
over a microcanonical ensemble, Boltzmann departed 
from kinetic theory and really and truly entered the 
realm of statistical mechanics, if we use Gibbs’ defini- 
tion of statistical mechanics in the preface to his famous 

8 This equivalence remains if one substitutes the H-theorem in 
its statistical form for its unrestricted form as can easily be seen. 

3 Ergodic from the Greek ergon (=work, used here in the 
sense of energy) and 4odos (=path): the representative point in 
T space passes through all points of the energy surface. The term 
ergodic was first introduced by Boltzmann in 1887 (87B). The 


assumption of ergodicity is called by Maxwell the assumption 
of the continuity of path. i 

0 A system with S degrees of freedom can be described by S 
(generalized) coordinates and S (generalized) momenta. The 
values of these 2S quantities at a gsven time will define a point, 
the so-called representative point, in a 2S-dimensional space, 
called r space (I for gas), or phase space. A 2S—1-dimensional 
hypersurface in T space, defined by the equation «=constant, 
where e is the energy of the system, is called an energy surface. 

11 We use here the fact that the direction of the orbit at any 
-point of T space is uniquely defined from the Hamiltonian equa- 
tions of motion. From this it follows too that an orbit can never 
return to a point in T space through which it has passed—unless 
we are dealing with a strictly periodic orbit. 
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monograph (02G).2 It must, however, be noticed that 
ensemble theory enters, so to speak, by the back door, 
as it is only introduced as a mathematical trick to 
calculate the behavior of one isolated system. 

Around the turn of the century with the develop- 
ment of measure theory it was realized that ergodic 
systems would never occur, but it was hoped that most 
physical systems would be quasi ergodic. A quasi- 
ergodic system is one which has an orbit in I space 
which covers the energy surface everywhere densely, 
even though not actually passing through every point 
of it. It was also hoped that quasi ergodicity would be 
sufficient to ensure the equality of time averages and 
averages taken over a suitably chosen ensemble. This 
was, for instance, the point of view taken by the 
Ehrenfests (11E2). In 1913 Rosenthal (13R) and 
Plancherel (13P) independently proved the impossi- 
bility of ergodic systems, while in 1923 Fermi (23F1) 
showed that a certain class of systems was quasi 
ergodic—without, however, proving the equality of 
time averages and ensemble averages." This equality 
had been proved by Rosenthal (14R2), but his proof 
was not completely rigorous.'4 More recently this 
problem of the equivalence of time and ensemble 
averages has been studied mainly by mathematicians. 
It is usually called the ergodic theorem. In 1931 and 1932 
Birkhoff (31B1, 31B2, 32B) and von Neumann (32N1, 
32N2) have shown that, provided certain plausible 
mathematical conditions were fulfilled, the two kinds of 
averages would give the same result, while Oxtoby and 
Ulam (410) recently showed that a very wide class of 
systems would satisfy these mathematical conditions. 
The ergodic theorem will be discussed in Sec. B. In 
Sec. B(1) we shall discuss the developments up to 
about 1930, while modern developments are discussed 
in Sec. B(2). 

The approach via the ergodic theorem is still con- 


12 We may at this point perhaps advise every student of sta- 
tistical mechanics to read and reread carefully this preface, as it 
expresses more clearly than anywhere else the basic ideas of 
rational thermodynamics, as Gibbs calls it. It is in this preface 
also that the term statistical mechanics is coined. In view of the 
difficulties encountered by kinetic theory at the time that Gibbs’ 
monograph appeared (paradox of specific heats, the under- 
standing of Planck’s radiation law, ---) Gibbs tried to build 
up a rational system based on a few axioms and not necessarily 
connected with natural phenomena. As Gibbs put it, “It is 
well known that while theory would assign to the gas (of di- 
atomic molecules) six degrees of freedom per molecule, in our 
experiments on specific heat we cannot account for more than five. 
Certainly, one is building on an insecure foundation, who rests 
his work on hypotheses concerning the const#ution of matter. 
Difficulties of this kind have detérred the author from attempting 
to explain the mysteries of nature, and have forced him to 
contented with the more modest aim of deducing some of the more 
obvious propositions relating to the statistical branch of - 
ics. Here, there can be no mistake in regard to the agreement of 
the hypotheses with the facts of nature, for nothing is assumed in 
that respect. The only error into which one can fall, is the want of 
agreement between the premises and the conclusions, and this, 
with care, one may hope, in the main, to avoid.” 

13 We shall see later on, however, that this equality holds 
indeed for all quasi-ergodic systems. 3 . y 

1 In his proof a double trarfsition to the limit is taken, s Nig 
pointed out by Rosenthal himself té Epstein (see 36E, p. 478). 
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sidered by many physicists to be the most satisfactory, 
if not the only one, and the one providing the best 
justification for the use of, statistical methods. Tt is 
felt by these people that mechanics can only deal with 
isolated systems and that the statistical approach is a 
necessary mathematical device without any further 
physical interest. It seems to the present author that 
this point of view fails to take into account the physical 
reason for a statistical approach.!® The reason is that 
the enormously large number of particles in most physi- 
cal systems will make it impossible for us to determine 
the values of all the integrals of motion. In general,we 
shall only determine the energy, the total linear mo- 
mentum, and perhaps one or two more. We then must 
use this very inadequate knowledge about the system to 
predict its future behavior. Since our knowledge is 
clearly insufficient to predict the future of the system 
with complete certainty, we must have recourse to 
statistical methods and this is the point where repre- 
sentative ensembles enter.'® Instead of considering one 
system, we consider a large collection of systems all 
possessing the same values of those quantities of which 
we know the value, but otherwise differing widely. In 
order to construct such a representative ensemble 
satisfactorily, we must know what weight to give to the 
various systems in the ensemble, or, in other words, 
we must make assumptions about a priori probabilities. 
Once the need for representative ensembles is recog- 
nized and accepted, question (B) is solved in ensemble 
theory by showing that the great majority of the sys- 
tems in the ensembles considered behave in practically 
the same way and, moreover, give just those values for 
phase functions which one should expect from a system 
at equilibrium.” However, question (B1) remains to 
be answered. One always finds that a canonical en- 
semble is the representative ensemble of a system in 
temperature equilibrium. In order to prove this one 
introduces a generalized H-theorem and shows that, if 
the representative ensemble at one moment is not a 
canonical ensemble, the situation at a later moment 
will be such that we are forced to use a representative 
ensemble which resembles much more closely a canonical 
ensemble. This question of the approach to equilibrium 
described by ensembles and the question of the repre- 


_ 1 We might also add that completely isolated systems are of no 
interest to a physicist, since it is impossible to perform experi- 
ments-with or on them. Of course, as long as we are dealing with 
classical systems; we can always consider idealized experiments 
and thus still talk about isofited systems. Moreover, Pauli 
(49P) has pointed out that also in the case of quantum-mechanical 
systems the influence of the act of observation on the system can 
be neglected, as long as we are dealing with statistical mechanics, 
although it is of paramount importance in the case of quantum 
mechanics. 

16 See Secs. 24, 86, 112, 117, and 120 of Tolman’s monograph 
(38T) and part B of ESM for a discussion of representative en- 
sembles; compare also the discussion in Secs. C2 and D3 of the 
present paper. 

17 The- proof of this statement can be found in any text- 
book on statistiCal mechanics dealing with ensemble theory 
(02G; 38T, 54H1). a 
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sentative ensembles will be discussed in part C, as far 
as classical ensemble theory is concerned. 

The transition from classical to quantum statistics 
does not introduce any fundamental changes. As a 
matter of fact, one could develop the two cases, that is, 
the classical and the quantum-mechanical case, at the 
same time to emphasize the close similarity. There are, 
however, certain differences and for that reason we have 
preferred to discuss the two cases separately. In the 
quantum-mechanical case one can again approach the 
problem either via ensemble theory or via an ergodic 
theorem. We shall discuss these two alternative methods 
of approach in parts D and E, respectively. 

In concluding this Introduction, I should like to 
make a few remarks concerning some general aspects of 
the second law of thermodynamics and its place in 
statistical mechanics.'§ The fact that in all practical 
applications of thermodynamics the second law holds, 
that is, the entropy increases, is due to the combination 
of two effects. First of all, if the entropy at a certain 
moment is smaller than its equilibrium value, the prob- 
ability that it will increase is overwhelmingly larger 
than that it will decrease. Secondly, our observations 
are always made in such a way that starting from a 
given situation we watch the future development, but 
it is impossible for us to start from a given situation 
and watch the preceding time interval. The fact that 
we can do the one, but not the other is related to the 
fact that we have a memory of the past and we can 
thus possess knowledge of what happened at an earlier 
time, but not of what will happen at a later moment. 
To this extent the irreversibility of the second law of 
thermodynamics is physiological and basic. 

Another point is that, as far as one can judge at this 
moment, all observations regarding phenomena in the 
universe are reconcilable with the idea that the universe 
as a whole is developing from some reasonable, though 
thermodynamically or statistically unlikely, state in 
the distant past. This may be thought to be connected 
with cosmogonical ideas involving a more or less singular 
beginning at, say, three to five billion years ago. On 
the other hand, it can also be argued, as was, for in- 
stance, done by Boltzmann (95B), that the fact that 
our world seems to be developing from a less probable 
to a more probable state is not necessarily in contradic- 
tion to the idea that the universe as a whole is in thermo- 
dynamic equilibrium. To quote Boltzmann: “We 
assume that the whole universe is, and rests for ever, 
in thermal equilibrium. The probability that one (only 
one) part of the universe is in a certain state is the 
smaller the further this state is from thermal equi- 
librium but this probability is greater, the greater is 
the universe itself. If we assume the universe great 
enough, we can make the probability of one relatively 
small part being in any given state (however far from 
the state of equilibrium) as great as we please. We can- 


18] should like to express my thanks to Professor R. E. 
Peierls for some clarifying ‘and critical remarks on these points. 
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also make the probability great that, though the whole 
universe is in thermal equilibrium, our world is in its 
present state. It may be said that the world is so far 
from thermal equilibrium that we cannot imagine the 
improbability of such a state. But can we imagine on 
the other side how small a part of the whole universe 
this world is? Assuming the universe great enough, the 
probability,that such a small part of it as our world 
should be in its present state is no longer small. 

If this assumption were correct, one would return 
more and more to thermal equilibrium; but because the 
whole universe is so great, it might be probable that 
at some future time some other world might deviate 
as far from equilibrium as our world does at present. 
Then the aforementioned H-curve!® would form a repre- 
sentation of what takes place in the universe. The 
summits of the curve would represent the worlds where 
visible motion and life exists.” 

In giving this quotation we are not subscribing to 
Boltamann’s point of view, only mentioning it. In 
judging this point of view on its merits one must in- 
vestigate the difficult problem in how far it is possible 
to talk about thermal equilibrium in an open system 
such as our universe. 


A. THE H-THEOREM IN CLASSICAL STATISTICAL 
MECHANICS 


(1) The Kinetic Aspect of the H-Theorem; 
Stosszahlansatz 


In 1872 Boltzmann (72B) introduced his famous 
H-theorem in order to prove that any nonequilibrium 
distribution will tend to an equilibrium distribution. 
Let us briefly summarize his reasoning. He was con- 
cerned with the case of an isolated system, and in 
particular with the case of a monatomic gas and no 
external forces, although he later extended his considera- 
tions to the case of polyatomic gases under the influence 
of external forces (75B). 

We are interested to compute the rate of change due 
to collisions of the distribution function f, which in the 
present simple case is a function of the three (Cartesian) 
components «u, v, and w of the velocity c of an atom, 
and of the time ¢ only. We must therefore consider, on 
the one hand, the rate at which atoms with velocities 
between c and c+de change their velocities, and, on 
the other hand, consider the rate at which atoms with 
velocities within the selected range are produced 
through collisions. In formula 


(Of/dt)dudvdwdt= —A+B. (A1.01) 
In Eq. (A1.01) fdudvdw is the namber of atoms per unit 
volume which have velocities with components in the 
intervals (u, u+du), (v, v-t+dv), and (w, w+dw), A is 
the number of atoms per unit volume with velocities 
inside the selected range which during a time interval 


19 This is essentially a curve giving the entropy as a function of 
time; compare Sec. A(3). ° 
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dt change their velocities, and B is the number of 
atoms per unit volume which during a time interval 
dt change their velocities in such a way that after colli- 
sions their velocities lie inside the prescribed ‘range. It 
can be shown (see, e.g., ESM Sec. I.4) that A is given 
by the equation 


A= f(u,v,w)dudredwdt 
X f Herswi)duadouts fado. (A1.02) 


In Eq. (A1.02) dw is an element of solid angle around 
the so-called line of centers w which is the vector from 
the one colliding atom to the other at the moment of 
impact, a is a quantity which depends only on the ab- 
solute magnitude of the relative velocity cre, and on 
the angle @ between the relative velocity vector and w, 
and the integration is over all values of m, o1,and wy, 
and over all possible directions of the line of centers. 
Equation (A1.02) is obtained by multiplying the num- 
ber of atoms with velocities within the specified range 
(f(u,t,w)dudidw) with the number of atoms which 
collide with one atom in a time interval dt. Consider 
atoms having a velocity between cı, with components 
1t, 21, and w, and ¢;-+de;. Provided we may assume that 
there is no correlation between velocities and positions of 
different atoms, the number of collisions between these 
atoms and one particular atom in the case where the 
line of centers lies within a solid angle dw will be equal 
to it: af (u,v; w )dudvdwdidw, where the quantity a is 
a normalizing constant depending on crei and 9. Equa- 
tion (A1.02) now follows by integration over all possible 
values of u, 21, and w, and over all possible directions 
of o. 
Similarly, we have for B the equation 


/ 


Baas f du'dv'dw' f(W v w) 
1 ° 
dn'de/dany fan's!) | a'do'. (A1.03) 


The integral signs are primed to indicate that we now 
may integrate only over those values of ¢’ and cy and 
those directions of œ’ which are such that the velocities 
after the collision, which are completely determined 
by c’, cy’, and w’, satisfy the condition that one of them 
lies between c and c+de. Expressing the primed quanti- 
ties in terms of the unprimed ones we can write instead 
of Eq. (A1.03) (see, e.g., ESM Sec. I.4) 


B=dudidwdt | duidridw f(a’ ,v’ wo’) 
x finan) fado, (A1.04) 


. 4 
where u’, 0’, w’, 21’, 01, and w'sare functions of u, 2; w, 
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u, W, and w, through the equations expressing the 
conservation of linear momentum and energy. 

Combining Eqs. (A1.01), (A1.02), and (A1.04) we 
get for the rate of change Of the distribution function 
due to collisions, 


af i 
==- f Uh- f'f"adindextiode, (A1.05) 
ot 


where we have used the short-hand notation 


f=f(ur,w), =S), f= fvw), 


fr’ = fln). (A1.06) 
Introducing a quantity H by the equation” 
H= fi In fdudvdw, (A1.07) 


we get for the rate of change of H from Eq. (A1.05) 


aH 
== S UR- 
dl 
Kadudrvdwdujdvdwydw. (A1.08) 


By observing the equivalence of c, ci, c’, cy’ in Eq. 
(A1.08), one can write this equation in the form (see, 
e.g., ESM Sec. I.5) 


dH 1 yp! , yp 
aif Uh SMUSS fr) 


Xadudedwdiujdijdwidw. (A1.09) 
Since 
(p—9) In(p/q)>0, if pq, and =O, 
if p=q, (A1.10) 


and since a is a positive quantity, we have from 
Eq. (A1.09) 
dH/dt<0, (A1.11) 


where the equality in Eq. (A1.11) only holds, if for any 
two velocities c and c, we have ffi= ff’ fi’. 

It follows thus from Eq. (A1.11) that, provided our 
basic assumptions are correct, H will steadily decrease 
until it reaches a minimum value which is reached as 
soon as the distribution function satisfies the relation 


fh=f'f’ (A1.12) 


for any pair of velocities c and cı. The minimum or 
equilibrium value of H is in most cases reached rapidly, 
if H differs appreciably from this value. The relaxation 
time in the case of a gas at room temperature and one 
atmosphere of pressure is, e.g., of the order of 10-7 sec 
(ESM p. 390). 

One can derive a formula for the equilibrium distribu- 
tion function either by using Eq. (A1.12) or by taking 
as the equilibrium condition that for equilibrium H is 
minimum, in both cases under the restricting conditions 


20Tn his first, paper on the #7-theorem Boltzmann used E 
(entropy) for H. 
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of given total number of particles, given total energy, 
and given total linear momentum. The result is the so- 
called generalized Maxwell distribution. 

Unfortunately the case of a real gas is not easy to 
treat in detail and, although it is possible to derive the 
H-theorem, it is not easy to discuss critically the various 
assumptions made in arriving at Eq. (A1.11) and the 
consequences of the removal of these restricting condi- 
tions. However, it is possible to introduce simplified 
models (11E2, 53H, 54G, 55G, 55H2) and we shall 
especially consider a model which is essentially the same 
as the model used by Lorentz (09L) in his discussion 
of the electron theory of metals. In this model we have 
two kinds of particles. The first kind (the lattice points, 
or metallic ions) are fixed in space, and we shall assume 
them to be randomly distributed in space with a density 
of n per unit volume. The second kind (the electrons) 
are traveling through the lattice and their density is M 
per unit volume. We neglect electron-electron en- 
counters and, furthermore, assume that the electron- 
lattice collisions are elastic aad isotropic. This means 
(a) that on such a collision the electron will not change 
the absolute magnitude of its velocity, and (b) that if 
(0,9; 6’,¢’) is the probability that an electron traveling 
in the direction determined by the polar angles @ and ¢ 
changes to a direction characterized by 6’ and ¢’, 
(0,0; 6',¢’) will be independent of 6’ and g’. Since 
the absolute magnitude of the velocity does not change 
during the collision, we can simplify our model by 
assuming that all electrons are moving with the same 
speed c. Finally, we shall denote by o the cross section 
for an electron-lattice collision. 

In order to consider the approach to equilibrium, we 
introduce a distribution function. Let /{(0,~)dw/4r 
(dw=sinédédy=element of solid angle) be the number 
of electrons per unit volume with velocities in direc- 
tions within the solid angle dw. We have then the 
normalization condition 


f 16,0)d0=40N, (A1.13) 
where the integration extends over the unit sphere. 

According to our assumption about the model we 
have for the number of electrons per unit volume, 
No, «dldwdw’, which change their directions from within 
an element of solid angle dw to within a solid angle do’ 
during a time interval dt: 


No, wdidodo'= f(6,)nco (dw/4mr) (dw'/4r)dt, (A1.14) 


where we have used the assumption of isotropic scat- 
tering. 

Using Eqs. (A1.13) and (A1.14), we can easily com- 
pute the rate of change of f(@,¢), and we get 


1s(0,¢)/dt=al f ORE seo] 


‘=a[N—f], (41.15) 
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with 
3 
a=ncs. 


(A1.16) 
From Eq. (A1.15), we get 


{@e)=N-Ac*, A=4 (6,2)=N—fi.0(6,¢), (A1.17) 


that is, an exponential approach to the equilibrium 
value, if at /=0 a nonequilibrium situation were 
present. ” 

Once again, as in Eq. (A1.11) we see a uniform ap- 
proach to the equilibrium situation. However, once 
again we have introduced a basic assumption in writing 
down Eq. (A1.14), and again this assumption is one 
regarding the number of collisions, or a so-called Stoss- 
zahlansalz. As collisions play the dominant role in our 
discussion, we call this method of treating the problem 
the kinetical treatment. We shall show, however, in 
the next section that the unrestricted Stosszahlansalz 
leads to contradictions and that we therefore rather 
should base our discussion on the so-called statistical 
treatment, which pays due regard to possible 
fluctuations. á 


(2) Reversibility and Recurrence Paradox 


Let us restate the H-theorem in a slightly different 
way. We are considering systems made up of a large 
number of identical particles and we have discussed the 
distribution function, and its rate of change, corre- 
sponding to such a system. However, a different way of 
looking at the behavior of the system is by depicting 
its situation, or phase, at each moment by a representa- 
tive point in I space (see ESM p. 100 for a definition 
of T space). Each situation which, on the one hand, 
corresponds to a given distribution will, on the other 
hand, correspond to one point in T space. We have thus 
a one-to-one correspondence between, on the one hand, 
points in T space, and, on the other hand, values of the 
distribution function. Consider now a series of instants 
++, bi, fo, +++, tn ++. The phase of our system at ¢, will 
correspond to a point Pa in T space and the orbit 
described by the representative point of the system 
will pass through the sequence of points 


++) Py, Po, -+*, Prt, Pa, Pnti, ***. (AXON 


As each point corresponds to a distribution function, 
we can evaluate the value of H corresponding to each 
point. Lét H,, be the value of H in the situation corre- 
sponding to P,. If we accept the H-theorem in its un- 
restricted form, it then follows that the following 
inequalities should hold 


"> M> H> 2 Hna2 Hn ii (A2.02) 


where the equal signs only hold, if equilibrium has been 
reached. 


21 This is, of course, only true as long as we restrict ourselves to 


the discussion of the relatively unimportant specific phases. If 
we use generic phases to each distribution corresponds a set of 
points in T [compare Sec. A (3)]. « 
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In the general case H is given instead of by Eq. 
(A1.07) by the equation 


H= f f In fdes, y (A203) 


where the integration extends over the whole of y space 
(ESM p. 30), where f is now a function of the s gen- 
eralized coordinates q and the s generalized momenta 
p+ (s=number of degrees of freedom of one particle), 
and where dw is a volume element in y space, 


dw= [I dpidgy. 


k=l 


(A2.04) 


Consider now two situations corresponding to the 
representative points P; and P,’ which differ only in 
that all the p; have the same absolute values, but with 
opposite signs, while the q+ in the two situations are the 
same. Since the Hamiltonian 5(p,,9;) of the system is 
a homogeneous quadratic polynomial in the ps it is 
invariant against the transformation qi—qx’, pi>— px’, 
which is the transformation from P; to P,’. We see then 
from the canonical equations of motion, 


Ge=IR/OP:, Pr= — âI dqr, (A2.05) 


that the transformation from P; to P; corresponds to a 
reversal of the time axis. If we now consider a system 
going through the sequence (A2.01), then, on trans- 
formation, we obtain a system which goes through the 
sequence 


o, Pl nity Pa) Pra, °°; Pe, Pi, 2 (A206) 


From the definition of H, Eq. (A2.03), it follows 
easily that 


H;=Hi. (A2.07) 


This can be seen by performing the transformation 
under the integral sign and then changing to new 
variables, 


H= S POA) nf (Pade 
= f f(b.) mfi pado 


= Siwan In fp" jdu" = H, (A2.08) 


where pe’ = — Pry ae = qk. 

From Eqs. (A2.07) and (A2.02), it follows that the 
sequence (A2.06) corresponds to a sequence of H-values 
which satisfy the inequalities 


r- CH ner SHa HW nk -+ SHS Mg ---. (A209) 


We see thus that for each system which shows a steady 
decrease of H we can conStruct a system for which H 
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is steadily increasing. This fact constitutes the so- 
called Umkehreinwand or reversibility paradox of 
Loschmidt’s (76L, 77L). 

We can easily see the difficulty by considering the 
simple model introduced at the end of Sec. A(1). By 
reversing the direction of time we get a steady depar- 
ture from the equilibrium distribution as can be seen 
from Eq. (A1.17). 

Another difficulty was pointed out by Zermelo 
(906Z) who used a theorem of Poincaré’s (90P; see also 
97B). Poincaré had shown that if a system enclosed 
in a finite volume passes through the sequence (A2.01) 
from f; to fn41, say, this sequence will be repeated as 
accurately as we wish it to be repeated after a finite 
time interval. More specifically, for any finite length 
element As in I’ space we can find a time interval T such 
that the sequence at +7, HT, +, lnHT, tna tT 
will be given by the points 


AMY Riis Py!” Sa IPA Ploy neg (A2.10) 


where 


|P;— Py" | <As. (A2.11) 


The time interval T can be extremely long. For the 
case of 10'§ atoms within 107° mê moving with an aver- 
age velocity of 5.10° m sec™, Boltzmann (96B, see also 
43C) estimated that it would take more than 10!” years 
to reproduce the positions within 10A and the velocities 
within 104 m sec™. 

If As is chosen sufficiently small, we have for the 
H-values corresponding to the sequence (A2.10) 


Hi! =H, (A2.12) 


and hence we have the following sequence at 41, t2, ++: 
tn, tty HT, ++, te FT: 


+++) Hr, He, EHT ob -, Hy", H", T og Jet", 000, 
(A2.13) 


Using Eqs. (A2.02) and (A2.12) we see that in going 
from /,, to hT we have an increase in H. This paradox 
is the so-called Wiederkehreinwand or recurrence paradox. 

In the next section we shall show how statistical 
considerations can clarify the situation. However, we 
wish to emphasize at this moment that in so far as the 
reversibility and recurrence paradoxes invalidate the 
H-theorem in its unrestricted form, that is, the state- 
ment that -H can never decrease, they show that the 
Stosszahlansaiz cannot be true under all circumstances. 
We shall illusjrate this by considering two simple cases. 

The first example is drawn from the consideration of 
the simple model of the previous section. Consider an 


? 


22 We refer to the literature (96Z, 43C, ESM p. 341, which 
Jatter proof is not very rigorous) for a proof of Poincaré’s theorem. 
The proof follows essentially from the fact that a finite region in 
T space will sweep through the available part of phase space, 
which is finite in the case of a system enclosed inside a volume, 
within a finite period. 

23 For the relationship between Poincaré’s theorem and Birk- 
hoff’s ergodic theorem we refer toa monograph by Wintner (41W, 
pp. 90-91). à 
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arbitrary time interval AZ during the evọlution of the 
system and denote by /(0,%) the distribution function 
at the beginning of this time interval. Denote further 
once again by Nu, wAtdodo’ the number of collisions 
leading during this time interval to the charige of a 
velocity within the solid angle dw to a velocity within 
the solid angle dw’. Consider now the corresponding 
evolution with the time direction reversed end consider 
the corresponding time interval. The distribution func- 
tion at the beginning of the time interval in this second 
case may be denoted by f’(0,¢). If N'w, Aldwdw’ is the 
number of collisions leading from w’ to w within A of 
the reversed evolution, we have 


(A2.14) 


IN" ot, o= No, w 


for any pair of directions œ and a’. From Eq. (A1.14) 
it then follows that, if the Slosszahlansalz were valid for 
both evolutions, we would have 


[@e=f0¢'); (A2.15) 
independent of 6, ¢, 6’, and ¢”, or 
f{(,¢)=constant. (A2.16) 


We see thus that ihe Stosszahlansalz cannot be valid for 
al least one of the two evolutions, unless we are dealing 
with the equilibrium situation. 

The second example is the following one.4 Consider a 
stream of point particles moving all in the same direc- 
tion, say, the +a«-direction and being scattered by a 
system of hard spheres which are randomly distributed 
over a plane perpendicular to the x-axis. The result of 
the scattering will be that the point particles after their 
encounter with the plane of spheres will form a system 
of particles the velocities of which are isotropically dis- 
tributed. Consider now the evolution of the system 
obtained from the previous system by reversing the 
direction of the velocities of all the particles. Clearly 
after a while all the particles will be moving in the 
— y-direction, that is, the system will proceed from a 
disordered condition to a highly ordered pattern. The 
reason for this can easily be seen when we consider the 
collisions of the particles of the second system with the 
spheres. Although the system seemed to be completely 
random, all collisions with the spheres occur on that 
half of the spheres which is turned to the — x-axis and 
clearly the collisions are not taking place in a random 
manner. For this second system we see thus that the 
Stosszahlansatz would certainly not be satisfied. 


(3) Statistical Aspects of the H-Theorem 


We mentioned before that the nature of the situations 
studied by statistical méchanics is such that we are led 
to statistical considerations. The first man to use a 
truly statistical approach was Boltzmann (77B) and. 
at that point kinetic theory changed into statistical 


4 
% I am greatly indebted to Dr. H. M iding me 
a e o Dr. H. M. James for providing m! 
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| mechanics evon though it was another twenty odd years 
before Gibbs coined the expression. . 

| In his statistical considerations Boltzmann was con- 

| cerned first of all with the answer to question (A1), 

/ especially for the case of an isolated system, and he sug- 

gested two possible definitions of an equilibrium 

situation. 


» 
| (a) The equilibrium situation is that one which is the 
most probable one for a given energy. 
(b) The equilibrium situation is the average situation 
in which the system will find itself during an infinite 
| time interval. 


Let us first in this section consider the approach via 
the most probable situation. In order to prove that the 
| most probable situation corresponds to the equilibrium 
situation characterized in the case of a system of inde- 
pendent particles, such as Boltzmann was considering, 
by the Maxwell-Boltzmann distribution we proceed 
as follows. d 
Consider an isolated system of V independent par- 
ticles. Each particle will have a representative point 
Q% in u space, where k numbers the particles in the 
system (k=1, ---, N). The phase of the whole system, 
and thus its representative point in I space, is then 
determined by the .V points Q%®, ---, Q0. We now 
divide u space into very small but finite cells of equal 
volume w, which we number consecutively, w1, #2, ***, 
wi +++. This division is one of the fundamental steps in 
the argument. The size of w is such that, on the one hand, 
the dimensions of a cell are small compared to the 
smallest macroscopically measurable dimensions, but, 
on the other hand, the number of representative points 
Q) contained in each of them is large.” 
Let now N; be the number of representative points in 
the ith cell. The situation Z** is then completely de- 
> scribed by giving the ;. The description of the situa- 
tion by the W; instead of by the actual position of the 
Q% corresponds to our experimental limitations, since 
the size of the cells has been chosen such that we cannot 
distinguish by experimental means between different 
points within one cell. 
The relation between the representative point of the 
system in I’ space and the situation Z described by the 
N; is as follows. 


(a) Tò each point in I space corresponds one situa- 


tion Z. ; 
(b) For each Z there exists a volume in I’ space such 


25 This division seems on first sight artificial in classical sta- 


| 9 tistics, while it occurs naturally in quantum statistics. Uhlenbeck 
f (27U) discussing this step remarks ‘eIt is as if Boltzmann had a 
if premonition of the occurrence of discrete quantum states in 


p space.” There is, however, more to it than this and to a large 
extent one may say that even in classical statistics the division 
is natural as it corresponds to the limitations of our macroscopic 
measurements. Compare in this connection the discussion of the 

“coarse grained ensemble density in Sec. C(1) and the discussion 

a t e present section. 

= Sree ne the Ehrenfests’ notation (11E2): Z for Zustands- 


verleilung. 


that each point of this volume corresponds to the same 
Z. Such a region we shall call a Z-star, and its volume 
is given by the equation 


W(Z)=LN ILN iio. (A3.01) 
Equation (A3.01) can be derived if we remember that 
the collective of the numbers V; remains unchanged, 
and hence Z remains the same, under the following two 
operations. (a) Each of the .V points O™ sweeps through 
its own cell, resulting in the filling out of a volume 
Q=w" in I space. (b) Any permutation of the V points 
such that only points in different cells are permuted. 


We now introduce a quantity H(Z) by the equation 
H(Z)=—\nW(Z). (43.02) 


Substituting expression (A3.01) for W(Z) into (A3.02), 
using Stirling’s formula for the factorial in the form 


Ing !=a Inv—x (A3.03) 
and neglecting additive constants, we get 
H(Z)=ZiNilnNi- (A3,04) 


Comparing Eqs. (A2.03) and (A3.04) we see that 
H(Z) is the same as Boltzmann’s H with the proviso 
that instead of integrating we take a sum over the cells 
which we introduced in ø space. 

It can easily be shown (e.g., ESM Sec. 1.7) that, if 
we define the equilibrium distribution V,” as that dis- 
tribution for which W(Z) is maximum, it follows that 
in the case where the total energy Æ and the total 
number of particles .V is fixed, corresponding to the 
restricting conditions 


N=ZN,, (A3.05) 
E=ZN Ey (A3.06) 


where E; is a representative energy for the ith cell, 
the V; obey the equation 


N=exp(u—BE;), 


which is the so-called Maxwell-Boltzmann distribution. 
In identifying the equilibrium distribution with the 
distribution for which W(Z) is maximum, we have 
defined the probability for a situation as the correspond- 
ing volume in I space. í 

In order to prove that the equilibrium distribution 
is the one for which W (Z) is maximum, or H (Z) mini- 
mum, we should like to introduce an H-tñeorem, that is, 
show that dH/dt is always negative, unless H=Hmin, 
or something of this kind. However, as soon as we start 
discussing the behavior of H(Z) as a function of time, 
we realize that any function which depends on the 
coordinates and momenta of the particles in the system 
through the V; will be a discontinuous function,’ since 
whenever one of the representative points in » space 
leaves one cell and enters another, the two N; coricerned 
alter by unity. For the time dependence of H(Zj, we 


` (43.07) 
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get thus a step function’? and its time “derivative will 
only have three possible values, — ©, 0, and 4-77" 

From the step function we now obtain the so-called 
H-curve by selecting a discrete set of points which are 
separated by constant time intervals. The length of the 
time interval 7, say, is chosen in such a way that it is 
small compared to experimental time intervals, but 
sufficiently large so that during r a large number of 
collisions occur.*8 

We can now use the properties of the H-curve to 
illustrate the following points.’ We shall use the simple 
model introduced in Sec. A(1) for this illustration. It 
can be shown (54G, 55G) that this model is only one of 
a larger class of models all showing the same behavior. 


J. It is possible to find a function A which, on the one 
hand, measures the departure from equilibrium and, 
on the other hand, is directly related to the entropy of 
the system, or H.*! 

II. If the history of a system is followed in time, the 
equilibrium distribution, for which A is equal to its 
minimum value, will be realized much more often 
than any other distribution. Furthermore, the chance 
of observing another distribution is so small that the 
equilibrium distribution is the same as the average 
distribution. 

ILI. If the system is in a situation corresponding to 
A> Amin, the development of the system will be such 
that A will most probably decrease. 

IV. The previous statement will be true whether we 
read the A curve—which is obtained from the step 
function A(/) in the same way as the H-curve is ob- 
tained from H(/)—from i=—œ to t=+ 0, or from 
t=+o tot=—o, i 

V. The A curve will practically always be in the 
neighborhood of Amin. 

VI. The time between the reoccurrence of a A value 
different from Amin increases steeply with increasing A. 

VII. If Dn is the average of all A values at to-+17, 
starting from Ao at fo? the D-curve, that is, the se- 
quence of values Di, De, ---, Dz, +- will decrease 
monotonically from Ap approaching Amin asymptotically. 

VIII. By far the most of the A curves will follow the 


*7 Tt must be noted that this fact does not depend on the char- 
acter of the intermolecular forces but is solely a consequence of 
the introduction of finite cells w. 

2ta More mathematically expressed: the time derivative is 
everywhere zero except where H is discontinuous when it is not 
defined. n 

28 Compare the choice of the Size of the cells in u space. 

2? The numbering of the points, and the contents of some of 
them, is different from that of the Ehrenfests (1112) or that 
in ESM. 

2 See also 51B2 and 52H. 

3! We have not proved that apart from additive and multiplying 
constants, H given by Eq. (A2.03) is the entropy of the system. 
For such a proof we refer to the literature (see, e.g. ESM pp. 22 
and 43). 

82 In considering points VII and VIII (and also to some extent 
point IJI), it must be borne in mind that H and thus A does not 
determine the situation uniquely. For a given value of A there are 
thus*several possible situaticns Z. 
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D-curve for an appreciable period, but practically none 
of them will follow the D-curve at all times. 

We shall call the sequence of values Ay’, A,’, ---, 
An’, «++, which would follow from Ao by an application 
of the unrestricted //-theorem based on the Stosszahlan- 
salz, the Stosszahl curve. We then have the following 
point. 

IX. The Stosszahl curve and the D-curve åre identical. 


Let us now consider the Lorentz model. The function 
A mentioned in I will be defined by the equation 


+n 


A= Ds = h) 


v=—m 


(A3.08) 


where the f,° are defined by Eq. (A3.11). In Eq. 
(A3.08) we have introduced the division of u space— 
which in this case reduces to the unity sphere—into 
finite cells. Each cell is an element of solid angle of 
extension dw, where 


ôw=4r/(2m-+1). (A3.09) 


The 2m+1 expressions f, which are functions of time 
and which together determine the state of the system 
are defined in such a way that f, is the number of elec- 
trons moving in the direction specified by the vth 
element of solid angle. Only 2 of the f, are independent 
as they satisfy the relation 


+m 


DR O=N. 


=m 


(A3.10) 


The equilibrium values f,° of the f, are all equal and 
given by the equation 


foc=N/(2m+1). (A3.11) 


As one can show that the f, are practically always 
in the neighborhood of f,* [compare Eq. (A3.16)] so 
that we can assume f,— fe’ fe we have up to second- 


order terms in (f,— f,°)/ fy 


H= yf, nf,=, fo? In fy A/2N 
=HegtA/2N. (A3.12) 


From Eqs. (A3.08) and (A3.12) we see first of all 
that it makes no difference whether we use H or A to 
describe the departure from equilibrium, and secondly 
that A is a non-negative function of the fo which is only 
zero when all the f, are equal to their equilibrium values. 

As far as point II is concerned, we can first of all 
calculate the relative occurrence of a state of the system 
corresponding to a nonequilibrium value of A. We find 
in this way for the normalized probability w(A)dA that 
A has a value in the intezval (A, A+-dA) the expression™ 


w(A)dA=[ (2Qm-++ 1)/2N ]m[A"-/ (m—1) "J 
Xexp{—(2m-+1)A/2N}dA. (A3.13) 


= In the Ehrenfests’ discussion where H was considered instead 
of A, this curve is called the H-theorem curve (see also 54G, 55G). 
For a detailed discussion and derivation of the formulas in 
the remainder of this section, we refer to Appendix I. 


of 
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The function w(A) takes the place of the W(Z) of the 
beginning of the section and we see that, indeed, w(A) 
reaches its maximum value for A=(), that is, for the 
equilibrium situation. 

We now make the assumption that the time 4(Z) 
spent in a situation described by Z will be proportional 
to W(Z).*° In that case the time average G of a phase 
function G will be given by the equation 


G=[22G(Z)W(Z)V/[22W(z)]. (43.14) 


The assumption which we have just made about the 
proportionality of (Z) and W (Z) is necessary in order 
to continue the argument. It is also an assumption which 
is closely related to assumptions made in such theories 
as those which discuss the outcome of lotteries. In the 
latter case since each draw is supposedly independent 
of all the preceding ones, the proportionality of the 
probability for a certain result to the number of times 
such a result occurs in an infinite series of draws is 
fundamental and, indeed, used to define the probability. 
However, when one is dealing with processes which are 
in fact determined by the preceding history, the pro- 
portionality of time spent in a situation and the 
probability for this situation must be assumed. More- 
over, it cannot be used to define the probability since 
the processes considered are not even Markovian. The 
difficulty lies in the fact that ¢(Z) is concerned with the 
long-time behavior of a system, but as the orbit is 
completely determined once one point of it is known in 
T space, the longer we follow the orbit the more data 
we obtain from which to determine completely the 
orbit and hence we are no longer dealing with purely 
statistical situations. We may refer in this connection 
to the discussion given by Tolman (p. 148 of 38T; see 
also 50B). Our assumption is thus closely connected 
with the Markovization of the processes giving the 
development of a system (compare 49S, 54G, 55G, 
55H2). 

We must also draw attention to the fact that the 
proportionality of /(Z) and W (Z) is nol proved by the 
ergodic or quasi-ergodic theorem—as one might have 
hoped—since this theorem only proves the propor- 
tionality of /(Z) and Wx(Z), but not the proportion- 
ality of IVz(Z) and W (Z). It remains thus one of the 
basic assumptions in the approach to the H-theorem 
in the form in which we discuss it at the moment and, 
moreover, one which seems to be no nearer to a proof 
notwithstanding the vast amount of work done either 
on simplified models or on the =rgodic theorem. 

To continue our discussion >f points II to IX, we 
shall confine ourselves mainly,to those phase functions 


35 We do not wish to stress here the point that this assumption 
really involves two steps (see ESM p. 350). The first step is the 
assumption that the area W’g(Z), cut out of the energy surface 
on which the representative point of our system is situated, will 


- be proportional to W (Z). The second step consists in assuming 


1(Z) to be proportional to Wz(Z). It may be mentioned here that 
Einstein (03E, 10E) made the samg (combined) assumption as 
we are making here. 


which depend on the p’s and q’s only through A. For 
such functions Eq. (A3.14) can be written in the form 


Ga f w(A)G(A)da, (A3.15) 


since w(A) is normalized. We can, for instance, caleu- 
late the average value of A and the dispersion around 
this average value, and we find 


An= N, ((A — Ky)? w= N/m. (A3.16) 


In order to discuss the variation of A with time, we 
introduce the time interval + and calculate the prob- 
ability w(A,A’) that A changes its value from A to A’ 
during this time interval z. Once these transition prob- 
abilities are obtained, we find the average value of A’ 
after 7 when at the beginning of the interval! the value 
was A from the equation 


at= fotad, (A3.17) 
provided w(A,A’) is normalized with respect to A’, and 
for the average rate of change of A [and thus of H; 
compare Eq. (A3.12) ] we have 


(dA/dt) u= (Aw —A)/r. (A3.18) 


In the special case of the Lorentz model, we find for 
w(A,A’) the expression (see Appendix I) 


w(A,A’)= (16r VAA) 
Xexp{—[A’—-A+AA(Qm+1) #/16VAA}, (A319) 


where A is given by the equation 
A=2ar, (43.20) 


with @ given by Eq. (A1.16). From Eqs. (A3.17) to 
(A3.19) we now get 
An! =(1—A)A, (A3.21) 


(dA/dl) u= —AA/r. * (A3.22) 


Point III follows immediately from Eq. (A3.22). 
We see here the same exponential approach to equi- 
librium as was found in Sec. A(1) (Eq. (A1.17)). We 
shall see presently the close connection which exists 
between Eqs. (A1.15) and (A3.22). 

In order to prove point IV, we must calculate the 
average value A’’star, of A which after a time interval 7 
leads to a value A. This quantity is given by the 
equation y 


Manm faroera a I. 
[ f (a po(a”ayaa” (43.23) 


where the denominator arises from the fact that it is 
not equal to unity [w(A”)w(A’” A) is only normalized 
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if integrated over both A and A”J. From Eqs. (A3.23), 
(A3.13), and (A3.19), we find 


Ì Alaan = (—A)A, (A3.24) 


and point IV has been proven." 

Point V follows from the expression we found for 
w(A) and the assumption that the chance of finding a 
value of during a sequence is proportional to w(A). 
In order to see point VI, we use a formula of Chan- 
drasekhar’s (43C) for the recurrence time O(A) of a 
state characterized by A,” 


@(a)=7[1—w(A)]/[2(A)—w(A)w(A,A)], (A3.25) 
or 


O(A)&7/w(A), (A3.26) 


from which point VI follows [see Eq. (A3.13)]. 

In order to calculate Dy, that is, the average 
of all A values at ¢otmz, starting from Ao at 
fo, we must compute first of all the normalized prob- 
ability w(An3Ao,A1,A2,°°:,An1) for a sequence of 
values Ap at fo, Ar at botr, Ag at fo+27, tt, Ani at 
lot (n—1)r, and An at lonr. This quantity is given by 
the equation (compare 55G)" 


w(An; AoA1,As,**+,Ant) 


=w(Ao,Ar)w(Ai,As)+ + +77 (An1,An). (A3.27) 
For Dn we now get 
Dar f Alan; Ao, A1,A2," tAn) 
XdAdA2: --dAn, (A3.28) 
from which follows that 
Dr= (1—4) "Ao, (A3.29) 
and putting ‘=r we get 
DSA ce 447 = Ae", (A3.30) 


Point VII follows immediately from Eqs. (A3.29) or 
(A3.30). 


We can also calculate the dispersion on of A, for 
which we have the equation 


on= f (Da— An) wAn; Ao,Ai, ma Ani) 
' XdAr: dAn, (A3.31) 
or 


5 o,=0. 


z (A3.32) 
From Eq. (A3.32) and the fact that nonequilibrium 


36 It must be remarked here that in an earlier paper (53H) we 
used for the proof of the equivalence of the sequence with 
j=—w—f=+ and the sequence with =+0—i=—o the 
relation w(A)w(A,A‘)=w(A‘)w(A’,A) which holds also in the 
present model. 

37 The objections raised by Bartlett (50B, 53B) against Chan- 
drasekhar’s formula are no longer valid, if we consider the se- 
quences which are continually Markovized (see 50B). 

38 Tt may be nofed that in writing down Eq. (A3.27) use is made 
of tie assumption of continued Markovization. 
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values of A recur with a frequency givew by ©(A)-1, 
point VIII follows. 

Let us now consider finally the S/osszahl curve. We 
note that from Eq. (A3.08) it follows that 


dA/dt=2,2(fo— fe*)(dfo/al).  (A3.33) 


Provided the Stosszahlansalz is valid, the rate of change 
of fe with time is given by the equation’ [compare 
Eq. (A115); the proof of Eq. (A3.34) is given in 
Appendix I] 


df,/at=[A/r IL foe fo]. (A3.34) 


Combining Eqs. (A3.33) and (A3.34), we get for the 
Stosszahl curve the differential equation 


dA/dit=—AA/r, (A3.35) 


and we see from comparing Eqs. (A3.30) and (A3.35) 
that point IX is also proved for the model considered. 

Let us now summarize the situation as regards the 
present position of the H-theorem in classical statistics. 
We notice, first of all that, although the situation has 
improved by the study of simplified models since the 
Ehrenfests (11E2) considered this problem in great 
detail, much remains to be done. As far as the ap- 
proach to equilibrium and the fluctuations arounc the 
equilibrium situation are concerned, the study of sim- 
plified models has shown that the situation is much as 
one would have expected (points II to VI)—provided 
we may treat the process as a true Markov process. 
We have discussed the behavior of the system mainly 
from the point of view of the fluctuations, but we could 
have done the same for the return to equilibrium. Siegert 
(49S) has, for instance, shown that starting from any A 
at t=, the probability for finding a value of A at 
l= will always be given by w(A). We have tacitly 
assumed that we are always dealing with a situation 
which has essentially forgotten its initial state. This 
may be done provided the relaxation time of the system 
is sufficiently short. The relaxation time fre) is essen- 
tially given by 7/A which, if we assume 1=10% m7, 
c=10' m sec, c=10- m?, m=105, leads to fr ~1078 
sec (compare ESM p. 390). This fast relaxation accom- 
panied by long recurrence times for appreciable devia- 
tions from equilibrium®® are the reasons why there are 
very few exceptions to the second law of thermo- 
dynamics, or, to put it slightly differently, why-fluctua- 
tions very seldom play an important role in the 
behavior of macroscopic physical systems. As Smoluchow- 
ski (12S) puts it, a system will appear to behave ir- 
reversibly if its initial state is characterized by an 
average time of recurrence which is long compared to 
the period available to the experimenter. In this con- 
nection, we also see confirmed Tolman’s suggestion 
(38T, p. 179) that the relaxation time would be very 
short compared to the period of the Poincaré cycle 


(Sec. A(2)), provided the size of the elementary cells 


® For an average deviation of 10-8 f,¢ e t O(A 
a eats ees o fat from fes, we get O(A) 
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were not too Jarge. Since A is inversely proportional to 
2m-+1, the number of the cells, we see indeed that { 
increases with m, and we saw a moment ago that lre 
took on a reasonable value for a reasonable value of m. 

The main problem still remaining is the justification 
of the Markovization (point VII of the Ehrenfests 
(11E2), or point IX of ESM). This seems an extremely 
difficult preblem to tackle, but it might be possible, 
certainly with the powerful calculating machines avail- 
able these days, to calculate the H-curve of a given 
system for a variety of initial conditions using the exact 
equations of motion. 

In conclusion we wish to point out that even though 
to some extent the ergodic, or better, the quasi-ergodic 
theorem has solved the question of the equivalence of 
time averages and ensemble averages, because of the 
incompleteness and inexactness of available experi- 
mental data we can only define a situation up to a 
limited accuracy. Even if we are dealing with an 
isolated system so that its representative point will stay 
on an energy surface, we®lo not know on which energy 
surface, and the only truly representative ensemble will 
be one which consists of all the systems whose repre- 
sentative points at Zo fill up a Z-star. Hence, the discus- 
sion of the H-theorem still remains germane to the 
problem we are discussing. 


rel 


B. THE CLASSICAL ERGODIC THEOREM 
(1) The Ergodic and Quasi-Ergodic Theorem 


We mentioned in the Introduction that many authors 
prefer the approach via the ergodic theorem to the one 
using the H-theorem. At the end of Sec. A(3) we 
mentioned our reasons against this point of view and we 
shall return to this discussion in Part C. However, the 
ergodic theorem has played such an important role in 
the development of the subject of statistical mechanics 
that the omission of a discussion of this problem in an 
article such as the present one would be unforgivable. 

We mentioned at the beginning of Sec. A(3) that 
Boltzmann considered two different definitions of an 
equilibrium situation, namely, as the most probable 
situation or as the average situation. We considered 
the first possibility and we shall now consider the second 
one. Boltzmann (71B1, 71B2; see also 68B) and 
Maxwell (79M) thought that one could prove in a more 
or less Straightforward manner the theorem: The 
average behavior of a system of independent particles 
will correspond to the Maxwell-Boltzmann distribution. 
The average is taken in the sense of a time average over 
an infinite period. 

The reason for believing inthis theorem was that 
Boltzmann expected the existence of so-called ergodic 
systems. An ergodic system is one such that the orbit 
of its representative point in I space will go through 
every point of its energy surface. Once the existence 
of ergodic system is accepted one can easily see the 
advantage of considering time averages. First of all it 
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has to be borne in mind that one always measures time 
averages in any physical experiment. Moreover, as 
relaxation times are usually small as compared to the 
period over which an experimentally determined quan- 
tity is measured, it does not matter whether we take 
the averages over a finite interval or over an infinite 
period. Secondly, if one could prove that actual physical 
systems were ergodic the following equalities would 
hold for any phase function ¢(p,q): 


(g)n=(9)u=(Bn= p, (B1.01) 


where the bar (—) indicates a time average and ( ),, an 
average taken over a microcononical ensemble. The 
second and third terms in Eq. (B1.01) indicate the time 
average of the ensemble average and the ensemble 
average of the time average, respectively. We shall 
presently indicate a proof of Eq. (B1.01), but at the mo- 
ment only remark that from these equations it follows 
that one can calculate instead of time averages ensemble 
averages and, since those can be calculated much more 
easily, we have a clear gain. 
The time average of ¢ of a phase function is defined 
by the equation 
1 eT 
g= lim — 
T-2 IT hap 


elp dt, (B1.02) 


where p’s and q’s are the values at the time £ of the 
coordinates of the representation point of the system is 
T space. In order to see whether or not the average 
behavior of one system corresponds to a Maxwell- 
Boltzmann distribution, one selects a set of phase 
functions which together determine the distribution 
and compares their time averages with the values to be 
expected for a Maxwell-Boltzmann distribution. 

The ensemble average (¢)s is taken over a micro- 
canonical ensemble. This average is given by the 
equation 


(en=| T eloae(bayts| y. [ f spaas), TA 


where the two integrations extend over the whole of 
the energy surface $, and where o(p,q) is the surface 
density on $ corresponding to a microcanonical en- 
semble. This surface density is given by the equation“ 


(p,q) = [> BE SHF (B1.04) 


where S is the number of degrees of freedom of the sys- 
tem, and thus the number of dimensions of I space, 
and 3 is the Hamiltonian of the system. 

From Liouville’s theorem (ESM p. 102) it follows 


The expression inside the braces in Eq. (B1.04) is nothing but 
the absolute value of the “velocity” of the representative points 
in T space. Saa 
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(see, eg, ESM p. 355) that the only Stationary en- 
sembles on an energy surface are those for which the 
surface density along an orbit satisfies Eq. (B1.04). 

We cari now proceed to the discussion of Eqs. (B1.01). 

The first equality follows immediately from the fact 
that a microcanonical ensemble is a stationary en- 
semble. The second equality is due to the interchange- 
ability of averages. There remains the last equality 
which holds for ergodic system, since ¢ is the same 
for all systems in the ensemble. This can be seen as 
follows. Since for an ergodic system the orbit of the 
representative point passes through every point of $, 
and since it follows from the canonical equations of 
motion (A2.05) that in each point the direction of 
the orbit is uniquely determined, therefore there is 
only one orbit on $. The only difference for different 
systems in the ensemble lies in the exact moment at 
which a point of the orbit is passed. As all systems 
follow the same orbit, it follows from Eq. (B1.02) that 
@ is the same for all systems and the last equality of 
Eq. (B1.01) follows. 

For an ergodic system one can even go one step 
further and prove that the following equation holds 
(71B): 

di od8 
lim —= 5 


T= T 
f ods 


where di is the time spent by the representative point 
within a surface element dS of $ during the period T. 
From Eq. (B1.05) we can, in principle, calculate the 
frequency of the occurrence of nonequilibrium situations. 

The problem is now reduced toa proof of the ergodicity 
of mechanical systems. However, one can easily see that 
it is unlikely that any system is ergodic (38T, p. 67). 
We remind ourselves that the system of Eqs. (A2.05) 
possesses 25 integrals of motion. One of these is the 
energy and one fixes the time along the orbit. By 
choosing a point P on the energy surface, we fix the 
values of the 2S—2 other constants of motion. Choosing 
different values for these constants, but the same value 
for the energy, we find another point Q on the same 
energy surface, but there is no orbit connecting P with 
Q. In order to complete the proof one should, however, 
show that the 2S—2 constants of motion are not con- 
stant almost everywhere on the energy surface—which 
seems unlikely. 

We enter the realm of measure theory” with the 
expression “almost everywhere,” and it is by using this 
theory that one can prove the nonexistence of ergodic 
systems (13R, 13P; see also 52R). The crux of this 
proof is that the points of an orbit form a set of measure 


(B1.05) 


41We have excluded exceptional orbits which are strictly 
periodic. £ 

“1t may be noted here that many mathematical statisticians 
(see, eg., 53D) consider statistics purely as a branch of measure 
theory. For a discussion of measure theory we refer to a recent 
tere book by Halmos (50H; see also 53D). 
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zero on the energy surface, the measure of which is 
not zero, We can see this as follows (52R). Consider 
a point P of the orbit of the representative point and 
consider a small region @ of finite measure around P, 
As the orbit can clearly not stay indefinitely in Q, if it 
is to cover $ completely, the only part of the orbit in @ 
will be a collection of separate segments," corresponding 
to a succession of finite intervals of time. As the set of 
such time intervals is enumerable, the segments of the 
orbit inside @ will also be enumerable and thus have 
measure zero. 

Although the impossibility of ergodic systems was 
only proved by Rosenthal and Plancherel in 1913, the 
Ehrenfests (112, especially footnote 89a; see also 
91K, p. 484) were apparently well aware of the im- 
probability of ergodic systems and of the fact that the 
points of an orbit and all the points of an energy surface 
have different measures. In order to avoid this difficulty 
it was suggested that mechanical systems might be 
quasi ergodic. The representative point of a quasi- 
ergodic system will come arbitrarily near every point 
of the energy surface without, however, passing through 
each point. 

The introduction of the possibility of quasi-ergodic 
systems did not, however, solve the problem haw to 
prove Eq. (B1.01), although Rosenthal (14R2; see, 
however, reference 14) gave a not completely rigorous 
proof of Eq. (B1.05) based on the quasi ergodicity of 
mechanical systems and also proved the equivalence of 
time and ensemble averages. Fermi (23F1) proved that 
a certain class of systems, the so called Kanonische 
Normalsysteme,’® were quasi ergodic. A sketch of his 
proof is given in Appendix II. 

In order to advance it was necessary to find an inde- 
pendent proof of the equivalence of time and ensemble 
averages and that is the purpose of more modern de- 
velopments of the ergodic theorem which are discussed 
in the next section. The term ergodic theorem is nowa- 
days used to indicate a proof of the equivalence of time 
and ensemble averages, and the term quasi-ergodic 
theorem would probably have been more relevant from 
a historical point of view. 


(2) Recent Developments‘’ 


l Modern ergodic theory started with the paper by 
Birkhoff (31B2) who considered the following time 


#8 The measure of a point set M on § is defined as follows: 
Let f(P) be a function which is 1, if P belongs to sm and to 0 
otherwise. The Lebesgue integral "fds extending over the whole 
a gi rhen called the Lebesgue measure of M on $ and denoted 

y t. 

“ They have to be separate, as for a nonperiodic orbit no point 
of $ is ever passed twice. 

4 It may be noted here that in view of the impossibility of 
ergodic systems in the original sense of the word, modern authors 
often refer to quasi-ergodic systems as ergodic systems. 

SS interesting to note that Einstein (02E, 03E) when he, 
independently of Gibbs, introduced ensemble theory, restricted 
himself to these systems (sce also 11K2). 

41 See also 20M1, 20M2,°27H2. 
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{(P)dt, (B201) 


5 

; 1 pio 

W= ji 
T ts 


9 


where P; is the point passed at ¢ of an orbit which passed 
through P at to. He showed, first of all, that for practi- 
cally all points P of the energy surface $ the limit of 
S(P; to, T) *xists for T«. Moreover, Birkhofi showed 
that f(P; lo T) in the limit of T=% is independent of 
lo for practically all points P., The proof of this property 
of time averages on an energy surface is given in 
Appendix III. “Practically all” is understood here, as 
surmized by Birkhoff in 1922 (22B; see also 2682), in 
the sense that the exceptional orbits would form a set 
of measure zero on $. 

Secondly, Birkhoff showed that f(P; ~), i.e., the 
time average taken over an infinite period, is constant 
almost everywhere on 8, provided the group of trans- 
formations P—/, is metrically transitive, or in other 
words, provided S$ is metrically indecomposable.‘7* The 
constancy of f(P; ©) follows from the fact that, if it 
were not fulfilled, one could find a value F of f(P; æ) 
such that the conditions f(P; %)<F and f(P; ©)2F 
would define two sets of positive measure on $ which 
would both be invariant against the transformations 
P>P,8 

Birkhoff went even further and proved the equivalent 
of Eq. (B1.05), namely, that for practically all points 
Pons 


dt(P; m) Mm 


ae B2.02 
li ae (B2.02) 


where di(P; M) is the fraction of time a representative 
point of the orbit going through P spends in the region 
M during a period T. Once again Eq. (B2.02) is only 
true provided the group of transformations PP, 
is metrically transitive.” 

Birkhoff’s proof is slightly stronger than the one given 
by von Neumann (32N1, 32N2) who showed that the 
convergence of f(P; toT) was true in the mean, while 
Birkhoff showed that it was true for practically all P, 


‘The group of transformations P—P, is called metrically 
transitive and § is called metrically indecomposable, if § cannot 
be decomposed into two parts §ı and $2 both of positive measure 
and both invariant under every transformation of the group. 

48 We do not wish to discuss here the complications due to the 
existence of other integrals of motion such as linear momentum, 
angular momentum, and so on. These introduce what is called by 
Rosenfeld an inessential decomposition of $, while we are only 
concerned with the essential indecomposability of §. For our 
discussion we shall assume that there are no uniform integrals of 
motion apart from the energy and, moreover, we shall only con- 
sider metrical indecomposability in the physical sense. For a dis- 
cussion of complications which wefivoid this way we refer to the 
literature (see, e.g., 52R). We must also refer to the work of 
Grad (52G1, 52G2) who has considered in detail statistical me- 
chanics of systems with integrals other than energy. see 

From Eq. (B2.02) we see the relation between Birkhofi’s 


. ergodic theorem and Poincaré’s recurrence theorem (compare 41W 


90-91). This equation expresses the fact that the representa- 
ie ore will spend a finite period in each region of positive 
measure. ° 
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The problem of the ergodic theorem is now reduced 
to the problem of proving that in general energy sur- 
faces are metrically indecomposable. In 1941 Oxtoby 
and Ulam (410) proved this for a quite general class of 
surfaces, which were polyhedra of dimension three or 
more. They thus showed, in their own words, “that the 
ergodic hypothesis in its modern form of metrical 
transitivity is at least free from any objection on 
topological grounds.” According to Gamow (49K1, 
p. 54; the italics are Gamow’s) this implied “that in a 
certain sense almost every continuous transformation is 
metrically transitive.” Gamow unfortunately does not 
give a justification for his statement. However, the 
energy surfaces considered by Oxtoby and Ulam are not 
quite physical systems, and it has often been felt there- 
fore that, although Birkhoff’s ergodic theorem had 
brought the problem a great deal nearer solution, one 
was still faced with a hypothesis. As Birkhoff and Koop- 
man (32B) put it: “The quasi-ergodic hypothesis has 
been replaced by its modern version: the hypothesis of 
metrical transitivity.” However, it has apparently been 
overlooked that the metrical indecomposability of the 
energy surface had been proven for a certain class of 
physical systems of which some are known to exist. 
In order to see this, we first observe that, for any 
system which is quasi ergodic, the energy surface is 
metrically indecomposable; thus justifying the con- 
jecture of the Ehrenfests that, for quasi-ergodic systems, 
Eq. (B1.05) would hold. Indeed, quasi ergodicity implies 
that every orbit on the energy surface will pass through 
any region @ of positive measure. Since an orbit is 
produced by the collective of the transformations 
P—P, with £ ranging from —~ to +, it follows that 
the condition of metrical indecomposability is satisfied 
for the energy surface of any quasi-ergodic system. 
Secondly we remember that Fermi (23F1) proved the 
quasi ergodicity of Kanonische Normalsysteme and that 
Poincaré (92P) has shown that in the reduced three- 
body problem one is dealing with such a system.” Of 
course, in this case the number of degrees of freedom 
is very small and it would be of interest to show that 
other physical systems also belong to the same class. 
Kanonische Normalsysteme are characterized by the 
following properties. After eliminating the uniform 
integrals of motion such as the components of the total 
momentum, it is possible to introduce sets ef canonically 
conjugate variables x; and y; such that (i) the energy £ 
is independent of the time; (ii) there exists in the system 
a parameter a, such that the Hamiltonian 5¢ can be 
expressed as a power series in a, 


R=Kotakitasrt--+; (B203) 


(iii) the first term Xe in Eq. (B2.03) does not depend on 
the x; but the other terms may depend on both the 


® The parameter @ occurring in Eq. (B2.03) is then the ratio 
mz/m1, where the three masses, mu ms and ms, satisty the in- 
equalities m&m <mai., 3 
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x; and the y;; (iv) all the 3; are periodic with a common 
period in the x; 

Before we discuss briefly Hopf’s ergodic theorem, we 
want to draw attention to two points. The first one is 
that Einstein and other writers (02E, 03E, 06P, 11K2) 
explicitly state that they believe all physical systems 
to belong to the class of Kanonische Normalsysteme. 
Secondly, we want to point out that, in the case of a 
system satisfying the condition of metrical indecom- 
posability of its energy surface, the various integrals of 
motion are nearly*! constant over the whole of the 
energy surface and that thus Boltzmann’s and Maxwell’s 
assumption of ergodicity, which involved as we saw 
exactly constant integrals of motion almost everywhere 
on the energy surface, was not as far removed from ac- 
tual fact as one might think. 

The ergodic theorem of Birkhoff’s is restricted to the 
discussion of orbits on one energy surface. However, as 
we have emphasized before, it is more in accordance 
with the physical reality to consider a set of energy 
surfaces corresponding to energies within a finite in- 
terval, say, 


Ev—8<E<E;+5, (B2.04) 


(compare ESM p. 99) where 6p in order that we can 
still speak of a system with a (more or less) well-defined 
energy. For such an energy shell Hopf (37H; see also 
30H2, 32H1, 32H2, 32H3, 32H4, 34H, 52R) has given 
a slightly modified ergodic theorem which again proves 
the equality of time and ensemble averages. The neces- 
sary condition for the validity of Hopf’s ergodic theorem 
is that not only each energy surface in the shell is 
metrically indecomposable, but that also almost every 
energy surface of each product space, which is obtained 
by introducing a phase space the coordinates of which 
consist of a pair of sets of coordinates of T space,® is 
metrically indecomposable. If this condition is satisfied, 
any distribution in the energy shell will ultimately 
become a more or less uniform one (compare the dis- 
cussion in Sec. C). Hopf’s ergodic theorem does not 
introduce any new essential feature. The main impor- 
tance lies probably—apart from its mathematical in- 
terest—in the fact that the quantum-mechanical 
counterpart of the ergodic theorem is more closely 
related to Hopf’s than to Birkhoff’s theorem, as we shall 
see in Part E. 


C. THE H-THEOREM IN CLASSICAL ENSEMBLE 
THEORY 


(1) Fine-Grained and Coarse-Grained Densities 


Up to now we have been considering only the be- 
havior of isolated systems. As we have already men- 


i We do not wish to make our statement more rigorous as 
this would involve a lengthy discussion. 

52 This reference gives an extensive bibliography on the mathe- 
matical aspects of the quasi-ergodic theorem. 

Bs Thep. ase space is so to speak. duplicated. For a more exten- 
sive discussion we refer to Hepf’s papers (see also 52R). 
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tioned a few times, this approach does not do justice, 
in our opinion, to truly statistical considerations. It js 
rather the approach appropriate to the kinetic theory of 
gases, and it is, therefore, not surprising that both the 
original form of the -theorem—which we discussed in 
Part A—and the classical ergodic theorem (Part B) 
found their origin in Boltzmann’s work which found its 
culmination in his Gastheorie (96B, 98B). When Gibbs 
developed his statistical mechanics and introduced the 
ensemble theory, a new element entered the discussion, 
Gibbs tried to show (02G, especially Chapter XII; see 
also 07L) that an ensemble of systems would evolve in 
such a way that it would approach to a micro- or a 
macrocanonical ensemble.*! In order to do this one 
must introduce a slightly different H-theorem and this 
will be discussed in the present section. We shall base 
our discussion on grand ensembles, partly because one 
can easily adopt the discussion to the case of petit 
ensembles, and partly because it is, in our opinion, the 
logical consequence of statistical considerations to 
consider these ensembles, first’ introduced by Gibbs in 
the last chapter of his monograph (02G, Chapter XV), 
Once the importance of the idea of representative en- 
sembles is accepted, one is led automatically to grand 
ensembles as we shall see in the next section (see, also 
ESM p. 235 and 55H1). 

To simplify our discussions we shall consider only 
systems containing one kind of particles. We shall 
denote by » the number of particles in a system. A 
system containing » particles each with s degrees of 
freedom can be described by vs generalized coordinates 
(q) and vs generalized momenta (p). Its phase space, or 
T space, has thus 2sv dimensions. If we consider a grand 
ensemble, we are dealing with systems of a varying 
number of degrees of freedom. Let D(v; p,q)dQ, denote 
the number of systems in the ensemble with v particles 
and with a representative point (the point in its phase 
space with coordinates equal to the values of the 
generalized coordinates and momenta is called the 
representative point) within the volume dQ, of its 
phase space. Here we have 


d2,=TI dprdgr. (C1.01) 
k=1 


We have indicated the dependence of D on the gẹ and 
br by p, q. Let Nens be the number of systems in the 
ensemble, so that we have 


Nens= D foe: Pg) dQy, (C1.02) 
v=1%. 


where the integration extends each time over the whole 
of phase space. We now introduce the density of the 


4 We often use the term macrocanonical for ensembles called by 


Gibbs canonical. The reasons for this deviation from orthodoxy 
have been given elsewhere (54H2). 
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ensemble,” p(v; p,q) by the equation 
P=D/ Neus, (C1.03) 


and from Eqs. (C1.02) and (C1.03) it follows that p 
satisfies the normalization condition 


E, fo; pio =1. (C1.04) 


The index of probability y is given by the equation 
(C1.05) 
The average value (G) of any phase function G (v; p,q) is 


given by the equation 


G@=y, f Gdo. 


n=Inp. 


(C1.06) 


Up to this point we have not yet taken into account the 
implications of the fact that in a system containing only 
one kind of particles we are dealing with v identical 
entities, and that we must make up our minds whether 
or not to consider situations which differ only in the 
permutation of some of the particles as being different. 
If we consider them to be different, we are dealing with 
specific phases, but if we consider them to be the same, 
we are dealing with generic phases. As is easily seen each 
generic phase contains v! specific phases. Although, 
even before the introduction of quantum mechanics, 
there were strong reasons to prefer the generic phases 
to the specific phases (see, e.g., 02G, or ESM p. 142), 
the decisive argument in favor of generic phases is that 
only they lead to formulas which are the limiting cases 
of the quantum-mechanical ones. In the following we 
shall only deal with generic phases and, moreover, 
assume that integrations such as occur in Eqs. (C1.02), 
(C1.04), or (C1.06) are extended only over all different 
generic phases; that is, of all v ! different specific phases, 
only one is taken into account. 

We now consider the properties of a quantity o de- 
fined by the equation 


G= fema, (C1.07) 


We see, first of all, that ø is the average value of the 
index of-probability, 7. Secondly we shall show that o 
has the following properties. 

(a) If our ensemble is such that all systems have the 
same number of particles V and that the energy e of all 
systems lies in the interval EXe< E+46£, o will be 
minimum if p is given by the equation 


p=constant ô(v—N) for EXe< E+6E; 
p=0 for EZ or E+6E<e, 


where 6(x) is the Dirac delta function (35D, Secs. 20, 21). 


| (C1.08) 


55 Gibbs calls D the density (in phase) and p the coefficient of 
probability. ° 


(b) If our ensemble is such that all systems have the 
same number of particles, but only the average energy 
is given, that is, p must satisfy the condition® 


ae, f cota. =E, 


then o will be minimum, if p satisfies the equation 
(C110) 
where 6 and y are constants which can be determined 
from Eqs. (C1.04) and (C1.09), 

(c) If only the average number of particles and the 
average energy of the systems in the ensemble are given 
so that p must satisfy Eq. (C1.09) and the condition 


(C1.09) 


p=(v—N)ee vo, 


a faex, (C1.11) 
a will be minimum, if p satisfies the equation 
p=exp[—q+vp— ße]. (C1.12) 


Once again 8, u, and q are constants, that is, they do not 
depend on the px, qw, or v, and they can be determined 
from Eqs. (C1.04), (C1.09), and (C1.11). 

Before we prove these properties of o, we may just 
remind ourselves that the densities given in the three 
cases correspond, respectively, to an energy-shell 
ensemble,*’ a macrocanonical ensemble, and a canonical 
grand ensemble. It follows from the usual considera- 
tions (e.g, ESM Secs. 5.3 and 6.1) that 8, which is 
equal to the inverse of the modulus of the ensemble, is 
equal to 1/kT (k: Boltzmann’s constant; T: absolute 
temperature), that y is the free energy of the system 
represented by the macrocanonical ensemble, that y 
is equal to @ times the partial free energy or the partial 
thermal potential, that g is Kramers’ g-potential —which 
for a homogeneous system is equal to BpV (p: pressure 
V: volume)—and that in both case (a) and case (c) 
(n) is equal to —S/k (S: entropy).™ 5 

The proof of the minimum properties of ø rests mainly 
on the fact that the function y, given by the equation 


(C1.13) 


is positive for x0 and zero for x=0. This property of 
y follows most easily if one sees (i) that y{0)=0, and 
(ii) that dy/dx(=.xe*) has the same sign as x. 
Compare now in the three cases (a), (b), and (cf two 
densities p, and ps where p; in each case is the one which 
should lead to the minimum value of a, that is, pı is 
given, respectively, by Eqs. (C1.08), (C1.10), and 


y=xe7—e7-+1, 


$$ The summation over » is here trivial as only one term of the 
sum contributes. 

‘7 This ensemble is sometimes called a microcanonical ensemble, 
although the microcanonical ensemble is really only the limiting 
case of an energy-shell ensemble. i 

58 This is another example of the advantage of the sgeneric 
densities over the specific densities, as this relation between S and 
p does not hold for the specific density. anoe 
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(C1.12) while pẹ is given by the equation 
po=pie™, (C1.14) 


where Ay may be any function of the pẹ and qx and in 
case (c) of v. Both pı and p: satisfy Eq. (C1.04), 


Z f aio- f pxto= 1, 


while in cases (b) and (c) both satisfy Eq. (C1.09), 


(C1.15) 


E f oa=¥ f odo- (C1.16) 
In case (c) both satisfy Eq. (C1.11), 
D feaz. fean. (C1.17) 


Finally we have 


a—n=> | (pz Inp2—p1 Inpi)dQ.  (C1.18) 


From Eq. (C1.18), one can derive for all three cases the 
equation® 


Gace J pifAne—e+-1d2>0, (C1.19) 


the last inequality being a consequence of the proper- 
ties of the function given by Eq. (C1.13). 

We have now proved that g is minimum for an energy 
shell, macrocanonical, or canonical grand ensemble 
under certain circumstances (we shall investigate in the 
next section the physical significance of these circum- 
stances). Comparing Eq. (C1.07) for o with Eq. (A2.03) 
for H, we might think for a moment that we might be 
able to prove that do/dt will always be negative and thus 
be able to show a tendency for the establishment of 
canonical ensembles. However, it is easy to show that 
do/di=0. This was pointed out soon after the publica- 
tion of Gibb’s monography by Burbury (03B). In 
order to see this we write 


o(t”)= = fo" lnp”dQ" = zji Inp’JdQ’ 


=> if p’ Inp'do’=o(0’), (C1.20) 


where p’[p’] is a short-hand notation for p(p’’,q’’; t) 


59 The derivation of Eq. (C1.19) follows in case (a) by adding 
to the right-hand side of Eq. (C1.18) the expression 2 f(1—Inp;) 
X (¢;—p2)d2, which is zero by virtue of Eq. (C1.15) and the fact 
that pı=constant. In case (b) one adds to the right-hand side of 
Eq. (C1.18) the expression 2 /[8(y—¢)+1](o:—p2)dQ which is 
zero by virtue of Eqs. (C1.15) and (C1.16). Finally in case (c) one 
adds the expression 2 f{q—vu-+Be—1} (p2—p1)d® which is zero 
birtue of Eqs. (C1.15) to (C1.17). 
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[p(p’,q’; t], where the relation betweer p”, q” and 
p', 7’ is such that a representative point p’, g'at time /’ 
will have moved to p”, q” by 0’, where J is the 
Jacobian of the transformation from p”, q” to p’, q' 
which by virtue of Liouville’s theorem (e.g., ESM 
p. 102) is equal to 1. 

Although we see that o is constant, there is still in a 
certain sense an approach to a stationary state. We 
shall first illustrate this by an example due to Gibbs 
(02G; see also the discussion in 03B, 04B1, 04B2, 
062, 1112). Consider a container with a liquid, say 
water, in which is put some coloring material and let 
us assume that this coloring material is nondiffusible 
and consists of colloidal particles. It is a well-known 
empirical fact that, if we start from a state where the 
coloring material is unevenly distributed, practically 
any kind of stirring will produce a situation where the 
color distribution is, as far as our eye can see, uniform. 
That means that stirring will produce an “equilibrium” 
state. However, if we look at the system very closely, 
we will still find that, in microscopic volumes, part of 
the space is occupied by the water and part occupied by 
the colloidal particles. Although the coarse distribution 
is uniform the finer distribution is still uneven. 

From this example it follows that it might be ad- 
vantageous to introduce apart from the fine-grained 
density p a coarse-grained density P defined as follows. 
Divide for each v the corresponding T space into finite, 
but small, cells Q; of volume W(Q;), and let P; 
be the average of p over 0;, 


ra=] f pio, | if W(Q:), 


where the integration extends over the cell Q;”.” 

We now introduce the coarse-grained density 
P(v; p,q) by putting it constant in each cell 0; and 
equal to P;“. From Eq. (C1.04) we now get, as can 
easily be verified, 


Dae ry POW Q:)= 1, 


Z [rao=t, 


and we see that P is normalized. 
Instead of o we now introduce a function 2 by the 
equation 


(C1.21) 


(C1.22) 
or 


(C1.23) 


D=), EW (UGORO InP, (C1.24) 


As we are dealing with systems with only one constituent the 
representative point of a system will stay in the same T space and 
there will not be a change in v. The situation is, however, much 
more complicated if we consider systems in which chemical reac- 
tions can take place. These might be tackled using the second 
quantization methods developed by Schönberg (52S1, 53S1, 5352). 

St Compare introduction of cells in Sec. A3. 

®We could have slightly generalized the definition of the 
coarse-grained density by also dividing the possible v-values into 
intervals containing a few integers each. However, in order not to 
complicate our formulas we have not done this, as it is not neces- 
sary for our arguments in the case of systems containing only one 
constituent. 2: 
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which expressed in the coarse-grained density P gives 


z=5 fP InPdQ, (C1.25) 
or, if we remember that InP is constant over cach cell 
Q; and that the integration of P over 9, is the same 
as that of p over 2;, 

n 


Y= (InP). (C1.26) 


The properties of ø of cases (a), (b), and (c) now hold 
for X, if we everywhere change p to P. However, now X 
is no longer constant and we can, indeed, show that > 
will decrease; thus showing a tendency toward the 
establishment of canonical ensembles—at any rate in 
so far as the coarse-grained density is concerned. We 
may remark at this point that although the distinction 
between fine-grained and coarse-grained densities is 
assumed in Gibbs’ monograph (02G), the two are not 
clearly distinguished in the discussion and the same is to 
a large extent true also.for the discussion of Burbury 
(03B, 04B2) and Bumstead (04B1); we may further 
refer to discussions by the Ehrenfests (06E2, 11E2), 
Poincaré (06P), Lorentz (07L), and Krod (11K2). 

Let us now consider the change of with time. In 
ordet to do this we have to anticipate some of the argu- 
ments of the next section where representative en- 
sembles are discussed. Let us assume that we have made 
some observations about a physical system at /’. Since 
these observations will never give us the maximum 
possible information, we can construct an ensemble, the 
average properties of which at ¢’ correspond to the 
observed properties of the system under observation 
at t’. Because of experimental limitations, we shall at 
most be able to give p(v; p,q) changing from one cell 
to another, if we have chosen the size of the cells in 
accordance with the experimental limitations, as we 
shall assume to have done. We choose thus the fine- 
grained density constant in each cell and have then at ¢’ 


P=, (C1.27) 
and for , 


D/=¥ f prinp'do=> fo" Inga, (C1.28) 


If the situation at ¢’ already corresponded to an 
equilibrium situation, © would have had its minimum 
value and no change could be expected, since the three 
ensembles corresponding to minimum © in cases 
(a), (b), and (c) are all three stationary ensembles 
(see, e.g., ESM pp. 105 and 137). Let us therefore as- 
sume that p’ does not correspond to a stationary en- 
semble. At a later® time ¢’’ Eq. (C1.27) will no longer 

6 Strictly speaking we should say “at another time,” leaving 


open the question whether ¢” is later or earlier than f’. However, 
since we have set up our ensemble to represent a system observed 


„at l, we are only interested in predictions which might be verified 


AA : F; an aii Sacer 
by experiment, that is, in Jafer times i”. In this connection it is 
EEE to read Burbury’s and the Ehrenfests’ remarks on this 


point (03B, 04B2, 06E2; see also 02G, 04B1). 


hold and we have 


p"#ÆP", (C1.29) 


because of the fact that, although p will stay constant 
in extensions in phase of unchanging volume W (0), 
the shape of these extensions will change and at a 
later moment each of the cells will be covered by 
points which at f belonged to many different cells, 

We write Eq. (C1.29) in the form >» 


t 
p ee Pret, 


(€1.30) 
where A is a function of v and the p’s and q’s. For 3 
we have now 


Y= f P” Inpa. (€1.31) 


and for the change of X we have“ 
z-2"=5 | PEaes—est >O (C1.32) 


We see now that 2” will be less than X’ because of the 
fact that p” and P” are no longer everywhere equal. 
Comparing the present situation with the case of the 
coloring material in a liquid, we may expect that, as 
time marches on, p and P will differ more and more and 
that X will continue to decrease until equilibrium dis- 
tributions of P have been reached, that is, 


case (a) P=6(y—.V)-constant, E<e<E+6E; 
P=0; e<E or E+6E<e; 
(C1.33) 
case (b) P=ô(v— Vje? %9; (C1.34) 
case (c) P=exp[—q+ru— ße]. (C1.35) 


As soon as the distribution (C1.33) to (C1.35) are 
reached, X has reached its minimum value and will 
therefore no longer decrease. In that case we would 
thus, on observation, reach the conclusion that the 
state of the system under consideration would best be 
represented by a microcanonical or a canonical grand 
ensemble—which are stationary ensembles. This pre- 
sents us with another justification for using these en- 
sembles to describe systems in thermodynamical 
equilibrium.® 

The time needed to reach the equilibriym distribu- 
tions will, in general, be of the order of the relaxation 
time, and not—as suggested by the Ehrenfests (11£2)— 
of the order of the recurrence times of the Poincaré 
cycle (Sec. A2). It seems to us that they forget to take 
into account the indistinguishability of the constituent 


“ Equation (C1.32) is obtained by subtracting Eq. (C1.31) 
from Eq. (C1.28) changing in the integrand of (C1.28) p’_to p” 
which can be done since dp/d!=0 [compare Eq. (C1.20)], and 
agin Pe expression D te which is zero by«virtue 
oO s. (C1.04) an 23). 

6 For the usual justification see, eg., Gibbs’ sg et 
(02G, especially Chapters IV, X, and XV) or ESM, Secs. 5.3, 
5.7, and 6.1 (compare also ESM Sees. 5.6 and 6.3). ag 
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particles in the system. This means that the Poincaré 
period must be divided by N ! and time intervals of the 
order of the relaxation time ensue. 

To conclude this section there are two points which 
we should like to mention. First of all we have proved 
that, if we have set up a representation ensemble at /’, 
the quantity Ð will decrease from its value at /’ to a 
smaller value at ¢’’, but we have not proved, but only 
made plausible by appealing to the case of coloring 
material in water that X will continue to decrease. 
The situation is very similar to the one encountered in 
Sec. A(3), where we discussed the statistical aspects of 
the H-theorem. It would be worth while to investigate 
the behavior of Ð as a function of time in more detail 
along lines analogous to the discussion of Sec. A(3). 
It is interesting to note that Gibbs (02G) himself did 
not state that ©’ would monotonically decrease, but 
only that 

lim E(D), (C1.36) 
a property which he did not prove, but which for a 
special case was proven by Poincaré (06P) and Kroò 
(11K2). We may also refer to the discussion of Tol- 
man’s (38T, especially Secs. 49 and 51) and the discus- 
sion in the next section. 

The second point to which we wish to draw attention 
is that the decrease of X resulted from the fact that we 
came from a state where p and P were equal to a state 
where p was no longer everywhere equal to P. As P is 
obtained by experimental observation we may say that 
at l we “know” p. However, at i” we no longer 
possess the same amount of specific information. Since 
X can be considered to be a “coarse-grained” o and since 
g is related to the entropy by the relation 


o=(n)=—S/k, (C1.37) 


we see once more how increase of lack of knowledge 
corresponds to an increase in entropy.® 


(2) Representative Ensembles 


In the preceding section we have considered the be- 
havior of ensembles and we have thus started the dis- 
cussion of question (B) of the Introduction, that is, 
the question: Why is it possible to describe the be- 
havior of a system by considering the average behavior 
of a system in an ensemble. This is perhaps the most 
important question in statistical mechanics, but for 
some queer reasons it has not been considered as care- 
fully as it deserved. This may be partly due to the fact 
that Gibbs himself introdused ensembles to use them 
rather for statistical considerations than to illustrate 


_ the behavior of physical systems—even though he did 


not adhere strictly to his original intentions as set out 
in the introduction to his monograph. As a matter of 
fact, there are some indications that Gibbs had in mind 

66 We do not wish to pursue this correspondence between en- 
tropy and lack of detailed information or extent of ignorance, but 


refer to the literature; see, e.g., ESM p. 160 and papers by Szilard 
(29S) and Brillouin (51B4;seealso48W, 49S1, 54M1, 54G1, 55G1). 
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the possibility that the average behavior of,a system in a 
micro- or macrocanonical ensemble might represent 
the actual behavior of a physical system in equilibrium 
[see, e.g., the remarks made by Bumstead (04B1)]. 
In the same year as Gibbs and independently, Einstein 
(02E, 03E) introduced the ensemble theory and he 
definitely had in mind this possibility of the equivalence 
of the average behavior in an ensemble and the actual 
behavior of a physical system. However, the Ehrenfests 
(112), Ornstein (080), and Uhlenbeck (27U) con- 
sidered the ensemble theory mainly as a mathematical 
trick to lighten the calculation of average values of 
phase functions. Uhlenbeck, for instance, definitely 
puts the ensemble theory on a par with the Darwin- 
Fowler method (22D1, 22D2, 22D3, 2312, 2313, 23D1, 
23D2, 25F1, 26F2, 26F3; for a brief account see ESM, 
Appendix IV or 48S, Chapter VI). It was not until 
Tolman (38T, 40T) introduced the concept of repre- 
sentative ensembles that ensemble theory got a thorough 
physical foundation, but his point of view has not 
always been properly taken info account. We shall dis- 
cuss his approach in some detail in the present section 
as far as classical statistical mechanics is concerned, 
reserving for Sec. D(3) the corresponding discussion 
of the quantum-mechanical case. 
We have mentioned several times before in this paper 
that owing to severe limitations to our possibilities, we 
can only obtain very inadequate knowledge about the 
initial condition, say, of the physical systems which we 
happen to study. If we remind ourselves that one mole 
of gas contains 6.10% atoms, we see immediately that 
instead of obtaining values for the 36.10% coordinates 
and momenta which we should have to know in order 
to characterize the situation completely (assuming 
each atom to be a point particle with only three degrees 
of freedom) we can only obtain a few relations between 
them, for instance, by measuring the pressure, the 
linear momentum, and the angular momentum of the 
system. Moreover, we certainly will not know the total 
number of particles in the system; we have only to 
remind ourselves that Avogadro’s number is only 
known to a few parts in ten thousand (51B1, 51D). 
Having obtained this very scant information we now 
want to make predictions about the future behavior of 
the system from this information. Clearly the only 
possible way to do so is by using probability theory and 
statistical methods. If we could have obtained the values 
of all the coordinates and momenta at one moment— 
and if we could have solved the equations of motion— 
the behavior of the system would be completely de- 
termined, as we would then be dealing with a problem 
of classical mechanics «with a sufficient number of 
boundary conditions to determine the orbit of the repre- 
sentative point of the system in phase space. However, 
as we cannot obtain these values we are reduced to 


making statements about the most probable behavior - 


of the system. In order to determine this most probable 
behavior we compare various systems which all possess 
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the same value of those quantities which have been 
measured, but which otherwise may differ widely, 
This means, that we construct an ensemble to represent 
our systems. There remains the question how we can 
construct such a representative ensemble, and this 
introduces the problem of a priori probabilities. In 
constructing a classical ensemble we use the assumption 
of equal @ priori probabilities for equal volumes in 
I space. As far as petit ensembles are concerned this 
leads to a straightforward prescription, but in the case 
of the construction of a representative grand en- 
semble we must remember that we are dealing with 
I spaces of differing dimensions and that those with 
larger numbers of dimensions have thus larger a priori 
weights. We shall return presently to discuss the 
plausibility or justification of our choice for the a priori 
probability. 

In order to see what kind of ensemble will best repre- 
sent a system under consideration, we must first of all 
bear in mind that in practically all circumstances we 
only measure the properties of a small subsystem. In 
measuring the pressure of a gas, only the part of the 
gas near the wall takes part in exerting the force on the 
wall which is registered by our instruments. In de- 
termining the density of a system by optical method 
only part of the system is traversed by the light beam 
(compare, e.g., the case of critical opalescence). In 
measuring the electrical conductivity of a semiconduc- 
tor, a probe is put on the piece of material and, indeed, 
is often used to find out differences between situations 
at one place or another, and so on. This entails that we 
are really interested only in predicting the future be- 
havior of a subsystem. For the sake of simplicity we 
shall assume that the complete system—which may be 
either the whole of the physical system in our apparatus 
or even this system plus part of the surroundings—is 
isolated so that it cannot exchange either energy or 
particles with its surroundings. Let p be the density 
of the ensemble describing one subsystem and p’ the 
density of the ensemble describing the “rest system,” 
that is, the complete system minus the subsystem. 
Let furthermore e and v denote the energy and number 
of particles in the subsystem and ¢’ and v’ those in the 
rest system. The densities p and p’ will then satisfy 
the relations 


; Z fodo=1, (C2.01) 
X f pdd =1, (C2.02) 
E f (ctedoo'aaaa’= constant, (C2.03) 
Eor) f ododo = constant, (C2.04) 


87 This assumption was tacitly introduced when we wrote down 
expression (A3.01) for the probability of a situation Z. 
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where Eqs. (C2.03) and (C2.04) express the fact that 
in an isolated system the total energy and the total 
number of particles is constant. Let us, furthérmore, 
assume that an equilibrium situation has been reached, 
which means that X is minimum for the complete 
system, or, 


E f oe) In(pp’)dQdQ/ = minimum. (C2.05) 


In writing down Eqs. (C2.03) to (C2.05) we have used 
the fact that the ensemble of the complete system is the 
product of the ensembles of the subsystem and the rest 
system. Combining Eqs. (C2.01) to (C2.04) we can 
write instead of Eqs. (C2.03) and (C2.04), 


E f e145 f “pd = constam, (C2.06) 


Er f oin+Sy f da= constant. (C2.07) 


The equilibrium density now follows from the minimum 
condition (C2.05) which must be satisfied under the 
conditions (C2.01), (C2.02), (C2.06), and (C2.07). 
This leads in the well-known way (method of Lagrangian 
multipliers, see, e.g., ESM p. 20 or 444) to the following 
equilibrium expression for p; 


(C2.08) 


We may remark here that, as Eq. (C2.05) really only 
holds for the coarse-grained density, we really only 
obtain expression (C2.08) for P. 

Secondly, we may draw attention to the fact that our 
assumption about the a priori probabilities is used in 
writing down Eqs. (C2.01) to (C2.04) without introduc- 
ing apart from p a density function which would indicate 
the a priori weights of the various volumes in phase 
space. ; 

We have proved here in a manner slightly different 
from the one used in the previous section that the 
canonical grand ensemble will represent any sub- 
system in equilibrium. We must, however, pay some 
attention to the problem of nonequilibrium systems and 
we will follow Tolman’s discussion (38T, especially 
Secs. 79b, 51c, and 112) to a large extent. After that we 
conclude this section by discussing the choice of equal 
a priori probabilities for equal volumes ip phase space. 

Let us assume that our system is in a nonequilibrium 
situation and that we have ascertained this by making 
suitable observations on the system at /’. In order to 
predict its future behavior we construct an ensemble 
to represent the system, using, on the one hand, our 
observational information and, on the other hand, the 
assumption of equal a priori probabilities (or equal 
weights) for equal volumes in phase space. As we dis- 
cussed in the previous section we can only determine 
at most the coarse-grained density P’ and, using “tie 


p=exp[—q+rvu— ße]. 
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assumption of equal weights for equal volumes, we are 
thus led to choose our fine-grained density p’ according 
to Eq. (C1.27). Ata later time /” the fine-grained density 
p” which follows from p’ will no longer be constant in 
each of the cells into which we have divided phase 
space. As we saw earlier this results from the fact that 
P” and p” are no longer equal. From Eq. (C1.32) we 
see that /—” is the larger, the more P” and p” differ, 
and we can thus expect a continued decrease of > until 
the distributions of Eqs. (C1.33) to (C1.35) are reached. 

Tn order to see this approach to equilibrium we might 
make suitable observations on the system. From the 
forgoing discussion we would expect that these observa- 
tions would lead to coarse-grained densities such that 
5 is a steadily decreasing function of time. However, 
in making that assumption we, first of all, have tacitly 
assumed that the mixing of the various cells in phase 
space is a thorough one corresponding to the mixing 
of coloring material in water, and, secondly, we have 
forgotten to take into account the fact that each time 
we perform an observation this will add to the amount 
of available information and hence our choice of a 
representative ensemble at /’”, say, will no longer be as 
simple as at /’, since the information obtained at and 
i” must be taken into account. As far as the mixing is 
concerned, we see that we are back to the question 
treated in Hopf’s ergodic theorem and we see that, 
indeed, the approach via the ergodic theorem or via 
the ensemble theory, once we get down to the basic 
essentials, do not differ as much as one would expect 
beforehand. 

Let us now consider the problem of a priori proba- 
bilities. Let us restrict ourselves for the sake of sim- 
plicity to petit ensembles. Consider two regions Qo 
and @o of I space. Let those two regions at to be filled 
with representative points of an ensemble and let 
(Qo) and p(@o) be the a priori probabilities for the two 
regions. At /;, fs, --- the representative points which at 
fo filled up Qo and @o will fill up regions Qı and @1, 
Qz and G: -:- with a priori probabilities p(@:) and 
p(B), p(@2), and p(®2)---. One can clearly only 
tolerate such a choice of a priori probabilities that 


(Go)/P (Bo) = p(G1)/p (Bi) 


=(@2)/p(@2)=--:, (C2.09) 
and these relations are satisfied, if we take 
p(@)=W(@), p(®)=W(B),  (C2.10) 


where W (Q) ór W (G) is the volume of the region @ or @ 
in T space. We remark here that Eq. (C2.10) will only 
satisfy Eq. (C2.09) if we are dealing with a phase space 
made up of canonically conjugated p’s and q’s—one 
of the reasons for choosing the canonical coordinates to 
describe the phase of a system. 

Instead of Eq. (C2.10) one could choose the more 
general form 
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where the integration extends over the region @ and 
where /’(e,¢;) is a once and for all chosen function of 
the energy ¢ and the other constants of motion gi. 

The reason for putting [(e,¢:) equal to 1, which 
means choosing Eq. (C2.10) instead of the more general 
form (C2.11) was originally historical and was based 
on the assumption that ergodic systems existed. Since 
we are nowadays convinced that mechanical systems 
are quasi ergodic, which means essentially that the 
various possible values of ø; all occur with practically 
equal frequency in each part of phase space, we can 
immediately restrict ourselves to what the Ehrenfests 
call the ergodic choice, 


p(@)= fron (C2.12) 


However, since the density p of the representative 
ensembles of equilibrium systems are functions of the 
energy only, the choice (C2,12) instead of (C2.10) 
would correspond to a density 


p'()=p(6)/F (6), 


instead of the p(e) given by Eqs. (C1.08) or (C1.10). 
The F(e) would occur in all integrals, but the final 
results would be the same. 

We may mention here that the Ehrenfests’ remark 
that the ergodic choice is often inexpedient is no longer 
relevant as it was based on difficulties encountered by 
thermostatistics before the introduction of quantum 
mechanics. 


(C2.13) 


SUMMARY OF THE SITUATION IN 
CLASSICAL THERMOSTATISTICS 


Before we start the discussion of the status of 
H-theorem and ergodic theorem in quantum statistics, 
we wish to summarize briefly the contents of Parts A, 
B, and C. There are essentially three different methods 
of approaching the problem of how to justify the 
method of statistical mechanics: (1) the utilitarian 
approach; (2) the formalistic approach; and (3) the 
physical approach,® although the last two can overlap 
to some extent. 

The utilitarian approach justifies the statistical 
method by its results and feels no need for a further 
justification. In this approach calculations made by 
any method, be it the Darwin-Fowler method, the use 
of microcanonical ensembles, the use of macrocanonical 
ensembles, or the use of grand ensembles, are welcome 
as long as they lead to useful results. The fact that for 
physical systems with many degrees of freedom all 
these calculations lead to the same results makes it a 


*® We may also refer to a paper by Ehrenfest (14E) discussing 


a y the possible choices of F (e, ;). 
AOR f F(e p;)d9, (C2.11) The reader will forgive me, I hope, for this classification which 
= shows clearly my own prejudices and preference. 
A CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 
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matter of expediency to choose the method ultimately 
used. ‘ 

The formalistic approach usually argues that a 
mechanical system should be an isolated one and thus 
possess a definite value of its energy as well as of the 
number of particles in it, since these are the only systems 
with which classical mechanics is concerned. The only 
stationary ensemble to be considered is thus the micro- 
canonical petit ensemble although one may use the 
macrocanonical or grand ensembles as a mathematical 
trick to ease the calculations, Moreover, the micro- 
canonical ensemble is only introduced once it is realized 
that probably the time average of any phase function, 
if calculated for one system, is the same as the average 
taken over a microcanonical ensemble. In order to 
prove this one must introduce the ergodic or better the 
quasi-ergodic theorem, and adherents to the formalistic 
school of thought place the ergodic theorem in the 
center of their considerations. 

The physical approach emphasizes the fact that we 
can only obtain a limited amount of information about 
the systems in which we are actually interested and 
that we thus must work with representative ensembles, 
In this case, one must prove that the development of a 
nonequilibrium situation will be such that the ensemble 
representing the system under observation will resemble 
a canonical ensemble more and more closely. 

The considerations trying to justify the assump- 
tions of the last two methods of approach fall into 
three groups. One can either try to use the straight- 
forward H-theorem, but as we saw in Part A, one runs 
into trouble. The straightforward H-theorem would 
provide an alternative to the ergodic theorem, but the 
H-theorem in its statistical form does not do this, 
unless one is willing to introduce the various assump- 
tions about transition probabilities without further 
proofs. However, the statistical form of the H-theorem 
does serve a useful purpose in that it shows the relative 
frequency with which fluctuations will occur and the 
rate at which nonequilibrium situations may be ex- 
pected to return to equilibrium. 

One is then led to the ergodic theorem and as we saw 
in Part B the problem is at the present time reduced to a 
proof of the metrical indecomposability of the relevant 
phase spaces. If one could prove that physical systems 
are in general quasi ergodic, the ergodic theorem would 
have been proved but this has not been done. It is 
interesting to note that both Einstein (02E, 03E) and 
Poincaré (Q6P) explicitly state that they expect that 
the energy is the, only time-independent, uniquely 
valued, analytical integral of a physical system. If one 
introduces a slight amount of reality in the considera- 
tions by allowing the energy to vary over a small, but 
finite range, one has recourse to Hopf’s ergodic theorem 
where the mixing of various energy surfaces takes place. 


- 7 It is interesting to note that even Tolman uses this utilitarian 
point of view when he justifies the assumption of equal a priori 
probabilities for equal volumes in T space. 


Once again the metrical indecomposability of the phase 
spaces involved has not yet been proved. 

If one prefers the approach via the representative 
ensembles, one uses the //-theorem in the form suitable 
for ensemble theory, that is, where H is identified with 
the average value of the natural logarithm of the coarse- 
grained density. The approach to equilibrium follows 
easily, but the considerations are based on a special 
choice for a priori probabilities, and as we saw at the 
end of Sec. C(2) this choice can only be justified, if the 
systems under consideration are quasi ergodic. 

We see thus that all methods have in common the 
lack of proof of the quasi ergodicity of physical systems, 
Once this proof is given, one can either use the formal- 
istic approach since the (quasi) ergodic theorem is 
then proved, or the physical approach since the choice 
of a priori probabilities is justified from first principles. 
Concluding we can thus say that although there seems 
to be a large difference between the formalistic and the 
physical approach, this difference is in fact not so 
enormous, and both break down on the same point. 


D. THE H-THEOREM IN QUANTUM STATISTICS 
(1) The H-Theorem in the Elementary Method 


It is well known that statistical mechanics was rela- 
tively little affected by the change over from classical 
to quantum mechanics. The same methods which were 
used to good effect in classical thermostatistics could 
be taken over in quantum thermostatistics. If anything, 
statistical methods were more suited to the new situa- 
tion. Since quantum mechanics started from statistical 
considerations, namely, Planck’s discussion of black- 
body radiation (00P1, 00P2; see also 43P for an account 
by Planck himself of the reasons for introducing the 
quantum of action), it is not surprising that the ques- 
tions raised by the introduction of quantum theory into 
statistical discussions have been the subject of papers 
both before the advent of Bohr’s old quantum theory 
(13B1, 13B2) and before and after the development of 
modern quantum theory, the so-called wave or matrix 
mechanics (25H, 25B, 26S1).” In the beginning the 
discussion was mainly on the subject of a priori weights 
(see Sec. D3), but after the start of wave mechanics, 
many authors (e.g., 27N1, 27N2, 29D, 32N3, 35D, 36D, 
40H) have shown how Gibbs’ ensembles can be intro- 
duced to advantage in quantum mechanics. In many 
ways thermostatistics fits more easily into quantum 
mechanics than into classica mechanics, since quantum 
mechanics involves statistical predictions. For instance, 
even if we have the maximum amount of information 
about the physical system under consideration, that is, 
when we are dealing with that von Neumann calls a _ 
pure case (Reiner Fall), we can only verify the predic- 


7 For an account of the development of both the old and the 
modern quantum theory we must refer to the literature (e.g., 
19S, 33H, 38K). : i ` 
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tions which can be made from the appropriate wave 
function by using an ensemble.” 

In quantum statistics as in classical statistics there 
are different ways of tackling the problem with which 
we are concerned in the present paper. Once again we 
are confronted with questions (A) and (B) of the 
Introduction and once again we can try to consider 
first of all isolated systems, either by using the quantum- 
mechanical analogy of the -theorem or by considering 
a quantum-mechanical ergodic theorem. Alternatively, 
we can consider ensembles and the question of represen- 
tative ensembles. In this part we shall begin by con- 
sidering [Sec. D(1)] the question of isolated systems 
by using the so-called elementary method (see ESM 
Ch. IV) and, firstly, show how one derives the equi- 
librium distribution in this case and, secondly, discuss 
the H-theorem. After that [Sec. D(2)] we shall discuss 
the H-theorem in ensemble theory, while we conclude 
this chapter with a discussion of representative en- 
sembles and the choice of a priori weights [Sec. D(3)]. 
In the next part (E) we shall discuss the quantum- 
mechanical ergodic theorem. 

Before we discuss the elementary method, we wish 
to consider briefly Ornstein and Kramer’s consideration 
(270) of the equilibrium of a system of Fermi-Dirac 
particles (compare also papers by Nordheim (28N) 
and Halpern and Doermann (39H)). In their paper 
they show how one can derive the Fermi distribution 
from purely (or better nearly purely) kinetic argu- 
ments. Their argument runs as follows. Consider a 
system of independent particles and let each particle 
be capable of occupying energy levels ex which, for the 
sake of simplicity, we assume to be nondegenerate. 
Consider now a “collision” between two particles such 
that the energies before the collision were e; and ej and 
after the collision ev and ¢;. Conservation of energy 
gives us the relation 


Beiter tes) (D1.01) 


Let the” transition probability for this transition be 
denoted by a, the transition probability for the inverse 
transition (i’j’—ij) by a’, the number of transitions 
per unit time in the one direction by A, and the number 
of transitions in the opposite direction by A’. The quan- 
tities A and A’ satisfy the equations 


A=annj1—ny)(1—n;), 


Aamin (1—n)(1—n,)), 


(D1.02) 
(D1.03) 


e 


where n(n; ny nj) is the average number of particles in 
the energy state e:(€;,e:,¢;,), and where Eqs. (D1.02) 
and (D1.03) follow from the fact that the number of 
transitions is not only proportional to the average 
number of particles in the states t and j, but also to the 
chance of finding states 7’ and j’ unoccupied, since 


72 This is, for instance, discussed in detail by Kemble (37K; 
especially ‘Sec, 14b); see also Sec. D(3). 
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otherwise the transition cannot take place because of 
the Pauli principle (25P, 27P). 5 
If the system is in equilibrium, we have 


A=A'. (D1.04) 
We shall now assume that 
a=a', — (D108) 
and from Eqs. (D1.02) to (D1.05) we then get 
mmj=mimi, (D1.06) 
where 
mi=n;/(1— ni). (D1.07) 


Equation (D1.06) must be satisfied for any pair of 
states e; ej in equilibrium. Just as the Maxwell dis- 
tribution followed from Eq. (A1.12) and the condition 
of conservation of energy, so we get from Eqs. (D1.06) 
which have to be satisfied under the condition (D1.01) 
that in equilibrium 


m= exp (u* Bei), (D1.08) 
or, using Eq. (D1.07), 
n= 1/[exp(—n+8e) +1], 


(D1.09) 


the Fermi distribution (26F1). 

Tt must be noted here (i) that Eq. (D1.02) is purely 
based on a kinetic argument and not on any quantum- 
mechanical formalism, and (ii) that Eq. (D1.05) is 
introduced as an assumption—an assumption which 
takes the place of the Stosszahlansatz in Sec. Al. We 
may mention here that Ornstein and Kramers actually 
discussed whether Eq. (D1.05) could be derived from 
first principles and mentioned that it followed from 
considerations by Heisenberg (26H) and Jordan (26) 
that only the much weaker relation 


Dis (Giger — ar 5,1) =0 


instead of relation (D1.05) could be derived from quan- 
tum mechanics. In Eq. (D1.10) aij,i-j(airy;27) is the 
transition probability up to now denoted by a(e’), and 
the summation is over all pairs e;, e; for which eit éj is 
the same. The question in how far Eq. (D1.05) can be 
derived from Eq. (D1.10) is connected with questions 
discussed in Sec. D(3) and in Appendix V, to which 
we refer.” 3 

We have not yet proved that a distribution different 
from the one given by Eq. (D1.09) will tend toward this 
distribution. This follows easily, if we use the expression 
for the entropy of a system of independent Fermi-Dirac 
particles” [see, e.g., ESM p. 406, Eq. (A3.06) and com- 


(D1.10) 


_ It is interesting to note that Eq. (D1.10) is sufficient to de- 
rive from Eq. (D1.04) and condition (D1.01) the Maxwell- 
Boltzmann distribution when, of course, we do not have the factors 
(1=n;,), etc. in Eqs. (D1.02) and (D1.03); compare in this con- 
nection also 54K Sec. 14, 53T and 54L. 


4 i i 7 > 

3 Such particles are sometimes called fermions and the Bose- 
Einstein particles, bosons. 
A 
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pare also Eq, (D1.28)] and write 
a 


H= —S/k= 5 [1 =ni) n(1— ntn; Inzi]. (D1.11) 


The rate of change of H is given by the equation 


dH/d=>; In[n,/(1—n,) Jdn;/dt 
= Dany In[2;/ (1 =n;)] 
X {disseny (1—n)) (1—n,) 
~GizvynnjA—ny)(1—ny)}. (D1.12) 


In Eq. (D1.12) we sum firstly over all energy levels 
éi secondly over all energy levels e; with which colli- 
sions can take place, and thirdly over all pairs of energy 
levels ey and e; which can be obtained from a collision 
between particles in states ei and e;. In order to get a 
symmetrical expression, where the summation is both 


over all pairs e; and e; and over all pairs ep and e we 
write 


dH /dt= i; vy anen; (1 —ni)(1—n)) 
—nnj1—ny)(1—ny)] 
X Unlnm,;/(1—n,;)(1—n,)]}, (D1.13) 


where we have used Eq. (D1.05). If we now take the 
mean of the expression on the right-hand side of Eq. 
(D1,13) and the expression obtained from this one by 
exchanging ij and ij’, we get [compare the discussion 
of Eq. (A1.09)] 


dH /dt=% Y izv y a{In[nn;(1—ny) 
X (1=ny)/(1-ni) (1 — nnen; }} 
X [nny (1-23) 1-2) —nnj(1—ny) 
X(1—ny)]<0, (D1.14) 


where the equal sign only holds, if the 1; satisfy Eq. 
(D1.09). We see here, thus, once again the approach 
to an equilibrium distribution proved by using an as- 
sumption about the number of transitions. We leave a 
discussion of the derivation of Eq. (D1.14) until we 
have considered the approach via the elementary 
method. 

In the elementary method one takes the energy levels 
together in groups with all approximately the same 
energy. Let the ith group contain Z; levels, let there be 
N; particles in the system with energies within the 
group, and let E; be the (approximate) value of the 
energy of the levels in the group. We first of all want to 
find the-equilibrium distribution of the N; and once 
again we shall, as in Sec. A(3), define this as the most 
probable distribution, and we must thus find the 
a priori probability W (N;) for a given distribution, 
corresponding to the W(Z) of Eq. (A3.01). At this 
point we must differentiate between Boltzmann (Bo), 
Termi-Dirac (F-D) and Bose-Einstein (B-E) statistics. 
We must first of all remind ourselves that quantum 
mechanics introduces two aspects different from those 
of classical mechanics, namely, diffraction effects and 
symmetry effects.” The first aspect entered already in 


15 For a discussion of the importance ef this distinction see 54H3. 
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the old quantum mechanics and is responsible for the 
existence of the quantization of energy, angular mo- 
mentum, and so on. The second aspect enters when one 
takes into account that apparently nature only permits 
wave functions of a system of identical particles which 
are either completely symmetric in all particles (B-E) 
or completely antisymmetric (F-D). At first sight 
one might think therefore that only B-E or F-D 
statistics would occur. However there are cases such as, 
for instance, a system of particles arranged on a crystal 
lattice, where identical particles can be distinguished, 
e.g., in the case of a crystal, by the lattice site they 
occupy (compare 49R, Chap. TII, Sec. 1). In such a 
case we are dealing with Boltzmann statistics—with 
one slight difference which we shall discuss presently. 
3-E statistics were introduced by Bose (24B)?7 for the 
case of light quanta who used them to derive Planck’s 
radiation law. Einstein (2412, 25E1, 252) applied 
B-E statistics to a perfect gas, and showed how they 
led to the so-called Einstein condensation —a phe- 
nomenon which may have a bearing on the A transition 
of liquid helium (compare the discussion in Ch. IX of 
ESM). Fermi (26F1) introduced Pauli’s exclusion prin- 
ciple (25P, 27P) into the statistical discussion, while 
Dirac (26D) discussed the connection between statistics 
and wave mechanics. Ehrenfest and Uhlenbeck (27E, 
see also 27U) showed how also the Boltzmann statistics 
can be incorporated in quantum statistics. 

We can now proceed to evaluate W(N,). Let n; be 
the number of particles in the energy level e; and let 
W (n;) be the probability for a distribution of the Nj. 
We then have 


W(NV)= LW (n,), (D115) 


where the summation extends over all n,-distributions 
such that for each group 


Lnj=Ni, (D1.16) 


where the e; over which is summed belong to the group 
Z;. Our problem is now reduced to finding W(n;), 
that is, the probability that .V particles are distributed 
over the various energy levels which are all assumed to 
be nondegenerate in such a way that these are Nj par- 
ticles in level ej. 

In the case of B-E statistics for any given distribu- 
tion there exists just one wave function and hence 


Ws-e(n;)=1. (D1.17) 


In the case of F-D statistics only completely anti- 
symmetric wave functions are possible—which means 
that each level can be either occupied by one particle 


7 For a discussion of the reasons why some particles obey B-E 
and others F-D statistics we refer to the literature (e.g., 39B2, 
40P1, 40P2, 49W2). 

™ This paper was translated by Einstein who added the follow- 
ing remark: “Boses Ableitung der Planckschen Formel bedeutet 
nach meiner Meinung einen wichtigen Fortschritt. Die hier 
benutzte Methode liefert aucł die Quantentheorie des idealen 
Gases, wie ich an anderer.Stelle ausfithren will,” = 
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or unoccupied, but cannot be occupied by more than 
one particle—and we have thus either one possible 
wave, function, or none, f 


Wro) =1, 
Wr-oln;) = 0, 


Finally in the case of Bo statistics any of the N! 
permutations of the arguments of the V particles in the 
wave function will lead to another permitted wave 
function. As a permutation of the nj-particles in the 
same level would not change the wave function we 
might expect 


if all 2; are O or 1; 
i (D1.18) 
if at least one nj>1. 


W'no= NY TT) 1/15!). (D1.19) 


This does not take into account that, although we can 
sometimes label our particles, we cannot keep track of 
each separate atom and are thus overestimating the 
weight of each n,-distribution by a factor A ! which is 
the number of different microsituations corresponding 
to the same macrosituation.7’ We must thus use in the 
case of Bo statistics instead of Eq. (D1.19), the ex- 
pression 


Woo(n)=IL; (1/25). (D1.20) 


We can now use Eqs. (D1.15) to (D1.18) and (D1.20) 
to calculate W(N;). In the case of Bo statistics one 
finds easily (compare ESM, p. 74) that 


1 N;! 
We (N )= —e——=T(Z:"*/N!). (D1.21) 
oN;! [bu! 


In the case of B-E statistics we must find the number 
of different ways in which N; particles can be distributed 
over Z; levels and we get 


Wn- )=IL [V21 YN 1). (1.22) 


Finally in the case of F-D statistics we must find the 
number of different ways in which M; particles can be 
distributed over Z; levels without having more than 
one particle in any level, and we get 


Wr-v(N;) =]; PA VN; '(Z;—N;) 1]. (D1.23) 


We now assume that all Z, V;, and Z;—N; (in the 
F-D case) are large numbers so that we can use Eq. 
(A3.03) for the factorials. We then get 


hW (N)=X; {(NitaZ,) Inl(Zi/N 1) +e] 

‘ —aZ; In(Z./N;)}, (D1.24) 
where 8 
(D1.25) 


and where we have omitted a term ZN;:(=N) in the 
Bo case, which is unimportant for our discussion. From 
Eq. (D1.24) it can be seen that in the limit where 
N;/Z0, that is, for highly diluted systems, or systems 


apo=0, op-e=1, arpv=—l, 


78 For the importance of the omission of the factor N ! see also 
Gibbs’ monograph (02G, Chap. XV) and a paper by Nord- 
Acin (24N) - 
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at very high temperatures, the three expressions become 
the same. 

We find the equilibrium distribution by maximizing 
InIV (NV) under the conditions of given total energy and 
given total number of particles, 


DNVi=N, YN L=E, 
and the result is [compare the discussion in Sec. A (3) ], 
Inf ZrtaN)/NiJ=—e+BE,  (D1.27) 


leading to the well-known expressions for the Boltz- 
mann, Fermi-Dirac, and Bose-Einstein distributions. 
We now must prove the approach to equilibrium.” 
Using the relation H=—S/k and the Boltzmann- 
Planck relation S=% InIV, we have for our H-function 


H= DA: In (Z/N; 
— (Nitaz) ME (ZiHaN:)/N:]). 


Let Nizvy be the number of transitions per unit time 
where the groups Z; and Z; each lose one particle and 
the groups Zy and Zy each gain one particle. Such 
transitions will be possible only if energy is conserved 
or if 


(D1.26) 


(D1.28) 


Et E;=Ev+Ey. (D1.29) 


Tt will be shown in Appendix V that the Nij vy satisfy 
the equations 


Niz j= A ij; vp NN (Ze taN y) (Zj+aN y), (D 1 .30) 


while we shall assume [compare Eq. (D1.05)] that 
(D1.31) 


A iji vy=A Vist 


We get now, in a manner completely analogous to 
the one leading to Eq. (D1.14),® 


dH /dt=>;; (aN ;/dt) LN i/ (ZitaN:)] 
= 4 Dish ij! A EN:N y (Zi+aN:) (Z;+aN;) 
NN; (Zut aN i) (Zi +N j)] 
x MENN; (Zy +aN i) (ZjpptaN j)/ 
NoN (Zita) (Z;taN j) <0. 


The equal sign only holds when Eq. (D1.27) is satisfied. 

Once again we have arrived at an equation showing 
the steady decrease of H until equilibrium has been 
reached. Both Eq. (D1.32) and Eq. (D1.14) are derived 
by using a Stosszahlansalz and by considering transition 
probabilities. The discussion is thus on the same level 
as that of Sec. A(3) and we could also, in the present, 
case consider the frequency with which we can expect 
fluctuations to occur. Just as the use of Eq. (A3.01) 
for the probability of the occurrence of a situation was 
based on the choice of equal a priori probabilities for 


(D1.32) 


We follow here the considerations of Pauli (28P); Compare 
also the considerations by Nordheim (28N). Nordheim is especially 
Bea in the distribution of electrons in metals (compare 


80 ; : bane 
Oa pte that Zi+aN; is never negative, since in the 
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equal volumgs in phase Space, so the use of Eqs. (11.21) 
to (D1.23) is based on the assumption of equal a priori 
probabilities for all nondegenerate quantum states as 
can be seen from the derivation of these equations, This 
assumption will be discussed in Sec. D (3) when we also 
shall discuss how far Eq. (D1.31) can be justified from 
first principles.* 


o 
(2) H-Theorem in Quantum Ensemble Theory®? 


The considerations of the preceding section corre- 
sponded to the considerations of Sec. A (3) in the classi- 
cal case. We shall now be concerned with the quantum- 
mechanical analogy to Part C. First of all, in the present 
section we shall consider the fine-grained density. We 
shall then introduce a coarse-grained density and con- 
sider again a quantity X which is related to the 5 of 
Sec. C(1), and we draw attention to a few points where 
slight differences from the classical case occur. After 
that we shall briefly consider the influence of measure- 
ments.” The question of representative ensembles and 
of @ priori probabilities is deferred until Sec. D (3). 

It is well known that the so-called density matrix 9% 
introduced by yon Neumann (27N1; see also 27N2, 
29D, 30D1, 30D2, 31D, 32N3, 33P, 35D, 36D, 37K. 
40H, 54T, ESM Chapter VII) replaces in quantum 
statistics the role of the ensemble density p in classical 
statistics. This density matrix also plays an important 
part in many quantum-mechanical discussions, but this 
is not the place to discuss the use of the density matrix 
for anything but statistical purposes. 

The (fine-grained) density matrix o is introduced as 
follows. Let y* be the normalized wave function describ- 
ing the kth system of the ensemble and let ø, be a 
complete set of orthonormal functions in the Hilbert 
space of the wave functions. We can then expand 
the ¥* as follows: 


V=Dn anton, (D2.01) 


at= f oxtVdr, 


where a star indicates the conjugate complex and where 
dr is a volume element of the coordinate space.% 

We can use the a,* instead of the ¥* to describe the 
kth system. The two representations are equivalent 


where 


(D2.02) 


51 Recently Stueckelberg and co-workers (52S2, 541) have based 
the decrease of H on the unitarity of the so-called S-matrix. 

€ Compare 38T, Chap. XII, part A, which we follow to a large 
extent in our discussion. 3 z 

8 For a thorough discussion of this subject we refer to von 
Neumann’s book (32N3) and a recent paper by Groenewold 
(46G2). = 

§ We denote all matrices by bold face characters. 4 

55 For the sake of simplicity we shall assume that the ¢, form 
a discrete set so that in Eq. (D2.01) we are dealing with a sum. 
It is not difficult to extend the formulas to the case where the 
sum has to be replaced by an integral (a Stieltjes integral, if 
necessary) but we leave this to the reader. 

86 We restrict ourselves in our formulas to the case where the 
wave functions are scalars. It is not difficult to extend our formulas 
to the case where they are spinors or uñdors (39B1). 


and, while the y* satisfy the Schrédinger equation 
Hy = ihht, (2.03) 
the a,” satisfy the transformed Schrödinger equation 
idn =Z m H amim", (D2.04) 


where the Hmn are the matrix elements of the Hamil- 
tonian operator H, 


Hm= f ont Wend. (D2.05) 


The physical significance of the a,* is that they are 
probability amplitudes, {a,*|? being the probability 
that the kth system is characterized by the function gn. 
From the normalization of the y+ and the fact that the 


¢n form a complete orthonormal set, it follows that 
En |an*|?= 1. (12.06) 


We can now introduce the density operator p or 
density matrix by defining it through its elements, 


1 y 
fie I Om On (D2.07) 
T k=l 


where N is the number of systems in the ensemble. 

The average value (G) of a physical quantity G, 
which now corresponds to an operator G, is given by 
the equation 


CENT. f y” Gyr, (D2.08) 


which is twice an average, firstly an average over the 
wave function, and secondly an average over the en- 
semble. 

Introducing the an, we get’ 


OS Ne Ly ars eG 
=Lin.nPmnGum=Tr(oG). (D2.09) 


An example of Eq. (12.09) is the normalization of 0. 
Putting G=1, we have 


f= ‘Trp! (D2.10) 


an equation which also follows directly from Eq. 
(D2.06). 

The index of probability n is again related to p by 
the equation 


(DZ11) 


where the right-hand side is an abbreviated notation for 
the infinite exponential power series: 


expn=}_ (n !) n”. (D2.12) 
At this point we may draw attention to the fact that 


the introduction of grand ensembles in quantum 
statistics does not involve any extra complications. 


9=expn, 


$ TrA is a shorthand notation for, the trace, or , of an 
operator, that is, the sum òf its diagonal! elements: TrA= Dn, Anam 
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316 D. TER 
We can assume that second quantization has ‘taken 
place? and that thus all wave functions are expressed 
as products of Jordan-Klein and Jordan-Wigner ma- 
trices. ‘The formalism remains unchanged. 

We now introduce a quantity o by the equation 


a=(n), (D2.13) 
or, using Eqs. (12.11) and (D2.09) 
o=Tr(o Ino). (D2.14) 


As in Sec. C(1) we can show that g possesses a number 
of minimum properties (compare 36D). In order to 
show this, we first choose as our complete orthonormal 
set yn a complete set of characteristic functions of the 
Hamiltonian operator and of the number operator.” 
We shall indicate the state corresponding to v particles 
while the energy of the system is Ex() by a double 
index k, v and matrix elements between two states, say 
k, vı and 1, va by (&,1; !,v2) for instance H(k,v1; 1,2). 

We have now the following properties of o. 


(a) If our ensemble is such that all systems have the 
same number of particles V and that the energy of all 
systems lies within a given interval Æ, E+ôE, o will 
be minimum if ọ is given by the equation 


p(k,v15 lv) = CO, 15» NOvgN 

for Eemeren] (D2.15) 
p(k,131,v2)=0 otherwise, 
where ôx is the Kronecker symbol and where c is a 
constant. 
(b) If our ensemble is such that all systems have the 
same number of particles V but that only the average 
energy is given, that is, that ọ must satisfy the condition 


Tr(oH)=£, (D2.16) 
g will be minimum, if ọ satisfies the equation 
p(k,13 1,v2) = dxdiNdven exp{6[Y—Ex.(N)]}. (D2.17) 


(c) If only the average number of particles and the 
average energy of the systems in the ensemble are given 
so that ọ must satisfy Eq. (D2.16) and the condition 


Tr(ov)=N, (D2.18) 


where v is the number operator, o will be minimum, if 
g satisfies the equation” 


p(hyvs; 1,v2) = òrri» expl —gt+vin—BEx(r1)]. (D2.19) 


We shall prove only the last statement since the 
proofs of the two other statements are completely 


88 For an account of the method of second quantization (28J; 
see also 27J2, 34F) we refer to Kramers’ monograph (38K Sec. 72). 

® Once again we shall restrict ourselves to the case where only 
one kind of particle is present. In the present discussion it would, 
however, be simple to extend the discussion to the case of a num- 
ber of different kinds of particles. 

% We shall assume for the sake of simplicity that all energy 
levels are nondegenerate. 

s Tr this case it is really unnecessary to make a particu- 
lar choice for’ the gn. In general Eq. (1D2.19) would read 


@=expl—g+vu— 6H]. 
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analogous. As in Sec. C(1), the densit*es given by 
Eqs. (D2.15), (D2.17), and (D2.19) correspond to our 
energy-shell ensemble, a macrocanonical ensemble, and 
a canonical grand ensemble and £, Y, and y have here 
the same meaning as there. 

The proof of (c) is as follows (compare 36D Sec. 8): 
Compare two density matrices 9; and 92, where 9; is 
given by Eq. (D2.19) and 92 by the equatidn 


o2= 01 exp(An).’ (D2.20) 
The quantity o2—01 is given by the equation 
oo—91= Tr (n: expnz— n expny). (D2.21) 


The right-hand side of Eq. (D2.21) can be written as 


Tr[(nz— nı) expne ]-+TrLm(o2— 01) | 
=Tr[ (no— m) expn:]+Tr[(— q+ vu — bH) (o2— 01) 
=Tr[ (no— n) expne ], 


where we have used the fact that both @: and @» satisfy 
Eqs. (D2.16), (D2.18), and The normalization condi- 
tion (D2.10), and where we have used Eqs. (D2.19) and 
(D2.11). We have thus 


o2—o1= Tr (no— nı) expne ]. (D2.22) 


The right-hand side of Eq. (D2.22) is positive, except 
when n»=n,. This can be seen as follows. We first of 
all note that for any Hermitean operator A the following 
inequality holds (compare 38P): 


[expA ]i:2 expA rk, (D2.23) 


the equal sign only holding when A is on diagonal form. 


One proves Eq. (D2.23) by considering a unitary transforma- 
tion U which brings A onto diagonal form with diagonal elements 
(characteristic values) Ax. We then have 


Ape= Z| UnA, Dıl Unul =1, (D2.24) 


and 
D(A) Jex= Zil Unal f(A) 

=D| Un AS (A) H AA f (Are) 

+3 (Ar —A pa) f” (A rrtonCA— A ax ))} O SOx S1) 

=f (A) H42 U n|? (Ar ArT", (D2.25) 
where we have used Eqs. (D2.24). If f(A) is the exponential 
function, Eq. (D2.23) follows. 

We now choose a representation in which nz is on 

diagonal form. Since from Eq. (D2.10) it follows that 


Tr(expn:)=Tr(expne) (= 1), (D2.26) 
we get 


Tri (m2—11) expn2 | 
=Tr[ne—n1) expno—™ expne-+texpny | 
=) [ (nai— mii) expnes— expne:-+ (expni) i] 
> SIL (nzi — nii) €XPn2i— expyaitexpniii] 
=}, (expmi) Enzi =ni 1) exp (nzi— niii) t 1] 20, ; 


where we have first used Eq. (D2.23) and then used the 
properties of the function y given by Eq. (C1.13). 


CE eS See 


the 
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Once again, however, ø will not change with time. 
This follows from the fact that we can transform from 
one instant (’) to another (/’) by a unitary trans- 
formation. This means that, if 9’ and o” denote the 
density matrices at // and t”, they are related by the 
equation 

0” =Uto'U, (D2.27) 
Py . . 
where Ut is the Hermitean conjugate of U and where 


“UU=UUt=1, (D2.28) 


Any average value (G) is invariant against a unitary 


transformation. This follows since Tr(AB)=Tr(BA), 
and thus 


Tr(o”G”)= Tr(Uto'UUtG'U) = Tr(Uto’G’U) 
‘=Tr(G’UUio’) = Tr(G’p')= Tr(0/G’).  (D2.29) 


Since ø is the average value of n we see that o'=0" as 
in the classical case. 

We shall now introduce as in Sec. C(1) a coarse- 
grained density, or in this case a coarse-grained density 
matrix P. We choose now as our complete orthonormal 
set of functions the characteristic functions of the 
Hamiltonian H of the system. Furthermore, we shall 
divide the stationary states into groups, as before, but 
now we shall assume the grouping to be in such a way 
as to correspond to the inaccuracy of our observation, 
this is, we shall assume that by means of our available 
observational methods we can discriminate between the 
different groups, but not within them.” Let S; be the 
number of levels in the ith group. We now define the 
coarse-grained density matrix P by its matrix elements 
in the particular representation which we are using at 
the moment, as follows: 


Prr= bri 25 23;/Si, (D2.30) 


where the energy level Æ; belongs to the ith group and 
where the summation is taken over all the states of the 
ith group. Since pj; is the probability of finding a 
system of the ensemble in the state characterized by 
gi, we see that S;P.x is the probability of finding a 
system of the ensemble in a state belonging to the ith 
group. Equation (D2.30) defines the coarse-grained 
density in a particular representation; its matrix ele- 
ments in any other representation follow by the usual 
transformation rules.” 

From Eqs. (D2.30) and (D2.10) it follows that P is 
also normalized, 


TrP=1. (D2.31) 
We now define a quantity = by the equation 
y=Tr(P InP) = >>. Pex Pkr, (D2.32) 


as long as we stay in the representation which we chose 


i i in S 
® There is a difference between the grouping here and in Sec. 
D(1). E we grouped the energy levels of the constituent 
particles but now we group the energy levels of the whole system. 
8 For a discussion of the choice of the coarse-grained density 
matrix see also a recent paper by van Kampen (54K). 


originally. Using Eq. (D2.30) we can also write 
=P pr InP Trp InP)=(nP}. (2.33) 


There are several ways of studying the behavior of £ 
in time. Possibly the crudest way is the one used by Born 
and Green (48B1, 48B2, 49B1, 49B2, 49B3) who, in 
our opinion, do not properly distinguish between fine- 
grained and coarse-grained densities and, moreover, 
do not take into account the fact that coarse-grained 
densities are introduced as a nec essary step to keep our 
formalism in accordance with our experimental possi- 
bilities. In their work they define as the entropy a 
quantity which is not proportional to either —ø or ey, 
but to a hybrid of the two, which we shall call —o¥, 
The quantity o£ can be obtained from $ by putting 
all S; equal to one, or from o by assuming o to be 
always diagonalized, 


oZ => per Inpes. (12.34) 


We shall see later on that cS and & are sometimes the 
same, namely, for an ensemble representing a system on 
which a measurement has just been made, 

The second step in their discussion consists in ob- 
serving that one is usually interested in systems which 
are in contact with their surroundings so that the 
Hamiltonian of the system consists of two parts, 


H=H,+V, (D2.35) 


where Ho is the Hamiltonian of the system if inter- 
action with the outside world is neglected, and V the 
interaction energy operator. It is a well-known result 
from quantum-mechanical perturbation theory that 
transitions between characteristic states of Hy can occur 
under the influence of V. Choose now for the », the 
characteristic functions of Ho and of any other operators 
which commute with Hy. For reasons which will become 
clear presently [compare Eqs. (D2.37) and (D2.41)], 
we are especially interested in the case where the energy 
characteristic values are degenerate. To indicate this 
degeneracy we shall give two indices to the Eni Ci, 
where the first index indicates the energy value Es, and 
the second index distinguishes between the various 
states belonging to the same E£;. 

Since we are only dealing with the diagonal elements 
of 9 which are, as we saw a moment ago, the probabili- 
ties of finding a system of the ensemble in å particular 
state, we can use the quantum-mechanical formulas for 
the time dependence of these probabilities, and we find 


p(kl,kl; i)= p(kl,kl; to) +Z vv J (RLKN) 
X (e(RU RU ; to)—p(kl,kl; to)}, (D2.36) 
where 
J (kl kU) = ôr (47° h)] V (RLEV)|?(t—t). (D2.37) 
In Eq. (D2.37) the ôr» shows that J is zero unless 
energy is conserved, that is, transitions only take place 


moca 


* Compare the discussion in Appendix V. 
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between states of the same energy, i is Planck’s con- 
stant, and V (kk) is the matrix element of V which 
satisfies the equation 


V (ALR!) = V* (RRL), 


because V is a Hermitean matrix. By virtue of Eq. 
(12.38) we have the relation 


S(RLRU) =F (RR). (D2.39) 


Substituting from Eq. (D2.36) into Eq. (D2.34), we 
get up to the first order in J® 


oX (1) =oE (lo) — 3 Dar rv (kl, k'V)O(kl, kV) (D2.40) 
with 


Q(RL RT) =[p (R11; to) —p(h'U,R' 5 to) ] 
XIn[o(kl,kl; to)/p(kU,k'U’ ; to) ]. 


As Q is never negative and J always positive and, 
moreover, a monotonically increasing function of 4, 
we have in oX a monotonically decreasing function. 

One can object to this argument on the grounds that 
Eq. (D2.37) is only valid for small values of !—1o, 
but Green has shown that the argument can be changed 
so as to be valid for all values of ¿—to. The most im- 
portant objection is, however, against the use of the 
function —oD as a measure for the entropy. As was 
pointed out by Pauli (49P) as soon as this choice is 
made everything else follows as can be seen from the 
work of Klein (31K) which we discuss in Appendix VI 
(see also later in this section). 

A second method of discussing the behavior of X the 
one due to Pauli (28P). Let P; be the probability of 
finding a system of the ensemble in the ith group. 
We then have the following relations; 


P3=S;Pix, 


(D2.38) 


(D2.41) 


(D2.42) 


where 14; is one of the S; equal diagonal elements be- 
longing to the ith group. From Eq. (D2.32) we then 
have in erms of the P; 


2=X; (P/S) In(P:/S,) =D P; n(Pi/S;), (D2.43) 


where in the second member the first summation is 
over all groups and the second summation over the 
S; members of the group. 

The time.derivation of X is now given by the equation 


, d2/dt=’.(InP;—InS;) (dP;/dl), (D2.44) 


where we used the fact that X P;=1, or Z dP;/dt=0 as 
follows from Eqs. (D2.31) and (D2.42). 

Let Vi; be the average transition probability from 
group S; to group S;. In Appendix V we shall prove that 


%5 We use here Eq. (D2.39) (i) to show that the sum 
ZI (Rl k'l Eo (RV, kV ; t)—p(kl,kl; t)] 
is zero, and (ii) to obtain the sum on the right-hand side of Eq. 


(D2.40) [compare the derivation of Eq. (D1.14); hence the 


factor $ e > i too) ; 
6 Compare the discussion of Eq. (D230). 


TER 


HAAR 
Naj is given by the equation a 
Nij= Aisi Pa (D2.45) 
and we shall assume 
Aij Aji (D2.46) 


We then get 
dP/d=0; (Ne—Nig=L; A wLSiPj;— SP], (D2.47) 


and in much the same manner as we derived Eq. 
(D2.40) we get now 


d®/dt=} Za; AisLPjS — P:S] In(P:S;/P;S:). (D2.48) 


Once again it follows from the form of the right-hand 
side of equation (D2.48) that d2/dt will never be positive. 

We may at this point draw attention to the fact that 
d=/dt will only be zero if ` 


P;Si=P:S; or, P;/Sj=P;/Sa, (D249) 
or, when use is made of Eqs. (D2.45) and (D2.46), 
Nis=N in (D2.50) 


and we see that at equilibrium there are as many 
transitions from group S; to group S; as the other way 
round. This is an example of the so-called principle of 
detailed balancing which we shall discuss in more detail 
in Appendix VII. : 

We shall return presently to the discussion of this 
second method of considering the variation with time of 
X, but we want first to discuss a third method which is 
completely analogous to the consideration of the varia- 
tion with time of X in Sec. C(1).97 

Let us assume that we have made observations on our 
system at ¢’ which will give us the coarse-grained den- 
sity P. The fine-grained density ọ at ¢’ will then be 
given by the equation [compare the discussion in 


Sec. D(3) ]. 


=P’, (D2.51) 
while we have for X’ 
=!=Tr(P’ InP’) =Tr (9 Ino’). (D2.52) 


If the situation at #’ did not correspond to an equi- 
librium situation so that 9’ (or P’) did not satisfy one 
of the Eqs. (D2.15), (D2.17), or (D2.19), we cannot 
expect Eq. (D2.51) to hold at a later time ¢’”, but we 
would have 


0’ AP”, (D2.53) 
and as a result Sig 
DULD (D2.54) 


Let us consider in slightly more detail how Eq. 
(D2.54) follows from Eq. (D2.53).°8, We are concerned 
with the expression 
2/—3"=Tr(P! ee’) —Tr(P” InP”) 

=Tr(0’ Ino’) —Tr(o0” InP”) 
=} k prr Inpre’— on pnan INPan”’, (D2.55) 


_“T should like to express my thanks to Professor O. Klein for 2 
discussion of some points connected with this method of approach. 


?8 The discussion on p. 373 of ESM is too short and rather 


misleading. 
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where we have used the fact that the coarse-grained 
density is always a diagonal matrix [see Eq. (D2.30)]. 
We add now to the right-hand side of equation (D2.55) 
first of all the expression 


D se Mayon E yey! Inps’, 


which by Klein’s lemma (31K) is never positive and 
only equal to zero, if o” is a diagonal matrix as is 
proved in Appendix VI. We also add to the right-hand 
side of Eq. (D2.55) the expression Tr P”—Tr o” which 
is zero since both o” and P” are normalized. The final 
result is 


D-2" =; (oyna Monn” — prn” DET 
= pan Pan jie On (DSA 
the last inequality following from the properties of the 


function y given by Eq. (C1.13), since substituting 
«=Ina—Ing and multiplying by £ gives for B>0 


aln a~a InB—a+e>0, a#ß; 
ù =0, a=ß. 


(D2.56) 


(D2.58) 


The fact that we expect X to continue to decrease is 
based on considerations similar to those discussed in 
Sec.. C (1). This decrease will continue until a stationary 
state is reached where P is given by one of the three 
Eqs. (D2.15), (D2.17), or (D2.19), and where o=P. 

We must now discuss the different ways of consider- 
ing the time dependence of ©. As we mentioned before, 
we are not convinced of the validity of Born and 
Green’s method of approach and we shall not discuss 
it any further at this point. The second method of 
approach is the one used by Pauli. This method is 
actually closely connected with the simple approach of 
Sec. D(1). This we can see as follows. We write X of 
Eq. (D2.43) in the form 


YET IF: InP;->d; Ie InS;. (D2.59) 


Now, P, is the probability of finding a system in one 
of the states corresponding to the ith group, and S; 
is the number of states in this group. The number of 
states in a group is, however, just equal to the function 
W(N;,) discussed in Sec. D(1) as will be seen after a 
moment’s thought, and using the relation between H 
and InW which led to Eq. (D1.28) we can write Eq. 
(D2.59) in the form 


Y=: PA+D: Pi lnP:;=(H)+2,P; InP, (D2.60) 


where H; is the value of H if the system is in a state 
belonging to the ith group,” and (H) the average of H 
over the ensemble. x f 

Using Eq. (D2.60) we see first ofeall that if we are 
dealing with an ensemble representing a system which 
by measurement has just been found to be in one par- 
ticular state so that one of the P; is equal to one and 


9 We may remark here that we could have derived an equation 
similar to Eq. (D2.60) in the classicaj case. The discussion would 
have been analogous to the present discussion. 


the others equal to zero, this equation would reduce to 
the form 


LH - (D2.61) 


Secondly, taking the time derivation of Eq. (12.60), 
we get 


d3/dt= (dH, ‘dy +>.. (dP; ‘dt) InP,, (1)2.62) 


and we see that the decrease of X is due to two facts 
(i) the general decrease in H for each system of the 
ensemble, and (ii) the tendency to decrease XP, InP, 
by a more uniform distribution of the systems in the 
ensemble over the different groups of states, 

Let us now consider Eq. (D2.57). We saw that 3 
decreased firstly because of the fact that o and P were 
no longer the same, a cause for the decrease of X which 
also operated in the classical case, and secondly because 
the expression Spss Inps decreased (Klein's lemma). 
For the second cause there does not exist a classical 
counterpart and we must discuss it briefly. It is called 
by Tolman (38T) the guantum-mechanical Spreading of 
fine-grained probability. There is no classical counter- 
part since the quantity Ep: Inpex is not a trace of a 
quantum-mechanical operator. The more states are 
occupied the lower the value of this sum becomes. 
This decrease is associated with the irreversible dis- 
turbance due to observations on the system. The 
effect will thus be most pronounced in the case where 
observations are such as to approach the limits set by 
the Heisenberg relation, but will be unimportant in 
those cases where the disturbance of the system by 
the observation may be neglected. In the latter case 
the growing difference between » and P will be re- 
sponsible for the decrease of 3. 

It is of interest to note that some authors (36D, 37S) 
only consider this influence of measurements on the 
system without properly paying attention to the role 
of fine-grained and coarse-grained densities and to the 
fact that the approach to equilibrium should be inde- 
pendent of whether or not the system is, actually 
observed! 

In the next section we shall discuss how one can set 
up representative ensembles in quantum statistics, but 
some aspects of the consequences of observations on a 
physical system we wish to discuss briefly now, es- 
pecially the importance of the pure case. 

In classical statistics we introduce ensembles since 
our knowledge about the physical systems under con- 
sideration is practically always far removed from the 
maximum attainable knowiedge. In quantum statistics 
the situation is, however, complicated due to the 
statistical aspects inherent in quantum mechanics. 
In the ideal (and always idealized) case we may, in 


classical statistics, know the values of all constants of _ 


motion so that the problem of the behavior of the sys- 


tem reduces to a problem of classical mechanics. In the _ 


1% Compare, for instance, the discussion by Bohr (33B1, 33B2). 
* Compare the criticism raised by Pauli and Fierz 372), We 
also may refer to a paper by Davydov (47D). 
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ideal (and always idealized) case in quantum statistics, 
we may know that one system is in a characteristic 
state of a quantum-mechaniéal operator for which for 
the sake of simplicity we shall choose the energy 
operator’ so that we are dealing with the system 
in a stationary state. We are dealing with a pure case 
(Reiner Fall) and all the systems in the representative 
ensemble will possess the same wave function, namely, 
the one corresponding to the characteristics state in 
question. Only one of the two averaging processes 
described by Eq. (D2.08) remains, namely, the quan- 
tum-mechanical average. We still must use statistical 
considerations, although our knowledge is maximal. 

Jn general, however, the situation will not be so 
favorable and the construction of the representative 
ensemble is much more complicated. Both averages in 
Bq. (D2.08) must be taken into account. We are dealing 
with a mixed case (Gemisch). 

In the pure case, if the state of the system is charac- 
terized by one of the functions forming the complete 
orthonormal set gn, say p, the y* are all equal to g, 
and we have 


an = ðn, (D2.63) 
and hence 
Pmn = Ômnôm1: (D2.64) 
In this case we have for o 
o=Tr(oọ lnọ)=0. (D2.65) 


In a mixed case, there are more than one pmn which 
are different from zero and ø is smaller than zero.!% 
This is probably the reason by Elsasser (37E) calls o 
the mixture index. The decrease of o when we go from 
a pure case to a mixed case once again shows the rela- 
tion between entropy—which is proportional to —o— 
and lack of information as the decrease of detailed 
information involves the decrease of ø (compare also 
the discussion in 39K1, 39K2). 

In the next section we shall see that the representa- 
tive ensemble of a subsystem which is part of a large 
isolated system will be a canonical grand ensemble. This 
means that while the larger system may be represented 
by a pure case, the subsystems will certainly correspond 
to a mixture. This fact has been the source of a long 
discussion started off by a paper by Einstein, Podolski, 
and Rosen (35E) in which the paradox which bears 
their names was first stated. We have not got the space 
to discuss this paradox and refer to the literature (35B, 
3581, 3552, 3553, 3554, 36M, 36F, 36S, 51B3; compare 
also 27N1). i 


402 Tt is easy to extend the discussion to include the case where 
the system is in a characteristic state of two or more commuting 
operators. 

103 Equation (D2.65) is a necessary and sufficient condition for 
a pure case (33P) and is equivalent to the usual condition o?=9. 

104 Tn his paper Elsasser discusses how one can use measurements 
to set up representative ensembles. Instead of making assumptions 
about a- priori probabilities Elsasser chooses his ensembles in 
such a way that they make ø minimum. In this way he, of course, 

arrives at the same results as we do, in the more conventional way, 
since the normal method leads to minimum values of ø. 
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(3) Representative Ensembles 


Once again the situation is very similar to the situa- 
tion in classical statistics. We have only incomplete 
data from which to predict the future behavior of our 
system and we must have recourse to representative 
ensembles. Once again the question of a priori prob- 
abilities crops up, and as we really need thea,"’s, also 
the question of the a priori phases, as we shall see 
presently. We use the assumption of equal a priori 
probabilities for all nondegenerate states!" and the 
assumption of random a priori phases, where we shall 
see presently what we mean by the term random in this 
connection. Our discussion will now proceed analogous 
to the discussion in Sec. C(2). We assume again that 
our system is part of a larger, isolated system and 
instead of Eqs. (C2.01) to (C2.04) we have now 


Tro=i, (D3.01) 
Tr o'=1, (D3.02) 
Tr(Ho)+Tr(H’o’) =constant, (D3.03) 
Tr(vo)+Tr(v’o’) = constant, (D3.04) 


where the unprimed quantities refer to the subsystem 
and the primed quantities to the rest system. If equi- 
librium has been attained X should be a minimum, or, 


TrL(e0’) In(99’) |=minimum. (D3.05) 


Using the method of undetermined multipliers, one then 
finds (see, e.g., 40H for a discussion!) for the equi- 
librium expression for 


o=exp[—g+ vu—pH], 


in accordance with what we discussed in the previous 
section. As in Sec. C(2), Eq. (D3.06) really only holds 
for the coarse-grained density P as Eq. (D3.05) only 
holds for 2 and not for ø. 

The situation is also completely analogous to the 
classical case as far as nonequilibrium situations are 
concerned and we refer to the discussion of Sec. C(2) 
which can be immediately applied to the present case. 

We wish to make one more remark about the setting 
up of a representative ensemble after an observation 
has been made on a system. We can only distinguish 
between states belonging to different groups as we 
discussed before and we can thus only determine the 
coarse-grained density, or, the P+. In accordance with 
the assumption of equal a priori probabilities for all 
nondegenerate states we should then put all diagonal 
elements of ọ corrésponding to one group equal to each 
other and equal to the corresponding Px,. As far as the 
off-diagonal elements of o are concerned, since these are 


(D3.06) 


; P If a state is g-fold degenerate, it is counted as g nondegenerate” 
states. 


106 One varies both the characteristic values and the charac- 
teristic functions of ọ in this case. 
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the averagé values of the product of two a,* we have 


1 
Pan= yen women 


1 
= yen expli(dm’—dn*)], (13.07) 


where 7,,* is the modulus of a,* and d»* its phase. 

Using the assumption of random a priori phases, we 
may expect the average of expli(¢n'—@n*) ] to be zero, 
and we have thus for the fine-grained density of our 
ensemble 


(13.08) 


and we see that, indeed, as we remarked when discussing 
Eqs. (D2.34), c, X, and oX are all three equal. 

We come now to a discussion of our choice of a priori 
probabilities and a priori phases. When we discussed 
the analogous situation in classical statistics we showed 
first of all that the a priori weights should be such as to 
be invariant against the movement of the representative 
point along its orbit. We thus showed that in the case 
of quasi-ergodic systems, the a priori weights would be 
functions of the energy only and could thus be taken 
to be equal for equal volumes in phase space. The 
situation in quantum statistics is both simplified and 
complicated as compared with the situation in classical 
statistics. We shall first show that an ensemble con- 
structed without any conditions according to the pre- 
scription of equal a priori probabilities and random 
a priori phases for different nondegenerate quantum 
states will be a stationary ensemble and will, more- 
over, correspond to a similar ensemble if we change our 
representation. Secondly we shall briefly discuss in this 
connection the importance of the so-called adiabatic 
invariance of the weights of the different quantum 
states. We shall then briefly discuss the relation be- 
tween our choice of a priori probabilities and the condi- 
tion of nondegeneracy of energy states which is the 
ergodic condition (see Part E). Finally we shall show 
how in the classical limit, the condition of equal a priori 
probabilities and random a priori phases from different 
nondegenerate states leads to the condition of equal 
a priori probabilities for equal volumes in phase space." 

Consider the so-called uniform ensemble whose 
density matrix is given by the equation 


pri = Pkrôku, 


prı=pôrkn, Or, ọ=pol, (D3.09) 


where 1 is the unit matrix. Equation (D3.09) will be 
satisfied by ọ independent of the chgice of the complete 
orthonormal system, as we can easily show by perform- 
ing a unitary transformation U, 


o= UtoU= UtpolU=p.Ut1U= pUtU = pol. (D3.10) 


107 We refer to a paper by Uhlenbeck (35U) for a discussion of 
some other aspects of the transitiqa from quantum to classical 
statistics. 
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The uniform ensemble can be considered to have been 
obtained by applying the prescription of equal æ priori 
probabilities and random a priori phases for different 
states, since we have from Eq. (12.07) 
Pmn= (aman an 


(D3.11) 


where the average is taken over all the systems in the 
ensemble, Putting 


Om = 1m EXP (1m), (D3.12) 


where 7» is the modulus and a», the phase of dm, we 
have from Eq. (D3.11) 


Pmn= {TmF'n CXPLi(ctm—Cn) }} ave (D3.13) 
Equal a priori probabilities means 
rm= independent of m= y/ po, (D3.14) 
so that Eq. (D3.13) reduces to 
Pmn= pol expLi(am—ana) ]}an (D3.15) 


which because of the random distribution of the phases 
reduces to Eq. (D3.09), 

The uniform ensemble is also a stationary ensemble. 
This follows immediately from Eq. (D2.27) [compare 
Eq. (D3.10) J. 

We see here that the condition of equal a priori 
probabilities and random a priori phases for different 
states is invariant both against the passage of time and 
against transformations from one representation to 
another. The assumption of random a priori phases is 
necessary in order to assure the establishment of these 
invariances, as the density matrix given by the equation 


Pit = per HA (1 — ôr) (D3.16) 
would satisfy the requirement of equal a priori prob- 
ability but would not be a stationary density or a 
density invariant against a unitary transformation. 

We may remark here that Dirac (29D ; see also 27N1) 
singles out the uniform ensemble since it remains in- 
variant against arbitrary slight perturbations of the 
Hamiltonian and thus gives the a priori distribution of 
quantum statistics. He also draws attention to the fact 
that it leads—as we just saw—to equal a priori prob- 
abilities for all discrete states. 

The equality of a priori weights for nondegenerate 
states follows also from the so-called adiabatic principle 
to which Ehrenfest’s name is usually attached. We 
cannot enter into a detailed discussion of this very 
important subject and must refer to the literature! 
(06E1, 11E1, 14E, 16E, 17B1, 17B2, 17B3, 17B4, 17E1, 
18B, 23B2, 27U, and especially 23E1, where further ret- 
erences can be found). This principle states that if the 
external parameters are varied adiabatically, the rela- 
tive weights of the different states cannot change. 
From this it follows directly that in the case of multiply 


198 Some of Ehrenfest’s papers alse discuss the importance of thi 
principle for the choice of the a priori weights in classical ae 
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periodic systems each nondegenerate quantum state 
has the same a priori weight. 

We come now to the important question whether, 
as in the classical case, the choice of a priori probabili- 
ties is determined completely, if we are dealing with 
ergodic systems. We saw in Sec. C(3) that this was the 
case in classical statistics due to the fact that the other 
constants of motion take up their possible values with 
practically equal frequency jr different parts of phase 
space, or, as it is sometimes put, are practically con- 
stant over an energy surface. This latter property is 
also true for an ergodic system in quantum mechanics 
as we can see easily. If all energy states are nondegen- 
erate, the energy characteristic functions will form a 
complete orthonormal set. Since a constant of motion 
corresponds to an operator which commutes with the 
energy operator, it follows from quantum theory (e.g., 
38K Sec. 38) that the energy characteristic functions 
are also characteristic functions of all the operators 
corresponding to the integrals of motion. It then follows 
that in each energy state—which, because of the non- 
degeneracy corresponds to one and only one character- 
istic function—the integrals of motion have well- 
defined constant values. 

Unfortunately, however, this does not ensure that 
our choice of a priori probabilities and of a priori 
phases follows. As far as the a priori probabilities are 
concerned, we hardly need the ergodicity of the systems 
under consideration, since as we saw a moment ago, the 
adiabatic principle allows only those a priori weights 
which we have used throughout. The choice of a priori 
phases is, however, more difficult to justify from first 
principles—and that is what we are trying to do at the 
moment. 

Let us consider a moment the plausibility of the 
existence of ergodic systems. We know from quantum 
theory that degenerate levels are the rule rather than 
the exception when we are dealing with simple systems 
such as atoms and molecules, and it seems unlikely 
that the situation would be different when we come to 
complicated systems such as are considered in statistical 
mechanics. In this connection it is of interest to remind 
ourselves of the fact that it has not even been possible 
to prove that the ground state of any system is non- 
degenerate—a property which is important for a dis- 
cussion of the third law of thermodynamics, the so- 
called Nernst heat theorem. We must therefore also 
face the possibility that physical systems are not 
ergodic. Let us briefly consider in how far the ergodic 
theorem of Part E, that is, the equality of time and 
ensemble averages can still be saved. This equality is 
based on Eq. (E.26) which is proved in Appendix IV. 

From the discussion in that appendix we see that Eq. 
(E.26) can also be proved, if we may average over the 
phases on the right-hand side of Eq. (IV.07).¥° J 
would like to suggest that this averaging is in accordance 


eo 


10 Kor a discussion see 308, 51S, 16S, and ESM Appendix III. 
uo Compare also Eq. (E.08). 
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with the basic ideas of quantum mechanics. It is well 
known that in no physical quantities the (initial) 
phases of the wave functions play a role, and it js 
therefore tempting to suggest that each wave function 
be considered to be an average (or better a mixture) of 
a large number of wave functions, only differing in their 
phases, but otherwise being the same. This would not 
alter any results obtained from the normal wave me- 
chanics and would ensure the ergodic theorem as well 
as justifying the manner in which we set up our repre- 
sentative ensembles. We transfer in this way the as- 
sumption of random a priori phases from statistical 
mechanics to quantum mechanics itself. 

We conclude this section with a discussion of how our 
classical a priori probabilities are the limiting case of 
the assumption regarding a priori weights and a priori 
phases made in quantum statistics. The transition 
could easily be justified in the old (model) quantum 
mechanics, at any rate, as long as one was dealing with 
multiply periodic systems (compare 18B), since in the 
old quantum theory the energy levels were just ob- 
tained by dividing phase space into cells of volume %* 
(h: Planck’s constant; s: number of degrees of freedom) 
and assigning one level to each cell. 

In modern quantum mechanics, the situation is 
more complicated. One often finds the cryptic statement 
that the assignment of a volume /* in phase space for 
each state is connected with the Heisenberg relations 
(27H1) which are then written in the form 


Ap: AqZh, (D3.17) 


where p and q are a canonically conjugate momentum- 
coordinate pair, and where Ap and Aq are the ranges of 
accuracy with which they can be measured simul- 
taneously. It is not explained in these cases, why the 
volume would not be (//47)*, as the right-hand side of 
inequality (D3.17) in the most rigorous form should be 
h/4r and not h (30H1). 

However, one can probably justify the choice of a 
volume 4° in phase space for each state as follows 
(ESM p. 60; compare also 35D Sec. 37, 49M). In 
classical mechanics the phase of the particle is given by 
the representative point in phase space. This point can 
be considered to be the combination of a point in co- 
ordinate—or q-space—and a point in momentum—or 
p-space. In quantum mechanics we must, liowever, 
consider probability densities. The two probability 
densities in p- and in g-space are not independent. The 
probability density in g-space is determined by the 
wave function y and is |p|?, which is normalized, 


l fivltdo=1, 


where dq stands for a volume element in g-space. The 
probability density in -space is obtained by first 
transforming the wave function (q) in coordinate 
representation to the probability amplitude A(p) in 


(D3.18) 
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momentum, space. The probability density in p-space 
is then |A|*. It follows from transformation theory 
that A and y are each other’s Fourier transforms, if we 
consider q and p/h as the variables, "! and that thus 


fiAlap=m, 


We can now say that the total volume of phase space 
corresponding to one state of the system is obtained by 
multiplying the probability densities in p- and g-space 
and integrating over the whole of phase space, and we 
get from Eqs. (D3.18) and (D3.19) as the final result 
a volume /?, 

From the foregoing discussion we can conclude that 
in the limit of 4-0, indeed, equal volumes of phase 
space—which contain equal numbers of cells of volume 
h'—correspond to equal a priori probabilities, once we 
have accepted the equal a priori probabilities for all 
nondegenerate states. The necessity of random a priori 
phases in this connection is not so apparent. It follows, 
however, from the considerations of the uniform en- 
semble. We should like to have equal weights for non- 
degenerate states independent of the representation 
we are using, and this can only be ensured if we use the 
additional assumption of random a priori phases as we 
saw earlier in the present section. 


(D3.19) 


E. THE ERGODIC THEOREM IN QUANTUM 
STATISTICS 


We saw earlier that apart from the approach via the 
H-theorem one could justify the use of statistical 
methods by using the ergodic (or quasi-ergodic) 
theorem and proving the equivalence of equality of time 
and ensemble averages. The same choice of method of 
approach exists in quantum statistics, and we do now 
wish to consider the quantum-statistical ergodic 
theorem. In analogy with the old ergodic theorem 
[Sec. B(1)] one can assume the existence of ergodic 
systems. Such systems are defined as such systems 
where all states Æ, within a given energy range 


EX E,< E+6E (E.01) 


can be reached from all other states within the range— 
even if only via other intermediate states.12 

Let P, denote once again the probability of finding 
the system in the Ath state, and let wz be the transition 
probability for a transition from the kth to the lth 
state. We then have the following equation governing 
the temporal behavior of the Pi: 


dP,,/di=— ıı Prwat di Pw 
=} wi(Pi- Pr), 


111 Tt is at this point that Planck’s constant enters—one might 


(E.02) 


` say through the de Broglie relations, or through the commutation 


where we have used the fact that wr=w." The equi- 
libriurn solution of Eq. (E.02) which is attained ex- 
ponentially is 3 

(E.03) 


if n is the number of states in the range (E.01). From 
this it follows that the average time spent in any of the 
n States is the same and to calculate the time average of 
any quantity we can take the mean over the states of 
the range (E.01), or over a representative micro- 
canonical ensemble [compare the density for such an 
ensemble given by Eq. (D2.15)]. We have thus proved 
that for ergodic systems the two averages are the same. 
The problem remains whether or not ergodic systems 
exist. In the present case it seems very likely that this 
is the case. In a closed system radiation must have been 
included and this should make transitions from one 
state to another possible. 

There is, however, also in quantum mechanics a 
different method of approach which is analogous to 
Birkhoft’s or Hopf’s ergodic theorem. This method 
consists of proving directly the equivalence of time and 
ensemble averages. The first proof was given by von 
Neumann (29N) and it was later simplified by Pauli 
and Fierz (37P) and by Fierz (private com- 
munication).!!5 

As in the classical case we can distinguish two differ- 
ent ergodic theorems according to whether or not we 
allow a “coarsening” of the energy of the system. In the 
quantum-statistical case the second ergodic theorem is 
the more important one, but we shall first discuss 
briefly the first ergodic theorem and we shall follow 
largely the exposition given by Rosenfeld (52R). 

In the first case we are dealing with a pure case. Let 
¥(t) be the wave function describing the system under 
consideration. We choose now for our complete ortho- 
normal set ¢, the characteristic functions of the energy 
operator H and denote by En the corresponding char- 
acteristic values. We now write 


YO)=>, nEn, r 


P= constant = 1/7, 


(E.04) 
with 
(E.05) 


where 7, is the modulus and æ, the phase of a,. For 
y(i) we then have in the usual way 


¥()=LXrn explian) exp(—iEst/h)yn, (E.06) 


and the density matrix as a function of time is given 
by the equation . 


Panl) =f n expli (am—an)] 
Xexp[—i(En—En)t/h]. (E07) 


™3 Van Kampen (54K) has shown that Eq. (E.02) can also be 
derived under conditions where wa rw 

14 We may mention in this connection the recent paper by M. J. 
Klein (52K) who makes some explicit assumptions about the 
interactions allowing the transitions to take place. p 

4ST am greatly indebted to Professor M. Fierz for putting at 


an=Tn Cxp(ian), 


e : i i disposal his latest considerations which were discussed at the 
vhich also lead to the Heisenberg relation. my j 
relati oe 3 J, Sec. 2.2; 40M, p- 56; 46J, p. 152. Les Houches Summer School of 1933. = 
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The average value of any physical quantity A is the 
quantum-mechanical average given by the equation 


(A) = Daan Emn expli (@m—an)? 


—i(Em— En )t/h]Amn (E.08) 
where Amn is given by the equation 
= feter (£.09) 


We can now take the time average of (A) and we see 
that, if there is no degeneracy, that is, if no two char- 
acteristic energy values are the same, we have 


(A)= Don Pnnd nm (E.10) 


which is independent of the initial phases (the an). 

We see that (A) will only be independent of the 
initial conditions, in this case, the initial phases, if 
there is no degeneracy. The requirement of nondegen- 
eracy thus takes the place of the requirement of metrical 
transitivity in the classical case. 

That Eq. (E.10) gives (A) as a statistical average 
can be seen when we introduce the so-called projection 
operators R(¢). This operator is defined by the equation 


R(Ox=e | or, (E.11) 


which should be valid for any pair of functions ø and xy. 
The geometric meaning of the projection operator is 
simple, if we may use for a moment the language of the 
Hilbert space. The integral /y*xd7 then corresponds 
to the scalar product of the two vectors x and ọ in 
Hilbert space and the right-hand side of Eq. (E.11) 
gives thus the “projection” of x on ¢ if ¢ is normalized, 
and thus corresponds to a unit vector in Hilbert space." 

One can easily see that, in the case where the system 
is characterized by the wave function y, the operator 
R(y) is diagonal and its diagonal elements are equal 
to the diagonal elements of the density matrix so that 
we can write Eq. (E.10) in the form 


(A)=Tr(R()A). (E.12) 


From Eq. (E.07) it follows also that R(W) can be 
obtained by averaging the density matrix over the 
different values of the initial phases, or, 


p RW)= ow, (E.13) 


the subscript “Av” indicating here the averaging over 
the values of the an, and Eq. (E.12) or (E.10) can also 
be written in the form 


(A)=Tr(owA). (E.14) 
Loosely speaking we may interpret this result as 
meaning that the temporal change of the wave function 


u6 We use the notation R instead ef P to emphasize the relation 
with the density matrix. a z 


HAAR 


corresponds to a uniform distributing of the «œn over all 
possible values, corresponding to the classical picture 
of the trajectory completely covering the energy 
surface. 

If we are dealing with an isolated system of constant 
energy the y(0) is one of the øn and we are dealing with 
a pure case with a microcanonical density matrix. In 
the general case discussed at this moment time averag- 
ing has the effect of replacing the pure case by a mixture 
obtained by averaging over the initial phases. 

The discussion of the first ergodic theorem is mainly 
of importance for the discussion of the influence of ob- 
servations on the state of the system and the discussion 
of the change from a pure to a mixed case by observa- 
tion. We have discussed some aspects of this at the end 
of Sec. D(2) and are not going to discuss it any further 
at this point. 

The second ergodic theorem deals as does Hopf’s 
ergodic theorem with an energy shell. At the same time 
it meets the difficulty that we often use to describe a 
system a number of physical quantities which do not 
correspond to commuting operators so that it is not 
possible to have a wave function which is a character- 
istic function of all these operators. It is then necessary 
to introduce instead of these microscopic operators 
so-called macroscopic operators as was first done by 
von Neumann.' Roughly speaking a macroscopic 
operator can be obtained from its microscopic counter- 
part as follows. Let A be a continuous variable which 
can take on any value between — and +œ and let 
macroscopic measurements be such that only the 
intervals 


RGAE p=0,221,262)-9. (E15) 


can be distinguished. Let further f(x) be defined by 
the equation 


f(@w)=k, k&x<k+1, k=0, +1, +2,-+-. (E16) 


The macroscopic variable corresponding to A will then 
be f(A). As is emphasized by von Neumann,” macro- 
scopic variables can always be measured simultaneously. 
It is instructive to quote his actual remarks: “During a 
macroscopic simultaneous measurement of coordinate 
and momentum (or of another pair of quantities which 
cannot be measured simultaneously due to the Heisen- 
berg relations) we measure indeed two physical quanti- 
ties simultaneously and accurately, but these two quan- 
tities are not exactly coordinate and momentum. We 
measure, for instance, the positions of two needles or 
the positions of two images on a photographic plate and 
nothing can prevent us from measuring these simul- 
taneously and as accurately as we wish, but the coup- 
ling between them and the quantities in which we are 


= We must refer here to a paper by van Kampen (54K) which 


was published after this was written and which discusses in great | 


detail and rigor the introduction of these macroscopic operators. 
Em S also the discussion given by von Weizsäcker 
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interested CP and pz) is a loose one and the looseness of 
the coupling is determined by the Heisenberg relations.” 

We first of all introduce energy shells in such a way 
that we group the energy levels together as before and 
assign to each group of S; levels one representative 
energy value £;. Inside each shell we introduce (phase) 
cells [compare the discussion in Sec. A (3) ] which each 
contain sẹ states, say. In each cell the macroscopic 
variables A, B, -+- in which we are interested will have 
a constant value. This means that we have introduced 
macroscopic operators for the energy, A, B, and so on, 
in such a way that they all commute and that their 
characteristic values are s,-fold degenerate. If w, are 
the characteristic functions of the macroenergy, and 
the macro-operators corresponding to A, B, ---, they 
correspond inside a cell to the same characteristic values. 


We further have 
sy (E.17) 


where v runs from 1 to Na if N; is the number of cells 
in the shell. 

Let now P, be the probability of finding the system 
in the vth cell and let p, be the probability of finding 
the system in one of the states belonging to the th cell.!!2 
We have then [compare Eq. (D.242)] 


P =S,þr. (E.18) 


We now define a quantity X(¿) by the equation [com- 
pare Eq. (D2.43) ] 


z=), P,\n(P,/s,). (E.19) 


We must bear in mind that the number of cells in an 
energy shell, that is, the number of states which can be 
distinguished by an observer will be of the order of 
magnitude of (at most) 10° say. On the other hand, 
— which is a measure of the entropy of the system in 
units k (=Boltzmann’s constant) will also be of the 
order of magnitude 10%, that is, if the order of Avo- 
gadro’s number, which means that the s, will be 
enormously big,'?° 


s,~exp(10”) and also S;~exp(10").  (E.20) 


As a result P, will be of the order of magnitude 107% 
and Ins, will be extremely large compared to —InP, 
so that we can write instead of Eq. (E.19) 


LOH=—>d, P, Ins,. 
On the other hand consider the quantity 
(2) = — Dr (5,/S;) Ins,. 
For (©) we can easily prove the following inequalities: 


InS;> >, (s,/5;) In(s./S;) 5 3 
+>, (s,/S;) nSi=—@)m, (E-23) 
—(D)w2 Dor (1/.V,) In (Si/N;)=InS;—InN,. (E.24) 
u9 The p, would be the coarse-grained density of an ensemble 


7 vave function 
9 all systems corresponded to the same w unction, 
aa : function of the system which we are considering. 


(E.21) 


(E.22) 
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The first inequality follows since 0<s,/S;<1, and the 
second inequality since a sum of the kind Za, Ina, 
under the condition Ya,=1 is minimum for all a, being 
equal. By Eq. (E.20), InS; is of the order 10” while 
In.V; is only of the order 50, so that we can neglect In.V; 
with respect to In.S; and write 


(Djy = MSi 


(E.25) 


Since S; is the number of states in the energy shell, 
—(2)w can be interpreted (apart from a factor k) as 
the entropy of the microcanonical ensemble [compare 
the discussion just before Eq. (D1.28)]. 

We can prove the ergodic theorem, if we can prove 
that (Z),, is the time average of E(t) given by Eq. (E.21) 
or, that 

P,=s,/S:, 


(E.26) 


a relation which is strongly reminiscent of Eq. (E.03). 

It is possible, that Eq. (E.26) holds on the average. 
The average is here taken over all possible divisions of 
an energy shell into cells. The way in which the various 
divisions are weighted is the one introduced by Weyl 
(25W). In Appendix IV we shall discuss the proof of 
Eq. (E.26). A necessary condition for the proof is that 
there are no degenerate energy levels.!#! 

We have now proved the equivalence of time and 
ensemble averages in the case of the function 3. How- 
ever, as soon as Eq. (E.26) holds, one can also prove the 
equivalence of time and ensemble averages for other 
variables. This follows from the considerations of von 
Neumann,” but can also be seen by our argument 
similar to the one discussed in the Introduction and 
which runs as follows. From the fact that È and (3), 
are the same, it follows that the system spends most of 
the time in the equilibrium state and that thus the time 
average of any variable will correspond to the equi- 
librium value. We must also mention that in proving 
Eq. (E.26) we have really proved more than just the 
equivalence of X and (S),,, but have made a statement 
about the relative periods spent in the diffegent cells. 
Equation (E.26) corresponds to Eq. (B1.05) of the 
classical case. 

We may, in conclusion, say a few words about a 
recent paper by Klein (52K). His point of view is 
similar to the one of Born and Green in that only 
systems in a well-defined state are considered. In that 
case it is necessary to introduce an interaction between 
the system and the outside world. This interaction, 
provided it satisfies certain mixing ¢onditions, will 
ensure the equivalence of rime and ensemble averages. 
However, this method of approach does not, in our 
opinion, do justice to the fact that an observer is dealing 

121 It must be noted here that if we use, with Fierz, Eq. (E.21) 
for I(t), we only need the condition of nondegeneracy, but not 
the extra restriction of no resonances. This is very satisfactory, 
since, while the energy differences have a physical meaning ( 
enter into expressions for relaxation times, for instance), the second 


differences do not have such a physical meaning. I am,indebted 
to Professor Fierz for calling my attention to this point. 


E to Professor M: Figrz for these considerations. 12 See Appendix IV. . . de 
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with macroscopic variables. We refer to his paper for a 
detailed discussion of his very interesting analysis. 


SUMMARY OF THE SITUATION IN QUANTUM 
THERMOSTATISTICS 


Once again we can briefly summarize the results of the 
two preceding parts (D and E), There is no need to dis- 
cuss again the utilitarian approach as there is no 
difference between the classical and the quantum- 
mechanical case. 

In the case of the formalistic approach, once again 
one investigates the necessary conditions for the 
equality of time and ensemble averages. In the quan- 
tum-mechanical case the condition is the one of non- 
degeneracy of all energy characteristic states. 

Tn the case of the physical approach, one works again 
with representative ensembles and shows that the de- 
velopment of a nonequilibrium situation will be such 
that the ensemble representing the system under 
consideration will approach a canonical ensemble when 
time goes on. The problem here reduces to finding the 
rules according to which the representative ensembles 
can be set up. These rules are the assumption of equal 
a priori probabilities for all nondegenerate states and 
of random a priori phases for the probability amplitudes. 
Once these assumptions are made, one can either use 
the quantum-mechanical version of the statistical 
H-theorem, or one can use the H-theorem as applied to 
ensembles. The discussion is very similar to the one 
applicable in the classical case. 

In the classical case we saw that in the last instance 
the basic ideas of both the physical and the formalistic 
approach were the same. Once one has proved that 
physical systems are quasi ergodic, the ergodic theorem 
follows, but one can also justify from first principles 
the choice of a priori probabilities. In the quantum- 
mechanical case, the situation is not so simple. One can 
justify from first principles the assumption of equal 
a priori weights, but one is left with the assumption 
of random a priori phases. On the other hand, it seems 
unlikely that physical systems are really ergodic in the 
quantum-mechanical sense, and in that case the quan- 
tum-mechanical ergodic theorem must also use an 
assumption about the distribution of the phases. Since 
in quantum-mechanics the (initial) phases do not play 
a physical role, one might with some justification say 
that one should really always average over these phases. 
Tn that case the quantum-mechanical ergodic theorem is 
always true, arid the assumption of random a priori 
phases is also justified. As in the classical case we see 
that the two different methods of approach are not 

differing by as much as one would expect at first sight. 


APPENDIX I. THE LORENTZ MODEL": 


In Sec. A(1) we introduced a model of a physical 
system which possessed the following properties. One 


"23 See 54G; compare also 55G, 55H2. - 
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kind of particle is fixed in space, randomly distributed 
with a density of n per unit volume. A second kind of 
particle is traveling through the lattice colliding elasti- 
cally and isotropically with the first kind, but not 
interacting with its own kind. Its density in space is V 
per unit volume and its speed is c. The cross section for 
a collision is denoted by ø. 

For our discussion we assume phase space, which in 
the present case is the unit sphere, to be divided into 
Im-+1 finite cells, each of extension du, 


bo = 4r/ (m+ 1), (1.01) 


and numbered by an index v which runs through the 
sequence —m, —m+1,---,0,°-°, m—1, m. We denote 
by f, the number per unit volume of particles of the 
second kind—which in order to keep the analogy with 
Lorentz’s model of a metal we shall call electrons— 
moving in the direction specified by the vth element of 
solid angle. The f, satisfy the condition 


D f= (1.02) 


and their equilibrium values f,° are given by the 
equation 


foe=N/(2m-+1). (1.03) 


To characterize the state of the system, we introduce 
A by the equation 


A= yen (fho fo’)? (1.04) 


We can now ask for the probability w(A)dA of finding 
a value of A in the interval A, A+dA. We use the method 
introduced by Markoff (12M; see also 43C, 46C) to 
find a probability distribution for a quantity which is a 
sum of a number of similar terms of functions, the 
probability distribution of which is given. We mention 
the result (see 43C) but give no proof. If 


b= Le Pk (a), (1.05) 
where the q; are stochastic variables such that 


is the probability of finding the q; within ranges qi 
qitdg:, the probability w(@®)d® to find # between ® 
and @+d® is given by the equation 


-Ho 
w(®)db= (d®/2r) f exp(—tp5)A(p)dp,. (1.07) 
where A (p) is determined from 
A(p)= f sog f exp(ipdiox)P(q:)IIdq:. (T.08) 


In our case Eq. (1.04) takes the place of Eq. (1.05), 
and the probability of finding a certain set of values 


for the f, is a compound probability. Using Eq. (1.02), - 


we find ` 
LO) =EN YES !](2m+1)-%, (1.09) 


id 


e 


a 
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which for large values of the » Which are in the neigh- 
borhood of f,* takes the fami oe 
Pf) =L(2m+ 1)/2r N ]"(2m-+-1)3 
Xexpl— (2m-+1)> a IYIN 
which is normalized. 
For the function A (p) we get 


(1.10) 


4%)= f. . fon) exp(—ipA)dfin++-dfm, (1.11) 


where the integral is 2m-fold since by virtue of Eq. 
(1.02) only 2m of the Je are independent. Since one of 
the fe must be eliminated, for which fo is chosen to 
preserve the symmetry, the integration is rather tedious, 
but luckily straightforward and one find 


A (p)=[1—2iNp/(2m+1)}-™. (1.12) 
Substituting from Eq. (1.12) into Eq. (1.07) one finds 
by contour integration» 
w(A)dA=[ (2m+1)/2N ]"[A"-1/ (m—1) !] 

Xexp{— (2m-+1)A/2N}dA, 


which is Eq. (A3.13). 
From Eq. (1.13) one gets 


(1.13) 


an= | Aw(A)da=2nN/(2m+1), (1.14) 
and 


(4%) w= f ato(ajda=2m(2m+2)N*/ (2m1), (1.15) 


from which Eqs. (A3.16) follow if we neglect in the 
final result 1 against 2m, since m>>1. 

We now must evaluate the probability w(A,A’) that 
A changes its value from A to A’ during a time interval 
7. The change in A is related to the changes in the 
values of the f, by 


A’—A=2 Eo Ge GE = 3) 
=2 SEs Gs =f); 


since from Eq. (1.02) it follows that > fr =} fe. 

Let Xe be the actual number of electrons passing 
from cell v to cell v’ during a time interval 7. If we could 
use a Stosszahlansalz, we would have 


(1.16) 


Xost = Afr, (1.17) 


where 


` a=norc/ (2m+1), (1.18) 


where we have used the assumption of isotropic scatter- 

ing [compare Eq. (A1.15) ]- i f i 
We are, however, interested in fluctuations and in- 

stead of Eq. (1.17) we must consider a distribution of 


the x, around an average value which will be given by 


1 To derive Eq. (1.10), one uses the Stirling formula in the 
form heler Ine ethet inde and expands in terms of 


foo Wf. 


Eq. (1.17). If the distribution of the fixed particles in 
space is random we may assume this distribution to be a 
Bernoullian one, 5 s 
The time interval 7 will be chosen in such a way that 
the probability of any one electron undergoing more 
than one collision during 7 is negligibly small, or 
THA /noe. 


(1.19) 


If we chose V so large that a. is still large compared to 
one so that we have the inequalities 


(1.20) 


we can use the Gaussian form for the «,,--distribution, 
P (Hen) = (2ra fa) expl— (ty—af.)?/2af,]. (1.21) 


The change of the f, during a time interval 7 is 
governed by the x. and we have 


fe’ —fo=Xw (toro tr). (1.22) 


Combining Eqs. (1.16) and (1.22) we see that we have 
once again a problem of the kind which led to w(A). 
The expression w(A)w(A,A’) now takes the place $ of 
Eq. (1.05) and the $y are expressions in the f, and the 
Xv". From Eq. (1.07) we now have 


NYavN>t, 


w(A)w(A,A’) = (1/42?) 


x ff expl—iea—ip(a’—ay (p,0)dpdo, (1.23) 
where 


A (oo)= f . fajm +f eie 


X P( fo) p (E-m) re P(Xmm) 
Xexpliods (fo— fe) + 2i fel fe’— fe) ]. (1.24) 


The integration over the (2m-+1)? variables xw is 
straightforward after we have substituted from Eqs. 
(1.21) and (1.22). The integration over the 2m variables 
fv (v0) is more complicated. We introduce new vari- 
ables a, through 


a= i= fe fet (1.25) 


and neglect cubic terms in the exponent with respect 
to quadratic terms. We are then left with a quadratic 
expression in the exponent and can integrate. The final 
result leads to c 


A (p,o)=[ (2m+1)/2][4Vap(2Np+2m+1) 
—2Nis+2m+1}", (126) 
and from Eqs. (1.23) and (1.13) we get the normalized 
expression 
w(A,A’)= (32r Vaa) 
Xexp[— (A’—-A+2(2m+1)cA)?/32NaAT> (L27) 


which is Eq. (A3.19). ` 
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Equation (A3.21) follows from the equation 


Av = f Awla, A'da’, (1.28) 


using Eq. (1.27). ; 
In footnote 36 we mentioned that the following 
relation held: 


qw(A)w(A,A’) =w(A’)w(A’,A). (1.29) 


We must draw attention to the fact that in proving 
Eq. (1.29) one neglects higher-order terms in a which is 
permitted because of inequality (1.19). 

Using Eqs. (A3.27), (A3.28), (A3.31), and (1.27), 
Eqs. (A3.19) and (A3.32) follow, again up to first-order 
terms in a. Similarly, Eq. (A3.30) follows from Eq. 
(A3.29) by neglecting terms in a? and higher-order 
terms. 

Tf the Stosszahlansalz would be valid, one can use for 
the rate of change of the fẹ Eqs. (1.22) and (1.17) which 
gives us 


df,/dt= (fe — fA T= Xi’ (Xorn — Xov) 
= (a/v fe — (2m+1) fo], (1.30) 


whence Eq. (A3.34) follows by using the fact that 
Vfer=Zfr t= (2m-+1) fot. 


APPENDIX IJ. PROOF OF THE EXISTENCE OF 
QUASI-ERGODIC SYSTEMS!*5 


Fermi’s proof of the quasi ergodicity of kanonische 
Normalsysteme with more than two degrees of freedom 
consists of two parts. The first part is a proof that the 
energy surfaces $ are the only family of surfaces in 
phase space which have the property that any orbit 
starting from a point of one of the surfaces of the 
family will always stay on the same surface. The second 
part then consists of showing that, if there are two 
arbitrary regions o and o* on $, there will be orbits 
passing through both o and o*. 

The proof of the first part is slightly complicated and 
runs as follows. A kanonisches Normalsystem is a system 
where one can introduce canonically conjugate coordi- 
nates a; and y; such that: 


(a) the energy is time independent; 
(b) the Hamiltonian 3¢ of the system can be developed 
in a power series of a parameter a, 


Se= Rotalia panion (11.01) 


(c) the Hamiltonian and thus all 3; are periodic in all 
the x;. We can then choose our x; such that the 
period is the same for all «,, say, 27; 

(d) the first term in the series expansion in Eq. (II.01), 
Ico, does not depend on the x;. 

Let us assume now that the equations 
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describe a family of surfaces Ra and that thé functions © 
are uniquely valued, analytical and periodic in the x; 
We must show that each of the Ra coincides with an 
energy surface $. 

We expand ® in powers of œ and have 


=P) tah) a°Po+ OTIS (11.03) 


If the Ra are given we can still to a large extent choose 
our &; arbitrarily and especially it is.possible to prove!?6 
that one can for given Ra choose Po not to depend on 
the x: Moreover, one can choose the ®, arbitrarily 
outside Ro. 

As the Qa are surfaces such that an orbit starting 
from one of its points will stay on the same surface, the 
equation è 

{5C,P} =0 (II.04) 
must be a consequence of Eq. (II.02), where {A,B} is 
the Poisson bracket defined by 


{A,B} =>; [(0A/0y,) (0B/dx2 
—(0A/dx;)(dB/dy,)]. (I1.05) 
From Eq. (II.04) and the series expansions (II.01) and 
(11.03) it follows that the following equations must hold: 
{5Co,Bo} =0, (II.06) 
{50,81} + {5C1,Po} = 0, seleg (11.07) 
Equation (II.06) is trivial as neither 5Co nor Po contain 
the xvi. 
As both ®; and 3e, are periodic in all the x;, we can 
write 


P= Am(yx) EXPIm, (11.08) 
IC1= F Bm (Yi) EXPIm, (11.09) 

with 
Om= ilmi moxt -° Hmrtn), (11.10) 


where 7 is a short-hand notation for all the 1 m; and 
where the summations in Eqs. (II.08) and (II.09) are 
over all n mi. 


From Eqs. (II.07) to (1.09) we get 


2 expôm[ A mi Mmiwi— Bm 2; 3x; ]=0, (11.1 1) 
or 
A Da Mw = BA De; MXi (11.12) 
where A 
w= O5o/ Oy, xj;=O8P/ dy; (11.13) 


If we exclude the possibility that the Bm can be zero," 
it follows from Eq. (II.12) that in any point of Qo 
where m,w: is zero, 2m 3x; must also be zero. 

Assume now 2=3. The discussion is equally valid for 
n>3, but not for2=2 (compare 23F1). In general at 
most one of the three ratios w/w, w3/w2 will be con- 
stant in Qo. We may thus assume the first two ratios to 


be varying on Qo. In that case Ro is covered densely 


ý P(x,y; a) =0 (11.02) 126 See Sec. 3 of 23F1; one uses the fact that the Ra are surfaces 
= ` ja on which an orbit stays. _ : 
@ 125 See 23F 1. 121 For a discussion of this case see 92P. 
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with points where w2z/w, takes on rational values so 
that one can find m and ma such that 


Myo + Mx02= 0, (L14 
It then follows that in these points 

mixıtm:x:=0, (11.15) 
so that « 

2/1 = X2/X1. (11.16) 


As Eq. (II.16) holds in a point set which is everywhere 
dense on Qo, it holds identically on Ro. The same is 
true for equation 


@3/01=X3/X1. 


(11.17) 
From Eqs. (11.13), (11.16), (11.17) and the fact that 
5Co and &y do not depend on the x; it follows that on Ro 

Po=KHo+co, (11.18) 


where co is a constant. As Po can be chosen arbitrarily 
outside Ro we can choose it in such a way that Eq 
(11.18) holds everywhere in phase space. 

If our system is nondegenerate, the equation 


YS mw;=0 (11.19) 


cannot hold identically on Qo and we can divide Eq. 
(11.12) by 2mw,; which by virtue of Eq. (11.18) is 
equal to Ym,x;. The result is 


An=B,, except when all m;=0. (11.20) 
It follows that on Ry 
G1 (4,9) =i (xry) + hy), (11.21) 


and once again we can choose ¢ outside Go such that 
Eq. (1II.21) holds everywhere in phase space. 

One can now easily prove by complete induction from 
Eqs. (11.18) and (11.21) that 


B:=Ki+c;, (11.22) 


where the c; are constants. The proof is along the same 
lines as the proof of Eqs. (II.18) and (II.21). 

Equations (II.22) lead to the result that Eq. (11.02) is 
equivalent to the equation 


Kayi; @) = Lena" =c, 


and the Qa thus coincide with energy surfaces. 

The second part of Fermi’s proof is much less com- 
plicated. Let o and o* be two arbitrary regions on $. 
Let o’ be that part of S$ which is covered by the orbits 
which start somewhere in $. If o’ would cover all of $ our 
point would have been proved. If o’ does not cover all 
of S, let o” be that part of $ which is not covered, and 
let ® be the boundary surface between g’ and a”. There 
can be no orbit which contains points both in o’ and o”, 
as can be seen as follows. Let P’ in o’ be passed at / 
by an orbit which at ¢” passes through P” in g”. Since 


(11.23) 


` the solutions of the equations of motion are analytical 


z : Sava Z 
functions, we can find small regions 7 ino around P’ 


|» and 7” in o” around P” suclf that the orbit passing 
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through an arbitrary point 0’ of 7! at 2’ will pass through 
n” at t”. Since, however, n’ lies in o’ there will be points 
lying on an orbit starting from ø and we have arrived 
at a contradiction. 

Let now P be a point of ® and once again P’ and P” 
be points of o’ and o”, respectively. If we consider 
orbits passing at £ through P, P’, P”, these orbits will 
pass at 4; through points P, Py, Py’, at l through 
Pz, Px, Px”, and so on, where Py’, Pa’, +++ are all lying 
in a and Pi”, Px’, «+» are all lying in o”. We can 
choose P’ and P” so close to P that PY and Pi” are 
close to Pi, Px’, and Ps’ close to Ps, and so on. It then 
follows that P, Pi, Ps, «-+ must all lie on @ and ® thus 
contains an orbit. We saw, however, that the only sur- 
face on which an orbit could stay was an energy surface. 
Hence, @ cannot exist, or, o’ must cover $ completely. 
That means that o* will be part of o’ and there will be 
orbits starting from ø which pass through o*. As both 
g and o* can be taken arbitrarily small we have proved 
the quasi ergodicity of the systems considered, 


APPENDIX III. PROOF OF THE CLASSICAL 
ERGODIC THEOREM! 


Birkhoff’s ergodic theorem proves the equivalence of 
the average taken over an energy surface and a time 
average taken for practically all orbits on the energy 
surface, provided the energy surface is metrically inde- 
composable. Let f(P) be a well-behaved phase func- 
tion,” that is, a function of the phase represented bya 
point P on the energy surface $. 

Consider the function f(P; tT) defined by the 
equation 


to+T 
JP; 6.1)=— f f(P)dt,  (IIL01) 


to 


where P, is the point passed at ¢ of an orbit which 
passed through P at t. We shall first of all prove that 
for almost all orbits the following limit exists: 


F(P3 to, 2) = lim JP; 46,7). (111.02) 
Secondly we shall show that this limit is independent of 
lo Finally we shall show that this limit is constant 
almost everywhere on 8. 

To prove the first part we divide the time scale in 
finite segments and write G 


T=nr, (I11.03) 
and 


OR I stench (111.04) 


If P were a phase point for which the limit of Eq. 
(III.02) would not exist, the lower bound L(P) of 
fn(P; to) and the upper bound U (P) of f,(P; to) would 
be different and one could find two quantities a and 8 


28 The proof we give here is due to Kolomogoroff [see 52R, 


9K1). . 
19 Well-behaved is understood here in the sense of summable 
over the energy surface which is assumed to be of finite volume.* 
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such that 


L(P)<a<B<U(P). (111.05) 


Moreover, if the set © of phase points for which the 
limit of Eq. (111.02) did not exist were of positive meas- 
ure, one could find a subset D’ of D also of positive 
measure and a set of values a and £ such that the 
inequalities (111.05) would be satisfied for all points P 
of D’. This, however, leads to a contradiction, as follows. 

Let Py be the phase point at fo+-A7 and let a,(P) be 
defined by the equation 


1 tht 
m(P)=- | f(Piddt. (111.06) 
Tv tk 
By shifting the origin one sees easily that 
ax(P)=20(Pr). (111.07) 


The time average f,(P; to) can be expressed in the 
xx as follows: 
1 n—1 


Fr(P; t) =- E wx(P). 


n k=0 


(111.08) 


We now integrate fa (P; to) over a set of points Do”? 
which is a subset of D’ such that for any point of Do” 
we have 


fn(P; to) >B. 


The result of the integration is 


(11.09) 


MBN (Do) < nfi (P; to)dwo 


n—1 n—1 
= >D x (P)dwo= yD xolPi)dwk, (111.10) 
k=0 k=0 


where M (Di™) is the measure of the set Do”, where 
diy indicates integration on the set Do”) and dw,” 
integration over the set D,‘") obtained from Do“ by 
the transformation P—P,. 

Suppose that the sets D,“ are nonoverlapping and 
let their sum be D™, 


DM = TD, (111.11) 
From Eq. (111.10) we now have 
f xo(P)do™>BM(D), (12 


where / dw) indicates integration over D™ and 
where we have ‘used the fact that Dt(D.™)=M (Do) 
for all k, as a consequence of Liouville’s theorem. 
It can be shown that one can find such sums D™ of 
$ nonoverlapping sets for each value of 7 in such a way 
that they exhaust D’ and from inequality (III.12) it 
thus follows that 


f xo(P)dw' >BIN(D'), (111.13) 


_ where fd! indicates integration over the whole of D’. 
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One can similarly prove the inequality ‘ 


Í xo(P)des! <aM(D’), (111.14) 


ee i 


and combining inequalities (111.13) and (III.14) with 

the assumption M(D’)>0 we arrive at a contradiction 2 
with our choice of a<f. It thus follows that D is of 
measure zero and that the limit of Eq. (III.02) exists 

for practically all orbits. 


The first part of the proof is now concluded by á 
considering ' 
1 tot T 1 lotar P 
Az -Í j(Pyai—— f ea, (111.15) Ca 
T Jt NT Vto 
where now Eq. (III.03) does not necessarily hold, but x 
is the largest integer contained in T/r. We can split A 
into two parts, 
A=A,+4 2) (I1Z.16) z s 
with 
1 1 lotnr 
NA (=-->) f s(P eat, (II1.17) 
12 NT to r 
1 to+T 1 lotnr 
A= =f seya- f f(Pddt\. (I11.18) 
T to T to 
Since fn(P; to) had a limiting value for n>, say, 
F, we have for A, the limit 
nr-T 
Am T, (111.19) 
and we see that 41-0 for T =o. 
For Az we have the inequality a 
1 (ett T 
A:=— | f(P)dt|<—|x,(P)|, (III.20) 1? 
totnr T 


and for any well-behaved function f(P) «,(P) will be 

bounded and thus A»—0 for 7. This concludes the a 

first part of the argument. 
The fact that f(P;t,) is independent of fy follows 

easily from the equations 


1 to+T 1 RA 
lim — Í f(P)dt=lim — Í P,)dt 
IE to T Ju K ) 
u+T 


1 7% 1 
Him — f J(P)di—lim— f f(Pddt, (111-21) | 
T to T t 


oT l 


and for any well-behaved function f(P) the last two 
limits will be equal to zero. 

The final part of the proof of Birkhoff’s ergodic 
theorem follows from the fact that, if f(P; to) were 


not constant practically everywhere on $ it would be F 


eS 


oO 


THE 


possible to,find a value F of J(P; la) such that the 
conditions f(P;t,«)<F and JP; tp) >F would 
define two sets of positive measure on $ whi h would 
both be invariant against the transformations PP, 
since J(P; lo) is invariant against such transforma- 
tions. This would, however, be in contradiction to the 
assumed metrical indecomposability of $. 


s 
APPENDIX IV. PROOF OF THE QUANTUM- 
MECHANICAL ERGODIC THEOREM: 


In Sec. E we found that the quantum mechanical 
ergodic theorem was proved once we could show that 
the time average P, of the probability P, of finding 
the system in the vth cell is practically always equal to 
the ratio of the number of states s, in the cell to the 
number of states S; of the corresponding energy shell. 
In our discussion we had used the fact that both s, and 
S; are of the order of magnitude of exp(10%) and we 
shall use this fact also in the following. The expression 
“practically always” is meant in the sense that Eq. 
(E.26) holds for practically all subdivisions of phase 
space into phase cells, where the weight of different sub- 
divisions will be defined presently. 

Let y be the wave function of our system, let y, be a 
complete orthonormal system corresponding to the 
Hamiltonian operator H, and let w, be a complete 
orthonormal system corresponding to the macroscopic 
operators discussed in part E. The set of the w, will 
depend on the manner in which we have chosen our 
phase cells. We can express y in terms of the g, or in 
terms of the w,, 


Y= Die oe (IV.01) 
YH=Dr lw. (IV.02) 


The g, and w, are related to each other through a 
unitary transformation, 


@r= doe UreGa, Po= dor Ver*w;,  (IV.03) 


and the different ways in which the phase cells can be 
chosen is reflected in the different transformation 


matrices U. 
From Eqs. (IV.01) to (IV.03) it follows that 


l=} TeUne, (IV.04) 


and since the gy, are the characteristic functions of the 
energy operator we have 


re=|ro| exp(@Eot/h). (IV.05) 
The quantity P, is given by the equation 
P,=2 l’, (IV.06) 


where the summation extends over the s, levels of the 
yth cell. From Eqs. (IV.04) to (IV.06) we get 


P, =E rop|re||rp| Ure" zp expli(E.—£,)t/h], (IV.07) 


10 See 29N, 37P, 52R; I am also indebted to Professor M. Fierz 
for correspondence on this topic. ¢@ 
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and for the time average 


P= Lealtel™| Unel, _(1V.08) 


where we used the fact that no two energy levels are 
the same,'# 
Consider now 


IP,—s,/S,| = Elre CsA SIl (TV.09) 
where 
Co= Doel Ore, (IV.10) 
and where we have used the fact that 
We teltan (IV.11) 
From Eqs. (IV.09) and (IV.11) it follows that 
|P,—s,/S;| <max/C,—s,/5;]. (LV.12) 


If we can prove that, for practically all transforma- 
tions matrices U, C, is practically equal to s,/5, our 
theorem is proved. We must thus find the probability 
distribution of the quantity C, given by Eq. (1V.10). 
The U,, are the components of unitary unit vectors in 
Hilbert space and the C, are the square of the length of 
the projection of these vectors on the subspace corre- 
sponding to the s, states of the phase cell under con- 
sideration. If we now assume that the probability of 
finding divisions corresponding to the unitary unit 
vectors U lying within a given solid angle in Hilbert 
space is proportional to the measure of this solid angle— 
or to the surface area cut out of the unit sphere in 
Hilbert space—the determination of the probability 
W(C)dC that C, lies between C and C-+dC is reduced 
to a problem in multidimensional geometry. Similar 
problems are discussed by von Neumann (29N) and 
Pauli and Fierz (37P), and we shall only give the final 
result for W(C) which is 


W(C)=k-C”(1—C) Si, (IV.13) 


where & is a normalizing constant and where we have 
made use of Eqs. (E.20). E 

From Eq. (IV.13) we see, first of all, that the most 
probable value for C, is s,/S,, and secondly, that the 
maximum is extremely sharp. In fact, the function 
W (C) is reduced to half its value at a distance S-? from 
its maximum. From this it follows that for “practically 
all” divisions C, will have a value “practically” equal 
tos,/S;and that thus Eq. (E.26) is “practically always” 
satisfied. 


APPENDIX V. TIME-PROPORTIONAL TRANSITIONS"? 


In this appendix we wish to sketch the derivation of 
Eqs. (D2.45) and (D1.30) which played such an im- 
portant role in our discussion of the quantum-mechani- 
cal H-theorem. 

We, first of all, remember that in ordinary quantum- 
mechanical perturbation theory it follows from the 

31 It must be noticed that the additional condition of no de- 


generacy of resonances is not 
12 Compare 38T, Secs. 99, 100. * a me 


a 
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method of the variation of the constants’(26D, 27D, 
28P, 38K Sec. 53) that if a,(0) are the amplitudes at 
/=0, the amplitudes at a later time ¢ will be given by 
the formula 


a,(¢) =i Ventexpli(En—Et/h]— 1 } 
<a, (0)/(Ex- En), 


where the summation is over all these states which were 
represented at /=0, where the state # was not repre- 
sented at /=0, where Van is the matrix element of the 
perturbation operator V through which the transitions 
take place [compare the discussion of the Born-Green 
considerations in Sec. D (2) ], and where Æx is the energy 
value of the state k. 

Consider now the situation where we want to know 
the number of transitions from one group S; of energy 
levels to another group S; of energy levels. Let us first 
of all assume that at ‘=0 we know from observation 
that the system is in one of the S; states of the ith 
group. In accordance with our basic assumption of equal 
a priori probabilities and random a priori phases we 
have at /=0 for the density matrix representing our 
system 


(V.01) 


Pri= (arai) = ĝr1/ Si 
if state & belongs to the ith group, r(V.02) 
=0, otherwise. 


These equations follow since if both k and / belong to 
the ith group and are unequal (a,a/*) is zero when we 
average over the phases; the remainder of the equations 
follows easily." 

The probability P,’ of finding the system at / in a 
state of the jth group follows from Eq. (V.01) and we get 


Pj =Yon an" (t)an(t) 
=}, ln Vien* VinfexplL—7(Ex—E,)t/h]— 1} 
X {expli(£.—£,)t/h]—1} 
Xa,*(0)ai(0)/(E,—Ex)(En—E1). (V.03) 


Averaging over the representative ensemble and using 
Eqs. (V.02), we get for the P; the expression 


P;(t)= (4/8) Dine | Vien? 
X[sin?s (Ey—E,)!/h/ (Ex—E,)?. (V.04) 


In Eqs. (V.03) and (V.04) the states k and / belong to 
the ith group and the states to the jth group. To 
evaluate the sum in Eq. (V.04) we replace it by a 
double integration and get (compare 38T Sec. 99) 


Pi (O= (20/hS;) Dien| Ven|?on(B)ox(E)AE t, (V.05) 


where c,,(cx) are the density of the energy levels in the 
jth (ith) group and where AE is the range of energy 
corresponding to the jth group. 

Equation (V.05) can be written in the form 
Pi=Tit/ Si, (V.06) 


18 Ope takes into account the fact that 9 is normalized. 


a 


HAAR 


and, if we had been interested in a situation where 
originally observation had shown the system to be in 
the jth group and we would like to know the probability 
of finding the systems in a state of the ith group, we 
would have got the result 


P= TS (V.07) 


and from Eq. (V.05) and the Hermitean character of V 
it follows that 


T = T jx (V.08) 


If observation had only shown that at /=0 the differ- 
ent groups were occupied with probabilities P; P;, ++, 
we should instead of Eq. (V.02) for ọ have used the 
equation 


pri= durP;/S: (state k belonging to the ith group), (V.09) 


and by an analysis similar to the one just given we 
would have found that the rate of transition V;; from 
the ith group to the jth group would be given by the 
equation 


Niyj=Ti;Ps/Si, (V.10) 


which reduces to Eq. (D2.45) by the substitution 
Aij=Ti3/SiSj, (V.11) 


while from Eqs. (V.08) and (V.11) Eq. (D2.46) follows. 

In order to derive Eq. (D1.30) we remind ourselves 
that we are dealing with a system containing N prac- 
tically independent particles so that the Hamiltonian 
of the system can be written in the form - 


H=Ho+V, (V.12) 
where 


H=2:H,, (V.13) 


H; being the Hamiltonian of the ith particle, and where 
V is the interaction operator which is neglected to a 
first approximation, but which is necessary in order that 
transitions can take place between different states of the 
system. The characteristic functions of Ho are of the 


form 
$=]: pp Ex=Di e; (V.14) 


where the g; are the characteristic functions of the H,, 
Ep the characteristic value of Ho corresponding to Pr, 
and the e; the characteristic values of the H;. A wave 
function P&; obtained from &; by a permutation of the 
arguments of the V particles is another characteristic 
function of Ho belonging to Er. 

If we are dealing with a system of Fermi-Dirac par- 
ticles we can only allow those wave functions which are 
antisymmetrical in the arguments of all NV particles 
and we have thus only cne acceptable combination of 
the P&, in this case, namely, 


}r.p=C>(—)PPa&,, (v.15) 


14 Compare the analogous position in the case of Boltzmann’s 
H-theorem, where the Maxwell-Boltzmann distribution which 
really presupposes independent particles is established through 
collisions, that is, by means c£ their interactions. 
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sae : 
when (istori according as the permutation 
is even or odd. 

In the case of a system of Bose-Einstein particles the 
only acceptable wave function is completely sym- 
metrical, or, i 


®y.2=C> Ph; (V.16) 


where in Eqs. (V.15) and (V.16) the summations extend 
over all N ! permutations, and where in both cases the 
factor C occurs as a normalization constant. 

In the following we shall restrict ourselves to a system 
of Bose-Einstein particles. The case of Fermi-Dirac 
particles is simpler, but otherwise analogous. We first 
of all assume that V is of the form ; 


V= Vas, 


where a and 8 number the V particles, and where the 
summation extends over all pairs of the system (a@<{). 
This means that we assume that only two-body colli- 
sions are of importance and that three-body or higher- 
order collisions can be neglected. As we are already 
assuming that the system is so dilute that to a first 
approximation V can be neglected this is a reasonable 
assumption.!# 

The wave function , may contain m factors g1, n2 
factors go, ---, n; factors g; ---, where the ø; are a 
set of ortho-normalized characteristic functions. In 
order to evaluate the factor C in Eq. (V.16), one must 
combine only those permutations which do not lead to 
products of orthogonal ø; with the same argument! 
so that we finally get for C the equation 


(V.17) 


[C|P=[N tm mnm! gle J (V.18) 


In a similar manner we can evaluate the matrix ele- 
ment V+ corresponding to a transition from the original 
state to a state where instead of n; factors ¢,, 1; factors 
$j, ny factors pv, and n; factors gj we have n,+1 
factors g; mj;+1 factors ¢;, nv—1 factors py, and 
nj —1 factors gj. Introducing the two integrals 


n= f vi" (a) 9;*(B) Vasp (a) 93(B)dradrp, (V.19) 


ce Í o (6)o” (0) Vasov (a) py (B)dradra, (V-20) 


one finds 
Vr= |114|? Gritz) ij Lren. (V-21) 


ae very few cases where three-body collisions—to 
foes eer R those—have been considered. They enter 
into the calculation of the third virial coefficient (39B3, 40B, 
41M) but up to now only the classical case has been studied and 
the difficulties to be expected in a quantum-mechanical considera- 
wate acumen the argument here and refer to Tolman 
(38T Sec. 100) for a detailed derivation. 


In the Fetmi-Dirac case we would have found instead 
of Eq. (V.21) the expression" 


Vus |I| nen; (m—1) (n= t). (2 


Up to now we have not introduced the groups of 
energy levels which we considered in Sec. D(1). We do 
this now and then ask for the number of transitions 
which take place per unit time from Z; and Z; to Zy 
and Zy, where therefore Va Nj Vy, Vy, change to 
Ni—1, Nj-1, Neti, Vy-+1. Once again we use the 
method of the variation of constants and the analysis 
very similar to the one leading to Eq. (V.06) so that 
we shall only sketch it. 

The probability of P;(t)%* of finding the system at £ 
in one of the states corresponding to Ni-1, Nj—1, 
Ny+1, Vy-+1, if the state was originally in one of the 
states corresponding to Va Na Vy, Ny, is given by 
the expression 


P= rlulh l =E toor Vro” V tooo (0) 
X {expl —i(E;—E, l/h J—-1) 
X {exp[—ilEr— Ev)t/h]—13/ 


(E»v—E)(Es—E), (V.23) 
where poo’ (0) is the density matrix at t=0, 
Poot (0) = a0:* (0)a,(0), (V.24) 


and where the aps and a,’s are again the probability 
amplitudes. 

Again, we assume that we have made a measurement 
at /=0 giving us the information that at £=0 one of the 
initial states was realized so that we can write for the 
density matrix 


Poor (0) = (1/20) Soros | 
if state o satisfies our requirements, -(V.25) 
=0, otherwise, if 


where g, is the number of states satisfying our require- 
ments. 

In Eq. (V.23) we can now introduce Expressions 
(V.21) and (V.22) for Vr and write for Er— Eo, 


(V.26) 


E:— E= ete Ep ép. 
Writing further 


m=N/Za n=Ni Zn ne=Ne/[Ze, , 
ny=Ny/Zy, NA 


that is, replacing the m;, --- by their mean values in the 
corresponding group, and integrating we finally get for 
P;(t) the expression 


Pi(t)= (20/A)[ | i T2|2/AE] 


XKNN (Zeb Ne) (ZytN yt, (V.28) 
37 This formula can be found in papers by Jordan (25J, 27J1), 
Ornstein and Kramers (270), and Bothe (28B); the further de- 
velopment is due to Tolman. A * J 
13 The subscript f indicatés the final state, while the subscripts 
o and o’ indicate the original states. ls 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar e 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


334 D. 


where AF is the width of the energy groups. From Eq. 
(V.28) Eq. (11.30) follows easily, and Eq. (D1.31) is 
now a consequence of the Hermitean nature of the 
operator V. 

We have now derived Eqs. (D1.30) and (D2.45) 
which played such an important role in the discussion 
of the quantum-mechanical 7/-theorem. We have used 
(i) the quantum-mechanical picture of time dependent 
transitions, and (ii) the statistical assumption of equal 
a priori probabilities and random a priori phases. The 
second point has been discussed in Sec. D(3), but with 
regard to the first point we should like to conclude this 
appendix with a quotation from Pauli’s classical paper 
(28P): “The statistical laws for the frequency of transi- 
tions between stationary states, which are of the same 
character as the laws for radioactive decay, guarantee 
in themselves such a sufficient degree of randomness 
as is necessary for the statistical interpretation of the 
second law of thermodynamics.” 


APPENDIX VI. KLEIN’S LEMMA! 


In Sec. D (2) we used the fact that expression (D2.56) 
would never be positive. This property will now be 
proved. We are dealing with two density matrices 9’ 
and 9” representing a system at ?’ and ¢’” with matrix 
elements px’ and px’, where 


(VI.01) 


but o” is not necessarily a diagonal matrix. The diagonal 
elements pre’ (prk) are, as we saw in Sec. D(2), the 
probabilities of finding the system at /’ (#’’) in the kth 
state. They are thus never negative and the expression 
Qxn given by the equation 


Qrn= prr [npr — Monn” —1]+ pnn” (VI.02) 


is never negative as a consequence of Eqs. (D2.58). 
The pri” are determined through Eq. (D2.07) by 
the probability amplitudes at 7” which in turn follow 
from the probability amplitudes at ¢’ by integration of 
the Schrödinger equation. It is well known that one can 
obtain the probability amplitudes at 1”, a,*(t’), from 
those at t’, an*(t’), by a unitary transformation U, 


ant (V) =E ak(!)U nt. (VI.03) 
Using Eg. (VI.03), we get from Eq. (D2.07) 
pnn” =X r| Unk pr +d nU npr, (VI.04) 
or using Eq. (V1.01), 
Pan =r] Unk per. 


139 After this was written a paper by van Kampen (54K) was 
published in which the symmetry of the transition matrix is dis- 
cussed. Van Kampen draws special attention to the fact that, if 
magnetic fields or Coriolis forces are present, this symmetry must 
be replaced by a slightly weaker condition which does not follow 
from the Hermiticity of the Hamiltonian. Van Kampen also dis- 
cusses in nis paper the relation between the Onsager relations 
(3101, 3102,.45C) and the symmetry of the transition matrix. 

10 See 31K; 38T, Secs. 101 and 106. - 
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Prt = Pkk Ox1, 


(VI.05) 
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Multiplying Eq. (VI.02) by the positive quantity 
|Unx|? and summing over all values of n and k, we get 
Da a U nel prr" npr — Èr, | U nel pkr Inpn nt 

+— Si, nl U nel? prr HEr, nl Unk? Pnn" 20. (VI.06) 

Using Eq. (VI.05) and the property of unitary mat- 

rices that 


Yon Unk U n= sky i (VI.07) 
Eq. (VI.06) reduces to 
Dk prr! npr: — Qin Pris Inpnn” 
— r prr Hn nn’ 20, (VI.08) 


or 
ye pir Inpex’— Don Prot” ipana 2 0, (VI.09) 


since as a result of the normalization of 9’ and 9” 
[Eq. (D2.10)] we have 


Dp pee = a Panes 


We have now proved Klein’s lemma. It was stressed 
by Pauli (49P) that the essential content of this 
lemma is that Tr(o Ing) increases on the substitution 
of ọ by a diagonalized matrix in such a way that one 
just puts the off-diagonal elements equal to zero. As 
soon as this is realized, Born and Green’s proof of the 
decrease of o® is really trivial and could have been 
replaced by a reference to Klein’s lemma. 


(VI.10) 


APPENDIX VII. THE PRINCIPLE OF DETAILED 
BALANCING!" 


In Sec. D(2) we mentioned that Eq. (D2.50) ex- 
pressed the fact that at equilibrium as many transitions 
from the ith groups of energy levels to the jth group 
as the other way round. This is an example of a principle 
which is valid in a great many cases and which states 
that at equilibrium the number of processes destroying 
situation A and creating situation B will be equal to the 
number of processes producing A and destroying B. 
Other instances of situations where the principle of 
detailed balancing applies were met in Sec. A(1) 
[Eq. (A1.12)]] in the case of a gas of spherical molecules, 
Sec. A(3) (equation in footnote 36) in the case of some 
general models of a gas, and Sec. D(1) [Eq. (D1.04) 
and Eq. (D1.31) combined with Eqs. (D1.30) and 
(D1.27) ]. 

From these examples one might get the impression 
that detailed balancing would always hold. In the case 
of classical statistics, Lorentz (87L) showed that it 
could not hold in the case of polyatomic molecules 
which could not be considered to be spheres (see also 
98B). He showed how one gould still prove an H-theorem 
where now cycles Of transitions had to be considered 
instead of collisions and their inverse collisions.” 


u1 See 25F2, 25T, 38T Sec. 50, ESM p. 381. 


12 For a discussion of this more general H-theorem we refer the - 


reader to Tolman’s monograph (38T). On p. 119 of this mono- 
graph one finds an example of a collision which does not possess 
an inverse collision. ` 
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THE FOUNDATIONS 
The same difficulty arises in quantum statistics 
Hamilton and Peng (44414 ; see also 44H2, 5252) have 
shown that in the system composed of particles with 
spin and electromagnetic radiation detailed balancing 
does not apply. 
The principle is of great importance for its applica- 
tions, for instance, in the consideration of rate processes 
(11K1, 15M, 23C2). If, for instance, we wish to calcu- 
late the number of collisions in a gas which lead to 
excited states of amolecule—which may be a chemically 
active state—we can calculate the number of collisions 
leading to deactivation and use detailed balancing to 
give us the first number. Since the second number is 
more easily computed than the first, we have a clear 
gain. 

In 1925 an article by Lewis (2511) evoked a discus- 
sion (25F2, 25L2, 25T) of the principle of detailed 
balancing, called by Lewis the principle of entire equi- 
librium or the law of reversibility to the last detail.!5 
Fowler and Milne especially give an impressive list of 
applications. They point out that the principle was to a 
large extent inspired by Einstein’s classical paper 
(17E2) on transition probabilities. They also point out 
that the principle is really an extension of the ideas of 
Kirchhoff (60K) while it was probably first formulated 
by ‘Richardson (14R1, 24R). 

In 1916 Langmuir (16L) applied the principle to the 
problem of evaporation and condensation, stating: 
“Since evaporation and condensation are in general 
thermodynamically reversible phenomena, the mecha- 
nism of evaporation must be the exacl reverse of that of 
condensation,’ even down to the smallest detail.” 

In 1921 Klein and Rosseland (21K) used the prin- 
ciple of detailed balancing in their discussion of in- 
elastic collisions between atoms and electrons. Franck 
and Cario (22F, 22C1, 22C2, 23C1, 23F4, 244) ap- 
plied it also to collision processes. Fowler (24F1, 24F2, 
24F3) applied it to the phenomena of capture and 
loss of electrons by high velocity œ particles. Becker 
(23B1), Kramers (23K), and Milne (24M) applied it to 
the photoelectric process of ionization and capture. 
Eddington (22E, 24E1) used it in his investigations of 
the stellar absorption coefficient. Pauli (23P) and Ein- 
stein and Ehrenfest (23E2) discussed with its aid the 


scattering of radiation by electrons. Dirac (24D) used. 


it to discuss multiple collision processes. Lewis (25L2) 
used it to discuss Planck’s radiation law. From this list 
it can be seen that the principle is certainly one of the 


18 The statement in ESM that in quantum mechanics the prin- 
ciple of detailed balancing always holds is incorrect. 

i See also the recent paper by vên Kampen (54K) which shows 
that the same applies to situations where magnetic fields are 
present. See also a paper by Klein (S5K1) showing that detailed 
balancing does not hold generally for nonequilibrium steady states 


(compare also 5412). n ay eee 
45 Tolman (24T, 25T) sometimes uses the expression “principle 
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more important ones and one which can be applied to 
a large number of different problems. 
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Introduction to Some Recent Work in Meson 
Theory 


G. C. Wick 
Carnegie Institute of Technology, Pittsburgh, Pennsylvania 


INTRODUCTION 


HE aim of this article is to present a unified and 
relatively simple discussion of meson theory, 
which takes into account some of the most recent and 
promising developments, without requiring from the 
reader any previous knowledge of special field theoretic 
methods. Familiarity with ordinary nonrelativistic 
quantum mechanics (and its application to scattering), 
and with the main experimental facts of meson physics,! 
should suffice to read this article. Some field theoretic 
ideas and methods, such as the Feynman-Dyson graphs 
and renormalization, will be explained as we go along, 
in the very limited and simple form, in which they 
occur here. On the other hand, I have relegated to 
Appendix A such more familiar topics as charge inde- 
pendence and isotopic spin. An easy access to these 
things is already provided by current textbooks.” 

My aim could only be achieved at the cost of a 
drastic restriction in the subject matter. This is no 
review of the present status of the theory, and the 
picture I shall present is deliberately one-sided: I have 
selected ideas and results according as they fitted into 
the chosen pattern. For example a discussion of com- 
pletely relativistic schemes was out of the question 
from the start; but even such more closely related topics 
as the important “strong” and “intermediate” coupling; 


1 For a recent review see M. Gell-Mann and K. M. Watson, 
Ann. Rev. Nuclear Sci. 4 (1954); also Henley, Ruderman, and 
Steinberger, Ann. Rev. Nuclear Sci. 3 (1953). y 

2 See for example J. M. Blatt and V. F. Weisskopf, Theoretical 
Nuclear Physics (John Wiley and Sons, Inc., 1952), Chap. 3; G. 
Wentzel, Quantum Theory of Wave Fields (Intêrscience Publishers, 
Inc., 1949), Chap. 3; H. A. Bethe and F. de Hoffman, Mesons 

Row Peterson, 1955). 

‘ 3See G. Wentzel, nE Modern Phys. 19, 1 (1947); for more 
recent literature see T. D. Lee and R. Christian, Phys. Rev. 94, 
1760 (1954); the “weak-coupling” viewpoint is discussed amongst 
other things in R. E. Marshak’s Meson Physics (McGraw-Hill 
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approximations were omitted in favor of a fuller 
presentation of the main subject. 

To put it briefly, then, the form of the theory de- 
veloped here is the so-called symmetric pseudoscalar 
theory with the Yukawa (i.e. linear) coupling of the 
meson field to infinitely heavy nucleons. This implies a 
form of the Hamiltonian [see Eqs. (1.10) and (1.14)], 
that was already extensively studied,‘ among several 
alternative forms, in the early 1940’s. The detailed 
experimental information available today allows us to 
discard many of the alternative forms on mere inspec- 
tion. They assume either the wrong spin, or the wrong 
parity or the wrong number of charge states for the 
meson. Some of these theories, of course, still have an 
interest for the specialist; they provide him with guinea 
pigs, on which he can try out mathematical tricks, 
etc. From general pedagogical considerations I felt, 
however, that it was better in this report to stick to 
the one theory which has at least a fighting chance at 
the beginning. » 

Iam happy to acknowledge here my indebtedness to 
the work of G. F. Chew.® Thanks to Chew’s efforts, 
the Yukawa theory has at last achieved some contact 
with the quantitative aspects of meson physics. I am 
well aware that a wide range of opinions is possible, as 
to how deep that contact goes. For the purposes of this 
article, however, Chew’s work also contained another 
useful lesson. It showed, namely, that it was possible 
to apply to this theory ideas and methods derived 
from modern field theoretic work, without getting in- 
volved into too abstruse formalisms. In order to achieve 


‘See W. Pauli, Meson Theory of Nuclear Forces (Interscience 
Publishers, Inc., 1946). 

5 An especially simple and instructive example has been ex- 
pressly constructed for this papon by T. D. Lee [Phys. Rev. 95, 
1329 (1954)]; see also, G. en and W. Pauli, Dan. Math. Fys. 
Medd. (to be published). BP, 

°G. F. Chew, Phys. Rev. 94, 1748, 1755 (1954); Phys. Rev. 
1669 (1954). Š x, ai ey 
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my purpose, therefore, I only had to carry the process 
a little further, and give an elementary proof of some 
statements which in Chew’s paper are simply taken 
over from Dyson’s well-known, but rather difficult, 
discussion of the renormalization of the S-matrix. 

The key to this proof is Eq. (5.39) which I derived 
for this purpose. The same equation, it turned out, 
had been obtained a little earlier by Low from a less- 
elementary argument. Low, however, built on this 
equation a new and promising approach’ to the scatter- 
ing problem which has been further developed by Chew 
and Low.’ I have been able to give an account of this 
more recent work, insofar as it was available to me from 
communications at the 1955 Rochester Conference. 

It is mostly because of all this recent work being in 
the process of publication, that I have given up any 
attempt to include a detailed comparison between 
experiment and the results of the theory. I am afraid 
many readers will find that the theory as presented 
here sort of stops in mid-air. I felt, however, that to 
present in detail the calculations based on the older 
approach of Chew, now being rapidly superseded, would 
not have made much sense. As to the newer results, it 
was better to leave the word to the original authors. 

I feel, however, that I shall have attained my purpose 
if this article were accepted as a useful introduction to 
the study of some of these new and interesting develop- 
ments. 


1. BASIC ASSUMPTIONS: HAMILTONIAN 


The mesons we are interested in here are the 7 mesons 
or pions. We shall refer the reader to some excellent 
summaries already quoted, for the experimental evi- 
dence indicating that the spin of a pion is zero, and that 
an “intrinsic” parity change is associated with the 
creation (destruction) of a meson. By that one means 
that the parity of a state containing one meson with the 
orbital angular momentum /, when compared with the 
parity of the state with no meson (all other things being 
the same), is not, as one might expect, (— 1)! but rather 
(—1)**, the additional factor — 1 being the “intrinsic” 
parity of the meson. 

Free mesons are then described by the Klein-Gordon 
equation; the possible states of a meson are given by 
the eigensolutions of the wave equation 


Au+ku=0 (1.1) 
the energy w of the meson being given by 
w=wr= (u?+R?)}, (1.2) 


where p is the meson mass. (We use units #=1, c=1.) 
We shall assume for simplicity a finite quantization 
volume. For most purposes it is convenient to assume 
a cubic volume with periodic boundary condition so 


7F, Low, Phys. Rev. 97, 1392 (1955). I wish to thank Dr. Low 
for advance communication of his results and for an illuminating 
conversation. A 

8 G. F. Chew ‘and F. Low (to be published). 
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that the solutions of (1.1) are of the form 


y= Ne** 


(1.3) 


where N= (quantization volume)~} is a normalization 
factor. The number of eigenstates in the volume element 
dk of momentum space is then 


N-?(2r)—*dk. (1.4) 
The orthogonality condition has the form 
foe (Jine (X) dX = bx (1.5) 


where ôxw is the Kronecker symbol, = 1 if k=k’ and =0 
otherwise. The essential properties of this symbol can 
also be expressed by the rule 


Df (kK) bx =f (k’). (1.6) 


When f(k) is a continuous function, the sum in (1.6) 
may be replaced by an integral, according to (1.4) 
provided ôkw is replaced by a Dirac delta function 
according to the prescription 


5x42 N2(2)5(k—k’). (1.7) 


In order to specify completely the state of a meson it 
is necessary to give, besides k, also an index à, which 
has three possible values, corresponding to the three 
possibilities 7+ and 7°. We shall use a lightface index k 
to summarize k and A. Thus >>, means >>, combined 
with the integration (1.4). 

In addition to mesons, we may have one or more 
nucleons, photons, etc. For the moment let us consider 
only nucleons. We shall treat the nucleons as infinitely 
heavy and at rest; thus we shall not need any variables 
(say, momenta) to describe their transitional motion. 
We shall, however, need variables to describe their spin 
and charge states (see later). 

The theory we are considering takes over from the 
original Yukawa theory the assumption that mesons 
are emitted or absorbed by nucleons singly, like photons 
by electrons.® This refers of course to the elementary 
acts implied by the Hamiltonian: that is the Hamil- 
tonian has matrix elements only between states differing 
by the emission or absorption of one meson. 

More precisely the Hamiltonian consists of two parts 


H= H+, (1.8) 


where Ho, the unperturbed Hamiltonian, is simply the 
sum of the kinetic and rest energy of the mesons present 


Ho= > MOK (1.9) 


where ny is the number of mesons in the state k= (EA) 
and w, is given by (1.2). We do not have to include 
in Ho a term for the nucleon or nucleons; since We 


* A simple and lucid introduction to these concepts is provided 
by E. Fermi’s beautiful booklet Elementary particles (Yale Uni- 
versity Press, New Haven, 1951). 
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neglect recoil as well as the 
ence, this term would be a constant of the motion and 
can be ignored. We neglect the mass difference between 
charged and neutral mesons, 


ot r so that wp=w(k) inde- 
pendent of A. Since pions have spin zero, we assume 
that they obey 


Bose statistics, so that n can take anv 
value 0, 1, oe tees As to X, the “interaction” Hamil- 
tonian, 1t consists of a sum of terms, one for each 
nucleon prêsent. The number of nucleons is a constant 
of the motion; if there are no nucleons 5C=0 and the 
mesons are completely free. In this respect the theory 
is fundamentally different from the relativistic theories, 
in which the possibility of the creation of nucleon- 
antinucleon pairs leads to some interaction in all cases, 

We then need to write 3¢ only for the case of a single 
nucleon; if there are more nucleons, each will be repre- 
sented by a similar term. We may also assume for 
simplicity that the nucleon is placed at the origin. Let 
us introduce absorption operators a; and creation 
operators a,“ defined in the usual way for bosons.’ 
That is, the matrix element of a, is equal to one when 
it is taken between a state with no meson in the state 
k and a state with one meson in the state & (all other 
occupation numbers being the same for the two states). 
More generally if the state at the right contains 1; 
mesons in the state k, and the state at the left con- 
tains n’ =n;=—1 mesons in k (all other occupation 
numbers being unchanged) the matrix element of a; 
is equal to mk}. This has the effect that the prob- 
ability of absorption from a state containing n mesons 
is proportional éo 7, as one might expect. If n.—1,'%1 
or if np— np #0 for any of the other states p, then the 
matrix element of a, is zero. The creation operator 
a,* is simply the Hermitean conjugate of ar. 

The assumption that mesons are absorbed or emitted 
singly is then simply expressed by saying that J€ is 
linear in the a,’s and a;*’s. That is 


R= VH aV). (1.10) 


In determining further the form of V, and V;* one 
makes use of the additional requirements of charge 
conservation, charge independence (see Appendix A) 
and conservation of angular momentum. It will be 
most simple to write down at once the form of V+, and 
justify it piece by piece afterwards. One has 


Vr=N (4r)? (f/u) rik -ov(k) (2w). (1.11) 


Beginning from the left we have: (4m)#f is a proportion- 
ality constant which characterizes the strength of the 
interaction; the (47)? is sometimes omitted, the value 
of f must then be correspondingly larger and is then 
called a “rationalized” coupling constant. The y™ is 
introduced to make f dimensionless To make the 
absorption probability proportional to the meson den- 
sity (i.e. inversely proportional to the quantization 
volume) WV is necessary. Next come the most significant 
parts of the expression. We have so far only spoken of 
the changes that occur in the meson occupation num- 
d 


neutron-proton mass differ- 
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bers. Changes must occur, however, also in the nucleon. 
Owing to charge conservation, for example, a r* can 
only be absorbed by a neutron becoming a proton, and 
a a~ by a proton becoming a neutron; a 7° can be 
absorbed by either one, without change. This may be 
described by treating neutron and proton as two differ- 
ent states of the same particle, the nucleon, character- 
ized by an “isotopic spin” variable, = +4 for the proton 
and —4 for the neutron. Then all we have to do is to 
insert in V; a factor 7, representing an operator on the 
isotopic spin variable which has nonzero matrix ele- 
ments only for the appropriate transitions. The precise 
form of these operators is discussed in Appendix A, 
under the more stringent assumption of charge in- 
dependence. 

Just as 7, represents the possibility of changes in the 
charge of the nucleon, accompanying the emission or 
absorption of a meson, in a similar way the next term 
ik-o takes into account the possibility of ordinary spin 
changes. We may notice here that the most general 
form of an operator acting on the spin variable is 
a0,+4,+4.9,, where on grounds of invariance ana, 
must transform under rotations like the components of 
a vector. The only vector available, however, is k the 
momentum of the meson, hence the form k-o in (1.11). 
One must further point out that since k is a polar but 
ø an axial vector, their product is a pseudoscalar. Thus 
under an inversion at the origin V, is not invariant, 
but instead changes sign. This, however, is just as it 
should be on the assumption that the meson has nega- 
tive intrinsic parity. This remark also explains why 
k-o cannot be accompanied by a spin-independent 
term, in the form A+k-o where A is a scalar quantity. 
The matrix element cannot be part scalar, part pseudo- 
scalar; it has to be one or the other, depending on what 
“intrinsic” parity is assumed for the meson. 

Finally (1.11) contains a factor v(2)(2w,)~} which 
depends only on the magnitude of &, not on the direc- 
tion. About the precise form of this factor it is not 
necessary to make any assumption, except that,o(k)—0 
when & becomes very large. If this assumption is not 
made, virtual emission and reabsorption of mesons of 
large momentum gives rise to divergent expressions, 
much in the same way as virtual emission and reabsorp- 
tion of quanta gives rise to divergencies in quantum 
electrodynamics. 

The reason why the last factor in (1.11) is written 
in the composite form o(&)(2w)~4 rather than simply 
v(k) is that (1.11) is often obtained from field theory as 
follows. Writing explicitly 7k for k, dx» for a; we con- 
sider the expression (see below) 


Or(xX)= Nz (2ez) Hane Hae (1.12) 


and write v(&) as the Fourier transform of a function 


p(x) 


+ (1.13) 


ies f eik-tp(x)dx. 
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This does not involve any really restrictive assumption 
about o(k). Besides, since v depends only on k= |k], 
p will similarly depend only on r= |x]. With (1.12) 
and (1.13) it is now possible to write (1.11) as follows: 


f 
He (An)! Fan f aanta (dd) 


which is the interaction suggested by field theoretic 
considerations. In this case p(x) plays the role of 
“extended source distribution” for the nucleon. 

In field theory, (1.12) is just the meson field operator; 
the most obvious assumption would be that the inter- 
action involves only the value of ¢, or respectively Vo 
exactly af the position of the nucleon. That is like say- 
ing that the nucleon is point-like, p(x) is the delta 
function ô(x) if the nucleon is at the origin; this, how- 
ever, leads to v(k)=1 for all values of k, which leads 
to divergencies. It is necessary to attribute a finite 
extension to the nucleon in order to get a convergent 
theory. 

Two more remarks: if the nucleon is not located at 
the origin, but say at xn, then Eq. (14) suggests imme- 
diately that all we have to do is to replace p(x) by 
p(x—2,). The effect of this is that the value of Vi, 
Eq. (1.11) is multiplied by exp(k-x,), the amplitude of 
the meson wave function at the nucleon (as one might 
expect). Such factors become important when one has 
more than one nucleon, as in the nuclear force problem. 

Secondly, we must notice that (1.11) implies that 
mesons are absorbed or emitted in p-states only.'° The 
transformation which exhibits this fact explicitly is 
often useful, and we shall therefore mention it here. 
Instead of using for the free mesons the plane-wave 


states Eq. (1.3), one can introduce states of definite 
angular momentum 


Vkim(X) = N’ji(kr) Y m(x) (1.15) 


where jı is the “spherical Bessel function” of order / 
which hehaves asymptotically as (kr) sin(kr—}lr) 
and Fm is a normalized spherical harmonic depending 
only on the direction of x(S |Y |?d42=1). Finally N” is 
a normalization factor which like WV occurs also in the 
expression for the number of states in the interval dk 


(N')-2(2/m) dk. (1.16) 


It is clear that the transformation from plane to 
spherical waves only makes sense in the limit of infinite 
volume (spherical waves require a spherical box), so 


10 This is, of course, an immediate consequence of angular 
momentum and parity conservation in the elementary act. A 
nucleon has angular momentum 4; after a meson of orbital 
momentum / is emitted the total angular momentum of the 
system can only take the two values 1+4, and this can only be 
equal to 4 for either /=0 or l=1, That is, this excludes emission 
into other than s of p states. The final decision rests with parity. 
If the meson were a particle of + intrinsic parity it would be 
emitted in an s state, the opposite assumption leads to p-wave 
emission. — 
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we shall carry out the calculation in this limit. Now 
Va, Eq. (1.11), represents the matrix element for the 
absorption of a meson from a state (1.3) and according 
to the general rules of transformation theory, in order 
to calculate the corresponding matrix element for a 
state vpım, Eq. (1.15), we must multiply (1.11) by the 
scalar product 


(apn) = fD aan 


and sum over k. The integral (1.17) can be easily 
carried out in polar coordinates by means of the well- 
known expansion of a plane wave in spherical har- 
monics; for the reason just mentioned, the integration 
over r= |x| must extend from 0 to infinity. One easily 
finds 


(t)Upim) = 2NN'(— i)! Y tm (Kk) (m/k)’ô (p= k). (1.18) 


After multiplying (1.11) by (1.18) we carry out the 
sum over & in the limit of infinite volume, that is by 
means of (1.4), whereupon NV cancels out of the results, 
as of course it should. The absorption operator Vpim 
(which replaces Vx) is then seen to depend on the integral 
of (o-k)Yim(k) over the direction of k, which is zero 
if 11. 

Having to do with J=1 only we may simplify the 
notation by dropping } from the indices of vpım alto- 
gether. Furthermore, we may choose as basic spherical 
harmonics Ytm(k) simply the components of k 


Vim(k)—>(3/4r)'ek, (i=1,2,3). (1.19) 


The function v42m may be designated by vri and the 
absorption operator by Vra (where k now is the magni- 
tude of the momentum and the indices 7 and à both run 
from 1 to 3. Finally one finds 


Vin=N/37(f/u)kv(k) (2w) oiTa. (1.20) 


In this way one has the advantage that only p-states 
for the mesons appear in the calculation; owing to the 
similarity of ø and r matrices one sees furthermore that 
the isotopic index à and the angular momentum index 
i play completely similar and interchangeable roles in 
the calculation." 


2. THE “REAL NUCLEON” STATES 


All physical problems we want to discuss next can 
be reduced to a study of the eigenstates of the Hamil- 
tonian (1.8). For example, all the information on meson 
scattering is contained in certain eigenstates described 
in Sec. 5. 

We shall be concerned, in this section, with the lowest 
eigenstate of the Hamiltonian (1.8), which, if the theory 
has anything to do with reality, must represent a free 
nucleon. To be sure, this nucleon will be surrounded by 


u As remarked by F. H. Harlow and B. A. Jacobsohn, Phys 
Rey. 93, 333 (1954). This immediately leads to identities such ás 
531= 613. See Sec. 5. 
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a virtual meson cloud, but in the state in question 
pices no a (incident or outgoing) mesons. 
I aim of studying this eigenstate might 
be, of course, the calculation of the level shift due to 
the perturbation (1.10), i.e., the so-called “nucleon 
self-energy,” which in a cut-off theory is a finite and 
well-defined quantity. So far nobody has dared suggest 
however, that the self-energy effect due to (1.10) is 
connected o any observable mass or mass difference. 
In the work of Chew, as well as in the more sophisti- 
cated relativistic field theories, one manipulates the 
formulas in such a way that the self-energy (and other 
quantities, which are similarly regarded as unobsery- 
able) do not appear in the physically significant results. 
This gives rise to the following paradoxical situation: 
While the self-energy is an unobservable quantity which 
one does not have to evaluate, it is nevertheless neces- 
sary to go into a detailed formal discussion of it, as a 
preliminary to its elimination from the results. 

This detailed discussion will be carried out presently, 
and will also give us an opportunity to prove certain 
relations, which are useful in the elimination of other 
unobservable quantities from the theory. 

As explained in the Appendix, the Hamiltonian (1.8) 


- allows four “good” quantum numbers J, J., T, and T3. 


A state with J=} and T=3 is fourfold degenerate, 
corresponding to J-= +4 and T;=+}. The nucleon is, 
of course, assumed to be in such a state; this leads to 
the assignment of total angular momentum J=}, and 
total isotopic spin T=} to the lowest level of the 
Hamiltonian. | 

One assumes that as f is decreased continuously from 
its actual value to f=0, the aforementioned fourfold 
degenerate level remains the lowest level of the system. 
That is why the lowest level of Ho, i.e., the “bare 
nucleon” is also assumed to have the same fourfold 
degeneracy. We shall use a four-valued index «œ as an 
abbreviation for a pair of values (J., 73) to designate 
the four possible nucleon states, and we can obviously 
use the same index for the corresponding bare nucleon 
states. 

As we already see on this example we shall have to 
do with two types of states: the eigenstates ¢1, $2, ---, 
$n, ++: of the unperturbed Hamiltonian Ho, and the 
eigenstates of the exact Hamiltonian H, for which 
we shall employ Dirac bras and kets. In particular 
we shall use the notation ġe with a Greek index (a=1, 
---, 4) for the four bare-nucleon states, and |a> for 
the corresponding real nucleon states. 

Our somewhat hybrid notation has the advantage 
of making the tw6 kinds of states more clearly dis- 
tinguishable.!? Matrix elements of the type <B|---|a> 
will be called “nucleon expectation values”; they play 
a similar role as “vacuum-expectation values” do in 
other work. The corresponding bare-nucleon expecta- 


12 This notation becomes rather clumsy if one has to consider 


‘mixed scalar products between the two types of states, as in Eq. 


(2.17). However, we shall usually avoid this. 
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tion values will be written: øg” -Qa [see for example 
Eq. (2.3) ]. 

Starting from ġa one can construct all other unper- 
turbed states ¢, containing one and only one nucleon, 
by applying to ¢, one, two, -meson creation oper- 
ators, thus obtaining the complete orthogonal set 


Qa (no mesons), 
Pia= ar ba 


k z 
Prqa= ar lq ba 


(one meson) 
(two mesons). 


(2.1) 


These states satisfy the unperturbed Schrödinger equa- 
tions 


Hopa=0, (2.2a) 
Higia= Obra, (2.2b) 
eee ewe etc. 


and are normalized to unity.” 

We shall designate with ICa the matrix element of 
the perturbation operator between any two states Øn 
and ġm of the sequence (2.2) 


Ham=n* ID. (2.3) 


We notice in particular that (2.3) is zero if both ọn 
and ¢,, are bare-nucleon states 


5ag=0. (2.3") 


We are now in a position to discuss the application of 
perturbation methods to the complete Schrödinger 
equation 


(Ho+5)|>=E|> (2.4) 


where in particular we designate with Æ, the eigenvalue 
for a state |a>. Comparing with (2.2a) we see that F, 
is at the same time the level shift, i.e., the nucleon 
self-energy. The state |a> can be expanded in the 
orthogonal system (2.1), the coefficients being the so- 
called zero-meson, one-meson, --- amplitudes. The 
zero-meson (or bare-nucleon) part of the expansion 
must have the same J, and 73, i.e., the samfe Greek 
index, as |a>, hence we can write 


la> = Copa H} nonOn (2.5) 


where >>’ (here and henceforward as well) is used to 
denote a summation excluding the bare-nucleon states. 
One can, as usual, rewrite (2.4) for the state (2.5) as an 
infinite system of linear equations in the amplitudes 

“*, Ca ++. The first of these equations is for example 


Eco= D s Ieren (2.4a) 


where E is now, of course, just E,. The remaining set 
of equations, which the reader can easily write down 
for himself, can be divided by co and solved for the 


13 This requires the addition of numerical factors on the right- 


Co, * 


hand side of (2.1) when some of the indices $, q --+ are equal. 
E.g. a vom 
xta= (1/V2)qy k a, **-; ete. 7 b 
. a ° ye 
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ratios Cn/Co by an obvious perturbation expansion," 
yielding i 


€n/Co=Rna/ (E— En) ar E ae AA En) (E-— En) 


HE'I nmdC mpIC la/ 
(E-E) (E—En)(E—E)+ +++. (2.6) 


Inserting this back into (2.4a), we find for the eigen- 
value Æ the transcendent equation 


KanICna 
BLOE n 
E-E, 


HanICumICma 


paene (227) 
(E—E,)(E—E») 


SL em 


[we may notice, incidentally, that in our case the terms 
of odd order in (2.7) drop out]. It is to be understood 
that Æ, is, in particular, that root of (2.7) which is of 
order of f? when f—0. We shall see in Sec. 3 that >> (E) 
has the same meaning here as in the literature.° 

For the sake of later applications, it is desirable to 
rewrite (2.5), (2.6), and (2.7) in a more compact nota- 
tion. Let us introduce a projection operator A defined by 


AX= e' nbn(On X), (2.8) 


where A projects an arbitrary state vector x onto the 
subspace orthogonal to the bare-nucleon states ġa. We 
may also say that the matrix representing A in the 
system (2.1) is diagonal, with diagonal elements Aaa=0, 
Ann= 1 (na). The restriction expressed by the apex in 
Eqs. (2.6) and (2.7) can then be replaced by the inser- 
tion of factors Ann between matrix elements of JC. 
Furthermore, one can write 


|a>=cobatAla> 
and replace (2.6) by 
Ala> = col (E— Ho) 1+ (E— Hy) 1A5¢(E— Ho) 
+ (B— Ho) A3C (E— Hp)“ ASC (E— Hy) + «+ + JASC ba 
which is an expansion of 
Ala> =co(E— Ho— ASC)“ Aha (2.6) 


where one can further save the last factor A since owing 
to (2.3’), 


AKpa= Kha. (2.9) 
Similarly, one can abbreviate (2.7) as follows: 
E (Z)=¢.*5 (E— HoA) Ipa (2.7’) 


14 This method is known as the Brillouin-Wigner form of per- 
turbation theory, and is characterized by the fact that the eigen- 
value itself is not expanded in powers of f. The result (2.7) may 
Jook superficially like the customary Schrédinger expansion, but 
of course it is not since it gives Æ only implicitly. The explicit 
expansion of Æ in powers of f can be obtained from the implicit 
formula (2.7) by known methods; a little thought will show that 
the structure of the higher-order terms in the explicit expansion 
is far less simple than that in Eq. (2.7). This is why the latter is 
much more suitable for the discussion of the following section. 
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which at first sight may not seem a real quantity 
because AJC is non-Hermitean. Notice, however, that 


HyA=AHo XSA. (2.10) 


It is then casy to see that the value of (2.7) does not 
change if ASC is replaced by the Hermitean A3CA. [To 
prove this, make the replacement in the expanded form, 
and use (2.9). ] 

Let us now assume that |a> is normalized, so that 


1= <ala> = | co {1+ pa" IC (E—Ho— AI) Iha} 


where use has been made of (2.5’), (2.6’), and (2.9), 
and where, of course, Æ must be set equal to Æ. 

Comparing with (2.7’) we see that  |co|* is equal 
to Zs, the “renormalization constant” which (in analogy 
with the procedure described by Dyson, Ward et al. for 
the relativistic case) is defined in Chew’s paper by 


Zr!=1— (d2 /dE)E=Es. (2.11) 


Notice that |co|? represents the probability of finding 
no mesons around the nucleus; in the following, we 
shall dispose of the arbitrary phase factor in the state 
(2.5’) by choosing co real and positive so that 


and: 


(2.12) 


With Eqs. (2.11) and (2.12), we have proved a rela- 
tionship which plays an important role in the elimina- 
tion of unobservable quantities from the theory. 

We wish to conclude this section with another rela- 
tionship, which plays a similar role. Consider the 
matrix element 


oo=Z2}. 


Mea= <B|om|e> (2.13) 


which, as we shall see, plays an important role in the 
theory of meson scattering (and also in photomeson 
production). Apart from trivial factors it is the matrix 
element of the meson absorption, or emission, operators 
(1.11) and (1.20); it is, however, the matrix element 
between real nucleon states, and not the ordinary matrix 
element 

Mg.°= $6" 0 Taba, (2.14) 


which plays the central role in a more primitive ap- 
proach. Since, however, c; and 7, obey simple commuta- 
tion rules with the components of J and T, and since 
|a> and ġa have the same quantum numbers J, Jz, T, 
T; it is possible to prove by the customary methods" 
that the ratio of (2.13) to (2.14) is a proportionality 
constant independent of œ, 8, and à. The fact that this 
ratio is #1 is, of course, an effect of the meson cloud 
surrounding the nucleon and is described graphically 
as a change in the strength of coupling. Thus one writes 


o JMisa = frMga? (2.15) 


16 T, D. Lee, Phys. Rev. 95, 1329 (1954). This proves inciden- 
tally that 0<Z.<1. For the case of relativistic theories see H. 
Lehmann, Nuovo cimento 11, 342 (1954); M. Gell-Mann and 
F. E. Low, Phys. Rev. 95, 1300 (1954). i 

16 E.g., Condon and Shortley, The Theory of Atomic Specira, 
(Cambridge University Press, New York, 1935), Chap. II. 


| 
i 
Í 
] 
| 
| 
| 
| 


‘, 


Digttized by Arya Samaj Foundation Chennai and eGangotri 


MESON 


where f, is a “renormalized coupling constant,” which 
for a purposes may be defined by Eq. (2.15). Since 
the definition usually given is another one 

s , 
fr=fZoZ 


where Zy is another renormalization constant, let us 
briefly mention the relationship between the two defi- 
i 17 Fy ati ', g 

tions." Evaluating (2.14) by means of (2.5') and (2.12), 
we see that 


(2.16) 


Mga= ZMH co <B [Aonga 
Holp onA la> -+ <B| AoA la>. (2.17) 


Of the four terms on the right, the second and third 
are zero (A commutes with o7, and Aga=0), the fourth 
may be transformed by means of (2.6') and (2.9). If 
now we define the renormalization constant Z, by 
means of 


Zr Mgt = Mga? 
+Hoe (2, — Ho— A) on (E, — Ho— AIC) !ICha, (2.18) 


we see that (2.17) is equivalent to (2.15) and (2.16). 
Now it is easy to see, by a discussion of the kind de- 
veloped in the next section, that (2.18) is only a short- 
hand for the definition of Z, ordinarily given. We shall 
not-go into this, however, in any more detail here. 


3. GRAPHS 


Perturbation expansions like Eq. (2.7) can be dis- 
cussed most easily in terms of “graphs.” The graphs 
which occur in*the theory we are discussing are simpler 
and less general than those occurring in the full-fledged 
relativistic theories. Just for this reason, the present 
theory is a very good example with which to begin the 
study of graphs, for the reader who is unfamiliar with 
them (other readers may skip this section altogether). 

The idea is to represent each nucleon by a straight 
horizontal line, and each meson by a dashed line. A 
dashed line terminating on a nucleon line quite naturally 
represents the absorption or emission of a meson. If one 
wishes, one can append to a meson line an index k, 
(or p, +, etc.) specifying the momentum k and charge 
state \ of the meson. 

Let us now see how this works in practice. Consider 
the first term on the right-hand side of (2.7). Omitting 
the summation sign, the quantity to be evaluated is 


of the type 
Ianna) (E En); (3.1) 


where a (which in our case equals y) is a bare-nucleon 
state, and the intermediate state 7 is clearly of the type 
xg in the sequence (2.1). One associates conventionally 
5Cna to the transition from @ 7 7, Wan to the inverse 
transition; the précess to be described is clearly the 


A 5 onthe 
11T. D. Lee (reference 15). Equivalent relationships in 
relativistic theories have been known for some time (J. Schwinger, 


7 unpublished); see also H. Umezawa and S. Kamefuchi, Progr. 


Theoret. Phys. Japan 6, 543 (1951); G. Kallen, Helv. Phys. Acta. 
25, 417 (1952) and the papers prone quoted. 
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Fic. 1. Simplest self-energy graph representing the emission and 
reabsorption of a meson in a state k. 


emission and reabsorption of a virtual meson, the corre- 
sponding graph is Fig. 1. For completeness we have 
also added indices « and y to the bare-nucleon “stumps” 
at the ends; the reader is asked to notice that æ is 
written to the right and y to the left, just as they 
appear in (3,1). In our case, of course, this does not 
matter, since a=, but in the ‘ransition processes to be 
considered later the end states are different. It is then 
somewhat more convenient to have the indices appear 
in the same order on the graph as in the formula. 
Since æ will then be the initial, y the final state, one 
also says that, according to our convention, “time runs 
from right to left” on the graph; events appearing at 
the right are “earlier,” those at the left are “later.” 
Now in the actual calculation of (3.1) one will 
notice that for a given state # (i.e., given k8) the only 
terms of 3, Eq. (1.10), contributing to Kaa and Feya 
are, respectively, a,*V;.* and aV. Since moreover the 
matrix elements of a;* and a, are equal to \/1, we have 
Rra= pi Vi ba= (Vi) fey 
KR, n=, Vids T (Vx) a. (3.2) 


Here an important simplification occurs, owing to the 
fact that the energy of the intermediate state Ea= wy 
is independent of 8; the same simplification will be 
seen to occur in the general case. Namely, in the summa- 
tion n=} :} 3, the first step can be carried out 
symbolically 


Poal V e)ra V i Jga= (ViVi ra * 


In so doing, V; and V;,* are regarded, of course, as 
4X4 matrices (operating on the nucleon spin and iso- 
topic spin variables). One then has 


ViVi = dr N? (f/u) | v (k) | z (2w)! ™ (k $ o)*, (3.4) 
where one further has m?=1, (k-o)?=A*. Remember- 
ing (1.4), ; 


Eene =o f de, 


(3.3) 


so that finally we have 


TanTCna RT dike 
Ea OH) f kiai Be 5) 


E eax (E— w) 
* 
We have presented all tHe steps in this pedantic detail, 
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Fic. 2. An “improper” self-energy graph which does not occur in 
the evaluation of Eq. (2.7). 


so that we can proceed more speedily in the subsequent 
cases. 

Let us now go over to the fourth-order term in the 
expansion of X} (Æ) (the third-order term is zero). 
Various possibilities now arise, and the advantage of 
graphs in quickly visualizing all possibilities will appear. 
We have a product SCa:ICimiCmndIC na with three inter- 
mediate states /, m, and n. Beginning from the right, 
we see that Cra can lead only from the bare-nucleon 
state a to a one-meson state, say, ¢xa. The subsequent 
transition, however, can be either the reabsorption of 
the meson already emitted or the emission of a new 
meson; in the first alternative the stale m would be a 
bare-nucleon state, and this is forbidden by the apex. Tf it 
were not so, the process could go on with the emission 
of a new meson (5Cim) and reabsorption of the same 
(Car); the graph, which as we said is forbidden by the 
apex symbol would be that of Fig. 2. The other alter- 
native means that the state “in the middle,” i.e., m, is 
a two-meson state, dz, say. From that one can proceed 
in two ways. Clearly the two subsequent transitions 
must both be absorptions (since we must return to a 
bare nucleon at the end), but we can absorb meson k 
with 3Cim, and p with ICar or vice versa. This gives the 
two graphs, Fig. 3(a) and Fig. 3(b). We could write 
near the nucleon lines the symbols for the states (8, y, ô) 
of the nucleon in the states z, m, 7 but this is hardly 
necessary [it is done in Fig. 3(b) as an example ], since 
the summation over 8, y, and 6 is carried out implicitly 
as before, that is, the symbols 8, y, and ô do not even 
appear in the calculation. Let us also write down the 
unpertunbed energies of the intermediate states for the 
two sequences above: 


Va X 
fii: eo 
JL a ee 
8 y B 
a (b) 


Fic. 3. The two possible fourth-order graphs for Eq. (2.7). 
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[Fig. 3(a)] 


En= wk, m= OKO py E=wp, 


Crig. 3(b)] 


En= Ok, 


(3.6a) 


E= Wk. 


Em=wr twp, (3.6b) 


These values can, of course, be read off the graphs very 
quickly. An intermediate state is the state of affairs 
between two successive transitions; if we draw a ver- 
tical line intersecting the nucleon ‘line between two 
meson endpoints, the meson lines it intersects corre- 
spond to the mesons present in the field at that par- 
ticular virtual stage of the transition; in Fig. 3(b), for 
example, a vertical line between the last two meson 
endpoints gives Eı= wr. ; 

We are now in a position to write down immediately 
the fourth-order term of (23) as the sum of two terms 
corresponding, respectively, to Figs. 3(a) and 3(b): 


VeVi Vi" 
E-o) (E-o) (E01) 
VeV Vn Ve 
KE E o eE 


(3.7) 


A \ 
/ \ 
/ \ 


Frc. 4. General self-energy graphs. 


We shall not carry on this calculation any further now. 
We shall instead make some remarks on the structure 
of the general term in the expansion (2.7). 

In the term of order 2 there are 2m factors JCim, 
half of which must be used to create virtual mesons, 
and the other half to destroy them. The number of 
choices, however, becomes very large if n is large. The 
reader may try as an example to draw all the graphs 
for 2n=6. The general character of these graphs may 
be symbolized as in Fig. 4; the graph begins with an 
emission act at the right and closes with an absorption 
act at the left; in the region between the wavy lines, 
anything may happen, with one exception: there must 
not be any intermediate state with no virtual mesons. 
In other words, it must not be possible to separate the 
graph in the way indicated in Fig. 5. To be more 


specific, graphs such as this do not occur in formula 


(2.7), although they may be considered in other con- 
texts. They were called by Dyson “improper” self- 
energy graphs. A graph like Fig. 4, which cannot be 
split as in Fig. 5, is called a proper self-energy graph. 
The expression (2.7) is a sum over proper self-energy 
graphs only. 


nS 


JA y 
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Once a graph is plotted, the 
the sum can be written down im 
it is desirable to develop 


corresponding term in 
mediately. To this end 
; ; good bookkeeping habits. As 
Chew suggests, it is best in this to adhere strictly to 
Feynman. This has the advantage that the habits 
acquired here can be transferred to relativistic theories 
without effort. In the bookkeeping of “classical” per- 
turbation theory it was customary to say that in the 
virtual intermediate states “energy is not conserved,” 
By that one meant that the energies E, (that is, the 
unperturbed energy values) of the intermediate states 
can differ by arbitrarily large amounts from the true 
energy Æ of the stationary state under consideration. 
The energy denominators E—E, measure just this 
difference. 

In the Feynman bookkeeping, instead, one says that 
energy is conserved at every emission or absorption 
act, and the energy of the nucleon changes accordingly. 
Thus in a graph like Figs. 3 and 4 the nucleon enters at 
the right with energy E, and emits a meson of energ 
wr; it then has an energy E—w,. If now it emits a new 
meson wp, it will have an energy E—w,—w, and so on. 
Clearly Æn (the energy of the virtual mesons present 
in the virtual state ) represents the sum total of the 
energy of the mesons previously emitted minus those 


/ \ 
/ ‘N / \ 


....... .ssssos 


. 
Fic. 5. Improper S.-E. graph: a bare-nucleon state occurs 
in the middle. 


reabsorbed by the nucleon. Hence, the energy denomi- 
nators are now simply the “energy of the nucleon,” as 
defined previously, in the subsequent segments of the 
nucleon line, between two successive elementary acts. 
The most natural thing is then to write operators Vz 
and V;* and the energy denominators in the same order 
as they appear on the graph, thus, for Fig. 6, 


1 1 1 
Ve 


E—wp—Wq 


V, 


E—w, 
1 
Sie 


? 
E—wr—wop E—wk 


Vi. 


These conventions apply equally well to the more gen- 
eral graphs to be encountered later, in the discussion 
of scattering processes. In that case, of course, the 
nucleon will not be assumed to enter the graph with 
an energy E. Since Æ is used todenote the total energy, 
the initial energy of the nucleon will’be reckoned as Æ 
minus the energy of the incident meson or mesons. 
If then one proceeds as we have said, assuming energy 


conservation at each step, it is easy to see again that 


= “cc 32 Ti one 
energy denominators and nucleon energy” are 


and the same thing. a 
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Fic. 6, A possible sixth-order graph, 


To conclude this discussion, we go back to Fig. 3(a), 
Fig. 3(b), and Eq. (3.7) and notice that a refinement is 
necessary. Consider that in summing over k and p, 
the case will also occur that k= p. But in this case it 
has no meaning to draw two graphs; what happens is 
that after the second emission act we have two mesons 
in state k, and since they are indistinguishable we 
cannot ask the question whether it is the first-emitted 
or the second-emitted meson which is absorbed first. 
Thus, we should not have two terms in Eq. (3.7) but 
only one. We have, however, also made a compensating 
mistake! When k=, the emission amplitude for the 
second meson is not V,* but V2V,*; similarly the sub- 
sequent reabsorption act contains another V2. The 
omission of two factors v2 just compensates for the 
mistake of writing two terms instead of one. This is 
not accidental. The reader is invited to think through 
for himself the more complicated case where the occupa- 
tion number of a given meson state goes up to a maxi- 
mum value 7. There occur then in the correct expression 
factors V2, V3, --- 4/n each an even number of times, 
and these just make up for the reduction in the number 
of truly distinct processes, as compared to the fictitious 
multiple choices indicated by the graph method. 

One might point out, that this remark is really un- 
necessary, for the following reason. What we want to 
evaluate is, for example, the sum of (3.7) over & and p. 
Now owing to expression (1.4), the part of the sum 
with k= is negligibly small (when the quantization 
volume is large) compared to the rest of the sum. 

We have preferred not to rely on this argument, not 
merely because it is somewhat neater to show that the 
graph method of calculation is valid irrespective of the 
size of the quantization volume, but also because there 
are cases, not encountered in this section, where the 
“volume” argument fails. 


4. MESON SCATTERING I. BORN APPROXIMATION 


With the Hamiltonian (1.10), mesor’s interact with 
a nucleon only when in a p-state. Thus only p-wave 
scattering occurs. Obviously this cannot be the whole 
story in reality, but it is a fact that p-wave scattering 
is the dominant feature at low energies (say below 
250 Mev); s-wave scattering is surprisingly weak, while, 
not so surprisingly, scattering of waves with />2 þe- 
comes important only at higher energies." 

It is Chew’s great merit, that he showed that not 


merely the qualitative fact cf the prédominance of 4 
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Fic. 7. (a) Second-order graph for meson scattering with uncrossed 
lines. (b) The same with crossed lines. 


p-wave scattering, but also the main detailed features 
of it are satisfactorily explained by this theory. This 
fact had been previously obscured by an excessive 
reliance on the predictions of lowest-order perturbation 
calculations. 

In fact, although, as we shall see, the value of the 
coupling constant f is fairly small, higher-order effects 
play an important role in meson scattering. Neverthe- 
less, we shall discuss in this section the lowest-order 
calculation, purely as an occasion for certain remarks 
to be used later. 

We wish, then, to calculate by means of perturbation 
theory the transition amplitude from an initial state 
a with a nucleon in a state ġa and a meson in a state 
p(=pr), to a final state b with nucleon in a state $¢ 


and a meson in a state q(=qu). To achieve this we - 


must absorb meson p and emit meson q; in addition 
any number of virtual mesons may be emitted and 
reabsorbed. To lowest order, however, there are no 
virtual mesons in the process, and the matrix element 
for the transition has the familiar second-order form 


HondCna 
"E-E, 


M=> (4.1) 


where Æ is the energy, which may be identified with 
either the energy of the initial state Æ, or with Es 
(since Ea= Er, necessarily). Finally we equate Ea with 
the energy wp of the incident meson, since the nucleon 
energy has been set equal to zero (see, however, Sec. 5). 
Now the transition may proceed in two ways; namely, 
3C,,, may be used to absorb meson p, and ICsn to emit 
meson q, see Fig. 7(a), or vice versa Fig. 7(b). In the 
first case E,=0 and E—En=wp in the second case 
En=wp tw Since wp=w, and p=g, in magnitude, we 
write-simply wand p inthe following. Then if we pro- 


a 
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aN A ceed as we did in the previous section, we see that 
SS ve M=MotMz, 
aN 5 Ma=V shoe Vy =Xrun (q: 0)(P:0); 
N ie My=Vp(—ogV f= —Xta7,(p-9)(q-e), (4.2) 
x A X= 2r (f/u) o(p) |a. 


The differential cross section, of course, is, given by 
|M |? multiplied by the number of final states per unit 
energy interval and divided by the velocity p/w of 
the incident meson. If dQ is the solid angle, dqg= @dqdQ, 
and dividing by the energy interval dw and using 
dw/dq=q/w, the number of states becomes dq/(27)*dw 
= (2r)-*qw- dQ. Hence, the differential cross section is 


do/dQ= (2r) |M |2. S) 


We have omitted the “volume” factor V throughout, 
but, if we had not, it would have cancelled out of the 
final result. 

Let us now go into the calculation of M in some more 
detail. We may consider separately the dependence of 
M on the variables specifying the direction of motion 
of the meson and the ordinary spin of the nucleon, and 
the dependence on the “charge states” of meson and 
nucleon. As regards the latter we have six possible 
states (A=1, 2, 3 together with neutron or proton) so 
that M is a 6 by 6 matrix. In Eq. (4.2) this part of the 
dependence is contained in the operators Ty) and T)7p. 
Instead of \=1, 2, 3 we may use the physically signifi- 
cant states w=, 79; as explained in the Appendix this 
may be achieved without altering the formalism by 
using T+} 7T-, and r instead of 11, 72, T3. This only 
requires the minor modification that since ry. and 7- 
are not Hermitean, it is necessary to pay more attention 
to the “star” in an emission operators V¿*. In (4.2) 
this means replacing 7, by 7,* when using T+. 

One gets easily for the relevant transitions the matrix 
elements specified in Table I. As regards the remaining 
variables, we can use the well-known formula 


(A-o)(B-o)=A-B+io-AXB. (4.4) 
Let us write 
q:p=qpcosd; (qp)"pXq=sind-n (4.5) 
then 
Ma=X p raTa (cos0— i sinbn: o), (4.6) 


My=X p'r7,(cosd-+7 sinbn: o). 


TABLE I. 
* 
Trana QOS" ea ee 
pont p 7 (0) 2 
mtn>rtn " 2 5 0 
ain p v2 —v2 
mp—n? p 1 1 
monn 1 1 
mnr p -=V v2 
a pn p 2 0 
rn Nn 0 2 
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Thus as an example we find, from T. 


x able I the f. W- 
ing values for M; ea e 


Reaction Value of M/XpP* 

T por p 2 (cos6—i sinfn- ø) 

T pomn —2v2 cos6 (4.7) 
3 Thorp —2 (cos0+i sinĝn- o) 


Thus for the exchange reaction | M|?~8 cos?@ with an 
average of 8/3 (over the direction of q). The elastic 
cross sections contain a so-called “spin-flip” term n-ø. 
This expression refers to the fact that if one uses the 
incident direction as axis of quantization, then n-o has 

matrix elements only between states of opposite spin. 
On the other hand, one often uses n (the normal to 
the scattering plane) as axis of quantization. Then the 
nucleon spin does not change, but the scattering ampli- 
| tude depends on the spin direction; for example, in the 
first reaction (4.7) the amplitude is 2e~® for spin down. 
o > As it happens, in this,approximation the difference 
between the two is only a phase factor, and |M]? does 

not depend on the spin direction, in fact 


| |M|?=4; (mpr p) or (rtp—n*s). 


In this approximation, therefore, we find that the 
elastic ¢* and o7 cross section on hydrogen are isotropic 
and equal to one another. Furthermore, the ratio of 
elastic to exchange cross section for m~ in hydrogen is 
3/2. These results are in complete disagreement with 
experiment; as’ we shall see, this is only because of the 
inadequacy of the Born approximation. 
| Nevertheless, let us pursue the analysis further. The 
| raw experimental data appear, of course, as data for 
| the reactions of Table I, but they are usually analyzed 

in terms of other reactions, in which the initial and 
final states have a well defined total isotopic spin 
quantum number T (see Appendix B). Only the first 
and last reactions in Table I are of this type; states 
such as m+n, xp, etc., are not eigenstates of T?. The 
| reason for using eigenstates of T? is that the results 
i appear in a more condensed form. Let us say that 

initial and final states are characterized by quantum 
E numbers T, Ts and T’, Ts’, respectively (T=} or 3; 
| T:=T, --:, —T). Then ina charge-independent theory 
the results for the matrix M must be of the form’® 


(I'T3'|M| TT3)=M (1) orr-orsrs'- (4.8) 


| That is, as a result of reshuffling the basic states, the 
| 6X6 matrix is now diagonal, four of the elements o pe 
| diagonal being equal to M(8) and two equal to 4 (2). 

Let us now have another look at our notation. We 
have previously regarded 1° and 7,7* as 6X6 


itler’ -sis was revived by the work of K. M. Watson [Phys. 

/ Rev 85 ee and its importance became apparent when 
a the first detailed experimental data became available; see Proc. 
Rochester Conference, 1952, contributions by E. Fermi and K. A. 


ee Brueckner. C) 


| 18 W, Heitler, Proc. Roy. Irish Acad. 51, 33 (1946). Interest in 
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matrices with the matrix elements specified in Table I. 
We may also regard them as 3X3 matrices Q and 0! 
with matrix elements 


(OP = Ta Th ; Qu’ = TH py (4.9) 


which are not numbers but 2X2 submatrices. This 
simply takes account of the fact that in our notation 
the meson index X(u) for the initial (final) state is 
explicit, while the nucleon indices are implicit. Thus 
reverting for simplicity toA=1, 2, 3 instead of +, 0, —, 
and remembering that r=1, 772=173, etc., we see 
that the matrix Q, for example, has the form 


NA 1 2 3 
\ 
> 
1 1 irs —irn 
2 —ir 1 iTi 
3 iT2 —iri 1 


where the 1’s on the diagonal are, of course, 2X2 unit 
matrices. Remembering the form of the fa matrices 
[Eq. (A22)] for the meson spin we see that 

Q =1—(t-<) 

Q!=2—O=1-+ (t-z) (4.10) 


where (t-2)=7i+lore+ts7s. We now see that Q and 
Q’ are of the form (4.8). Namely, since 


(4.11) 


we see that t-< is diagonal together with T? and has 
the eigenvalues 


1 3/2 
t=] if r=| : (4.12) 


2 1/2 


Thus we may replace Table I by the simpler Table IT.” 

Now to the o dependence of the matrix element. 
Just as a great simplification was achieved by remem- 
bering that the total isotopic spin vector T is conserved 


Taste IL. 
> ooerator ryt) TATu" 
State i 
T=} 3 -1 
T=} Pa) 


0 A fast way of deriving these eigenvalues (see Dyson et aJ.¥) 
is to notice that the matrix Ọ° has the elements 


3 
(Ea = 2 Ques Oyy = Tut eTTr = 3ra = 30a. 


Hence @=3Q, showing that the eigenvalues of Q are 3 and 0. 
It is easy to recognize that the eigenvalue 0 belongs to T=}, 
because the operator Q appears in connection with the graph in 
Fig. 7(a) and it is clear that for the state +"p, which 8 a T=} 
state, the graph must contribute 0, since a protan cannot absorb 
ztl PA y 


- 


` 
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in the collision, similarly we can benefit from the fact 
that the total angular momentum J is conserved. 

The latter vector is the resultant of the meson orbital 
momentum and of the nucleon spin. As pointed out in 
an earlier section, mesons are only scattered, in the 
present theory, when their orbital quantum number / 
is unity. This is, of course, immediately apparent in 
the fact that the matrix element (4.2) is a bilinear 
function of the momenta p and g. In order to diagonalize 
J it is desirable to use meson states of definite orbital 
momentum rather than plane waves. This can be 
achieved very easily either by transforming (4.2) by 
means of (1.18), or, what is the same thing, replacing 
Vp and V, with the modified V’s of Eq. (1.20). All 
that amounts to (apart from a factor) is writing (4.2) 
as a bilinear function of p and q and picking the coefh- 
cient Bj: of q;pi. 

One finds then that B; is a combination of two 
operators 
(4.13) 


ojo; and a40;, 


which obviously have the same properties as TuTh and 
m7. Owing to the symmetry between isotopic spin 
and angular momentum indices (see Sec. 1 at the end), 
one can replace in Eqs. (4.8) to (4.12) and in Table II 
T, Ts, t, T, Tm T by J, Jz, l, o, oj, and o;, respectively. 

The matrix element Eq. (4.2), is in this notation, 
and apart from a factor, of the form 


(4.14) 


The eigenvalues of this operator may be easily calcu- 
lated, according to what we have just said, from Table 
II. Following accepted habit, we designate the possible 
states of the system with the symbol (27)(2J), i.e. 
for example 31 means T=$, J=}. Then the eigenvalues 
of (4.14) are 


TaTAO jTi TAT pO iO j. 


state: 33 31 or 13 11 


eigenvalue: —4 2 8. (4.15) 


That is, in a scheme in which T and J are diagonal, the 
matrix B is also diagonal and its diagonal elements are 
given by (4.15) times a common positive factor. 

The result (4.15) is, of course, only a different pres- 
entation of Table I and Egs. (4.6), and is, therefore, 
in equally bad disagreement with the facts. It points 
out, however, one qualitative feature which is in agree- 
ment. Namely (4.15) corresponds to “attraction” in the 
33 state and repulsion in the other two states.” Experi- 
ments indicate,.indeed a strong attractive effect in the 
33 state. They do not, however, indicate an even 
stronger repulsive effect in the 11 state; in fact (4.15) 
would be in much better agreement with the facts if 
the 2 and 8 were replaced by zero. 

Before we turn to a consideration of higher-order 
effects, let us finally rewrite (4.14) with all the factors 


» This language originates in a comparison with scattering due 
to a potential. In the first Born approximation an attractive 
(i.e. negative) pee gives a negative matrix element (or a 
positive phase shift). 
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in. The matrix element may be designated ‘symbolically 
with Bp(q) (which is a matrix in the four-dimensional 
vector space of the spin and charge states of the 
nucleon) or more explicitly with Bye (8). Remember 
also that p summarizes the magnitude |p| of the initial 
momentum, the index i(=1, 2,3) for the initial Y4; 
spherical harmonic and the initial charge state \; and 
similarly g summarizes |q], j, u. Using (1.20) with N” 
omitted, since it cancels out of the cross section when 
one uses (1.16) for the density of final states, one finds 


Bpl) =} (S/L) 0(p) |? (p/e)*Losoituta— cist 7]. (4.16) 


5. MESON SCATTERING II 


One believes (see later) that the value of the coupling 
constant f? is probably ~0.08, which at first sight may 
seem small enough to justify use of the Born approxi- 
mation. As in all such questions, however, one must 
be a little careful in estimating the order of magnitude 
of the successive terms of the Born expansion. In par- 
ticular, Chew has pointed out that “higher-order” 
terms corresponding to graphs such as those in Figs. 
8(a), 8(b), and 8(c) are enhanced owing to resonance 
effects (vanishing energy denominators) occurring when 
the energy of one of the virtual mesons is equal to that 
of the incident meson. In fact the graph of Fig. 8(a), 
as compared to the graphs of Fig. 7, gives an extra 
power of f? but also one vanishing denominator, the 
graph in Fig. 8(b) gives two extra powers of f°, but 


See LESA 
(b) 
N 
N 7 
SN ? Be 
N \/ WW / 
(c) 


Fics. 8. Graphs for scattering representing higher-order terms 
with vanishirg energy denominators. 
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also two vanishing denominators" the graph of Fig. 
8(c) gives three extra powers and three vanishing 
denominators, etc. Pursuing this idea further, Chew 
showed that scattering is strongly enhanced in the 
state T=3, J=4, but not in the other states. 

A satisfactory study of higher-order effects requires, 
however, as pointed out later by Chew, a systematic 
study of self-energy and “renormalization” effects. We 
begin, therefore, with a general formulation of the 
scattering problem? 

For the sake of comparison we must have at hand 
the main results of the ordinary formulation of scatter- 
ing theory. It will suffice to consider the scattering of 
a particle by a static potential V. The Hamiltonian 
operator is then H=Ho+V, where Hy is the kinetic- 
energy operator. Let a, ġa, --- be a complete ortho- 
normal set of eigenstates of Ho (plane waves) with 
energy Ea, Ey, --+. Let Ya, Wa, «++ be eigenstates of H 
with the same energy values; specifically in the study 
of the scattering problems one considers solutions Yat 
and ya~ defined by the*boundary condition at infinity 


outgoing 
tember} ie SK |waves only. (5.1) 
ingoing 
A solution Wa satisfies the inhomogeneous equation 
(E.— H) Wa— $a) = Voa (5.2) 
or alternatively 
s (E— Ho) (Pa— ġa) = Vya. (5.3) 


From these equations one obtains in a well-known 
manner?’ 
(5.4) 


Yat — ġa = (Ea+in— Hy) Vya (5.5) 


where iz is an infinitesimal positive imaginary quantity 
which is inserted to define the operators (Ea— H Ve and 
(Ea— Ho)! unambiguously and in agreement with the 
boundary condition (5.1). The most familiar proof of 
the outgoing wave nature of Yat—ġa Is that which 
consists in calculating explicitly the Green function 
which represents the operator (E.+in— Ho)" in con- 


Pa*— a= (Eakin—- H)> Voa 
and 


vanishing denominators 
21 The statements as to the number of vanis den t 
eats the assumption that the energy available is insufficient 


eee eee or enG in this familiar divertisement is, 
that otherwise we would have to refer the reader to more spe- 
ioiii form (5 d by Chew and Gold- 
“explicit? form (5.4) has been used by 
ties eee 87, 778 (1952) and Gell-Mann Se nase 
[Phy <. Rev. 91, 70 (1953). The “implicit” form (6.5) s = 
Tet with a different notation, since Born’s classical Sen a 
Physik 37, 863; 38,°803 (1926)]. The concise metiem Eos 
that of B. Lippmann and J. Schwinger [Phys. Rev. 12, 
09301 ’s functi tisfying the out- 
ct Green’s functions satisiying 
“A metod ito Cony integration along a poma E 
Tehich is essentially the în X ad ) ya a b 2 aaa d 
i “Di nsche Funktion d gle: : 
Fee E Die Stechen Math. Verein. 21, 309 (1912) 
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figuration space. It is important for us to recognize, 
however, that the in device is of completely general 
validity. For example, it is correct in Eq, (5.4), although 
it is not possible to evaluate the corresponding Green 
function explicitly. That this is so, follows most simply 
from time-dependent arguments, such as those em- 
ployed by Lippman and Schwinger and Gell-Mann and 
Goldberger.” To put it very briefly, these arguments 
boil down to this. The requirements to be satisfied by 
a scattering solution y,* can be stated also in the form 
of a condition on the time dependence of a wave packet 


Y= Laca Baty, (5.6) 


We assume that ¢, is a continuous function of the energy 
Ea, and is different from zero in a finite (and, if one 
wishes, very narrow) interval AZ. If ġa represents the 
incident part of y4*, then the wave packet 


pl) = E acae Faha (5.7 


should represent the incident part of the full wave 
packet (5.6). The latter is a solution of the complete 
time-dependent Schrödinger equation 4f/dt= Hy, while 
¢(t) is a solution of the corresponding free wave equa- 
tion. The requirement then, is, that at sufficiently 
remote times in the past the wave packet W(/) must 
reduce to $(t), 


¥Q)-¢)—0 if (3.8) 
This replaces (5.1) for y.* (the two requirements are 
obviously equivalent on physical grounds). Now it is 
equally easy to verify that (5.8) is satisfied by either 
(5.4) or (5.5). For example, expanding (5.4) in a com- 
plete set” of eigenstates Yn of the operator H we get 


Yat —¢a= L n(Ea— En tin) WnR nay (5.9) 


m, 


where 


Rra= (Wa,V ba). (5.10) 


Inserting into (5.6) 
VOQ-¢O=Linfal); ” Gd) 
fn(Q=DaR nala(ELa— Entin) te". (5.12) 


Now we may designate by A a set of variables which 
logether with E, defines the state a and write 


De= a 1 dE (5.13) 


It is then easy to evaluate the integral over E, in the 
limit =+ ©. The integra! is of the form 


I= JDE- Etine, (544) 


23 It is well known that the functions ¥,~ (or alternatively the 
functions Y~) constitute such a set, if the potential V has no 
bound states. Otherwise, one must add to the set Hat (Va) a 
complete set of bound states, For our present proof it is sufficient 
to assume that the set Y, exists. A 


: 
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where g(/) is a smoothly variable function of Æ which 
vanishes outside of a finite interval, It is well known 
(it can be shown by deformation of the path of integra- 
tion into a semicircle) that 


I>0 as i-o, (5.15) 


This then proves that (5.6) satisfies condition (5.8); 
the nature of the argument should make it clear that 
the in device is absolutely general. 

We may, however, push the calculation a little further 
and obtain from it another well-known result. First of 
all, let us remark that if we build a wave packet such 
as (5.6) but using solution ya, it will enjoy in a similar 
manner the property (5.8) but for >+ %. We also 
note that the same procedure which leads to (5.15) 
shows that 


[~—2mig(E,)e#"* as I-H o (5.16) 


in which we have omitted terms in i, since the limit 
n—0 was always understood. Then 


fnt) = — iZ AR nalae (5.17) 


where the states a are now “on the energy shell,” i.e., 
subject to E= Ea. We may notice that if Z, does not 
belong to the internal AZ in which ca7£0, fa) will 
vanish. In particular Æ», must belong to the cont- 
tinuum. 

We now specialize the set Yn by the assumption”? 
that it consists of the bound states (if any) and of the 
states Ya“ and calculate the right-hand side of (5.11) 
in the limit />-+. The contribution of the bound 
state vanishes as explained before. Hence 


E Wnfn(l)~ Doce e Eep (5.18) 


with 
Co = — mid aRwaCa= — 2771 að (Ly— Ea) Roala, 
Rea= (Ws, Vba). 
And now the decisive point: (5.18) is a wave packet 
of the type discussed before, hence for i+% it can 


be evaluated by replacing yy by its plane wave part 
¢». Hence (5.18) tends to 


Haclt)= Doce e t'pa. (5.20) 


The result of our calculation is that a wave process 
y(t) which begins (at ¿~— œ) with the free wave 
packet (5.7) will become after the scattering process is 
compieted (++ ©) 


V)=4()+4.-(!) (5.21) 


where ¢,- is the free wave packet described by Eqs. 
(5.19) and (5.20). Clearly and ¢s- contain all the 


(5.19) 


26 We shall prove in Appendix C, that the states |pat>, 
which are the field theoretic analog of our present ya* form an 
orthonormal system. The proof for the present case is even 
simpler and well known, see for example Gell-Mann and Gold- 
berger Furthermore, it is obvious that the states Yo~ satisfy a 
similar orthogonality relation as the states ya". 
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information about incident and scattered states that 
we may wish, and once the coefficients Ria (for y= Ka) 
are known, the calculation of the cross section is merely 
a matter of wave kinematics. We may save ourselves 
all further calculations, however, by pointing out that 
in the first Born approximation the kinematical part 
of the calculation must obviously be the same, the 
only difference being that in the expression for Ria the 
exact solution Ys~ is replaced by the zero-order solu- 
tion ¢p: i 

Rva— (pv, Vha). (5.22) 


Any cross section can therefore be obtained from the 
corresponding expression in the Born approximation by 
replacing the matrix element on the right of (5.22) by 
the exact expression (5.19). 

And now, back to meson-field theory! 

We must point out, first of all, that the standard 
discussion just presented cannot be applied, strictly 
speaking, to our problem. The whole discussion is based 
in fact on the assumption that the perturbation part 
V of the Hamiltonian becomes negligible when the collid- 
ing particles are far away from each other. Only then 
can we assume that the incident state is an eigenstate 
of Ho, and that the exact energy of the state is equal 
to the eigenstate of Ho for the incident state. 

As is well known, these assumptions fail when field 
theoretic interactions are involved. In particular, our 
perturbation operator J€ cannot be treated like V; even 
when the incident meson is far from the nucleon, JC 
cannot obviously be neglected. This weuld be tanta- 
mount to neglecting the difference between a “real”? 
and a “bare” nucleon. 

One known way to deal with this difficulty is to use 
nonstationary perturbation theory, assuming in addi- 
tion an “adiabatic switching-on” of the interaction 
constant f. For our present purpose, however, it is 
more instructive to rely on the stationary treatment, 
and we shall see that, for a theory having a relatively 
simple structure such as ours, this can be done quite 
simply. 

Various authors?” have shown how to take properly 
into account, in the stationary treatment, the correc- 
tion of the energy of the incident state due to self- 
energy effects. This is not, however, the only correction 
that is required in the standard discussion; we must 
also comedi the wave function describing the “ncident 
state.” 


21 J. Pirenne, Helv. Phys. Acta 21, 226 (1948); Phys. Rev. 86 
395 (1952); M. Gell-Mann and M. L. aan Phys. Rev. 91, 
70 (1953). The existence of such questions was, of course, recog- 
nized much earlier, see for example the literature quoted in W. 
Heitler, Theory of Radiation (Clarendon Press, Oxford, England), 
third edition. k ai i 

28 Again, this question is already discussed in the classical 
papers on the infrared catastrophe of bremsstrahlung [F. Bloch 
and A. Nordsieck, Phys. Rev. 52, 54 (1937); W. Pauli and M. 
Fierz, Nuovo cimento 15, 167 (1938) ]]. In that case, however, the 
Hamiltonian contains two terms Hra and V (x) of which the latter 
can be neglected when the electron is far from the scattering 
center; the above mentioned authors therefore proceed first to 
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This we do simply by writing the i 


as follows: 9 ncident state p, 


P.=a,*la>, (5.23) 
that is, by applying a meson creation operator to the 
state vector describing a real nucleon. Simult 


bing aneously, 
we assume for the incident energy the value 


5 E,=wyt Ey. (5.24) 


We now show that (5.23) satisfies the necessary con- 
dition of being a good solution of the Schrédinger 
equation (2.4) when the meson is far away from the 
nucleon. In order to justify this statement, and in fact 
even to give it a definite meaning, it is necessary to use 
a formalism which somehow emphasizes the meson’s 
position, rather than its momentum. This may be done, 
for instance, by using a wave packet as in (5.7). Con- 
sider a wave packet for the meson 


g(x) =D pC pitty (x)emitr, (5.25) 


where 1p(«) is a plane wave [Eq. (1.3)], and suppose 
the c,’s are chosen so that for ¿<0 the wave packet is 
completely negligible at the nucleon’s position, (0,0) 
=0. To describe now the same meson state in the 
presence of the nucleon we build. 


BL) => ppe "Etong a> (5.26) 


and verify that this satisfies the time dependent 
Schrödinger equation idb/dl= H® as long as the meson 
wave packet does not overlap the nucleon. In fact, 
remembering »that 


[Hap] =— opip, [a] =- V7, 
[Hoa ]= opa, [ea ]= Vp 
we find 
Ha;*|a>=70p*|a>+Vpla>+ 
a;*H|a> = (Et op)ap* la> + Vpla> 
and multiplying this by ¢p(/)=cpe'”"*”) we see that 
iðp/ðl- H= repli) V pla>. (5.29) 


Apart from trivial factors, YX cplt) V p is precisely g(0,t), 
which proves the assertion. i eae 

Having thus shown that (5.23) is a correct incident 
state, we may ask what incident meson density it repre- 
sents. Since |a> is normalized to unity, however, it is 
clear that the incident density at large distances from 
the nucleon is the same as for the meson wave packet 
(5.25). : 

We now proceed to discuss the complete state vector 
|pa> which, like Yat [Eq. (3.1) ], consists of the inci- 
dent state a plus a “scattered wave” containing no 
ingoing mesons at large distances from the nucleon. In 


(5.27) 


(5.28) 


i i eglecting V(x); this 
i i f the problem with Hra, but neglecting V(x); t 
Ses oa en for the incident state. From then on they 
can follow the standard procedure. The present problem is a 
little different. 

0 
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analogy to ya" we write | pa+ > for this state. Just as 
in ordinary scattering theory we may also consider the 
kind of state in which, in addition to Pa, there are 
instead no outgoing mesons at large distances; such 
states will be indicated by | pa—>. We now write 


(5.30) 
and try to set up, as usual, an equation for x, that will 
bring out the difference in boundary condition for the 


+ and — states. 
Using (5.23) and (5.27) we see that 


0=(H—E)| pa> = (H-E)x+V pla>. 


| pact: > =a, la> -Hy 


(5.31) 


This replaces (5.2) ; the operators of the present problem 
are not simple partial differential operators as in (5.2) 
but owing to the general validity of the in device we 
can still write 


| peste > =a,*|e>+(Epin—H)-Vpla>. (5.32) 


We expect the solution (5.32) fo share many of the 
properties of the functions ¥,*. For example, selecting 
a given sign, say +, the functions should form an 
orthonormal system, that is, the scalar product of 
|pa+> and |q8+> should satisfy” 


<9B-+| pat > =8p haa. 


(5.33) 


This does not mean, of course, that the | pa+>’s form 
a complete system. In ordinary scattering theory the 
Ya”’s do not form a complete system, if there are bound 
states. In our case there are obviously infinitely many 
states besides (5.32); namely, first of all, the four real 
nucleon states, then all the states with several ingoing 
and outgoing mesons, possibly also isobaric states of 
the nucleon, etc. Nevertheless, (5.33) plays the role of 
a completeness relation in the subspace of “‘one real- 
meson states.” 

Our next task is to find a prescription on how to 
extract from (5.32) the information about the scattering 
amplitude from the incident state a,*|a> to a state 
say a7*|8>. We may do this by expanding the scattered 
part (i.e., the second term) of (5.32) in suitable eigen- 
states of the exact Hamiltonian. The procedure follows 
closely that of Eqs. (5.9) to (5.19); the state p,- is 
replaced by |g8—>. Just as in that case we have to 
consider other eigenstates, in order to have a complete 
expansion, the real nucleon states, for example, but 
these will certainly not appear in the scattered wave 
(at t-»-+ ©) for the same reason why bound states did 
not contribute before (physically: because otherwise 
energy conservation would be violated). On the other 
hand, if the incident energy is, for example, above the 
threshold for meson production, states with two real 
mesons (in our symbolism states |gk8—>) would not 
only appear in the expansion but would contribute 
asymptotically at + ©. We do not plan to calculate 
such processes, however, and will not consider them 


» For a proof of (5.33) see Appendix Cn 


> 
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any further. The presence of such terms in the expan- 
sion is indicated by the dots in Eq. (5.34) below. 
The analog of (5.9) is then 


pat >— ár la> =w Dp|B> <B|Vola> 
HE alon wt in) |q8— >Rpalg8)+ ++; (5.34) 


where 


Rya(q8)= <48- | V pla>. (5.35) 


The rest of the argument proceeds exactly as in the 
potential case, so that (5.35) replaces Roa of Eq. (5.19) 
in the cross-section formula without any further change. 

As is well known, formulas such as (5.19) can be 
used as the starting point of a perturbation expansion. 
The same applies to (5.35). For this as well as for other 
purposes it is, however, desirable to subject the formula 
to a small transformation. Using (5.32) to express 
|g8—> we find 


Rya(q8)= <BlagVpla>+ 
+ <p|V (Erin H) Vpla>. (5.36) 


The first term on the right can be transformed further 
using 4V p= V pa, and the identity 


aqla> = (Es—w—H)V le>, (5.37) 


which can be proved as follows. The operator H+-w,—E, 
is definite positive, and has a well-defined inverse. 
Hence (5.37) is proved if we show that 


(E,—@¢—H)a,|a>=V,"|e>. (5.38) 


The latter, however, can be easily verified by inter- 
changing H and a, by means of (5.27). 
Inserting now (5.37) into (5.36) we have finally” 


Ra (98)= <B|V (Etin H) V p|a>+ 
+<B| V (E: w,- H)V le> 


where we may notice that the second term on the right 
can be obtained from the first by changing wg to —w, 
(notice that E,=E,+w,) and interchanging V, with 
Vian: t 

Equations (5.39) and (5.35) form the basis for the 
following discussion of meson scattering. 


(5.39) 


6. MESON SCATTERING III 


We shall now discuss the methods that can be used 
to evaluate (5.39). An obvious idea is to expand in 
powers of f. We shall examine this first. 

To lowest order, i.e., replacing |a> by ġa, H by Ho, 


% One can also arrive at the expression (5.39) for the scattering 
amplitude from the S-matrix formula: Sop, pa= <¢8—|pa+>. 
This requires manipulations similar to those of Appendix C, and 
is in fact the way in which I found the expression. The proof 
adopted above includes various simplifications suggested to me 
by Dr. F. Low, see also reference 8. A remark is needed also about 
Eq. (5.37); if one expands the inverse operator and [a> in 
powers of the interaction one gets an expression which has an 
obvious physical meaning in terms of graphs (we leave this to 
the reader); this is, in fact, how I first found the equation. This 
is an elementary illustration of the usefulness of graphs. 
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and neglecting Æ. the expression (5.39) reduces, of 
course, to the Born approximation of Sec. 4, Eqs. (4.1) 
and (4.2). The first and second terms give, respectively, 
the contributions of the graphs in Figs. 7(a) and 7(b). 
Higher-order contributions arise when we expand 
(E—in—H)~ and (E,—w,—H)~ as well as the eigen- 
states |a> and |> in powers of f. In doing these 
expansions it is not necessary to remember that E, and 
E(=£,+w,) are themselves functions of f; we may 
instead regard Æ, as an arbitrary. parameter inde- 
pendent of f. This allows us to use the same kind of 
expansion as in Sec. 2, namely 


la> =Zoh {pat + (E= Ho) ASC (E— Ho): - - 


-< AIC(Es— Ho) pat}. (6.1) 


Also Ze, of course, can be expanded, but it is preferable 
to leave it as an explicit factor. Expanding similarly 
the inverse operators 


(E+in— H) = (E+in— Ho)“ 
+ (E+ in- Ho) I (E+ in— Ho) +, 


we see that the general term of the expansion contains 
both factors 3C and AJC. Representing for brevity 
energy denominators by dashes, the structure of the 
general term is more precisely È 


(62) aS 


bp*IC—AIC—AIC— + ++ —AI—V—IC—IC 
—5C—V—AIC— +++ — ee ATC— Hoda. 


When this is expressed more explicitly aş a sum over 
intermediate states similar to (26a) we notice a funda- 
mental difference. Bare-nucleon states are now allowed, 
although only for the intermediate states between the 
two V operators. A description in terms of graphs 
similar to Sec. 3 is clearly possible; we must only add 
a convention to represent the V operators. Since 
V,(V,*) clearly symbolizes the absorption (emission) 
of the incoming (outgoing) meson, we can represent Vp 
by allowing a dashed line representing the incoming 
meson to terminate at the appropriate point on the 
nucleon line and similarly for V,*. This convention is 
clearly related to that used in Sec. 4. 

To give two examples we plot in Figs. 9(a) and 9(b) 
two graphs contained in respectively the first and 
second term of (5.39). All graphs from the second term 
have of course “crossed real meson lines.” It is.easy to 
see that the graphs that occur are all those that might 
be suggested by naive perturbation theory, except those 
with self-energy parts on the end lines. We have already 
explained what we mean by a self-energy part, so the 
examples of excluded graphs in Fig. 10 will suffice to 
explain the statement. A self-energy part between the 
end points of the free mesons, such as it occurs in 
Figs. 9(a) and 9(b) is of course allowed. ; 

The correct recipe, which was first formuiated by 


Dyson for quantum electrodynamics is then the follow- 
ing: 


a ey! ae 
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Omit all graphs with self- 
nucleon lines, but multiply 
graphs by a factor Z4 for 
(this is the rule in 
nucleons present), 

On the basis of this formulation of perturbation 
theory and of a further discussion of renormalization 
(which wil] not be necessary here), Chew has developed 
an approximate treatment of the meson scattering 
problem. The method is based on a classification of 
graphs in order of importance, suggested by the Tamm- 
Dancoff approximation," and leads to a linear integral 
equation for the numerical discussion of which We 
must refer the reader to the original papers.*2 One can 
show that this linear integral equation also appears 
as an approximation to the nonlinear equation of Low,’ 
to which we now turn our attention.* 

Let us expand the inverse operators on the right-hand 
side of (5.39) by means of the theorem 


(E—H)1=S3|n>(E-E,)"<n|, 


energy parts at the ends of 
the sum over the remaining 
each end of a nucleon line 
a form valid for any number of 


(6.3) 


where the sum is over a complete set of eigenvalues £n 
and eigenstates |7> of the full Hamiltonian H. The 
eigenvalues (omitting the possibility of isobaric states) 
are Es Estur, EstHwrtHwn «++. The contribution from 


Fics. 9. Two graphs representing higher-order terms contained 
S in (5.39). 


; h i T 
z F 6 For a discussion of the true Tamm-Danco 

mete Braid to meson scattering, see Dyson Rote 

peter, Schweber, Sunderasan, Visscher, and Bethe, Phys. Rev. 95, 

Yona 5 54); F. F. Salzman 
z 9 Phys. Rev. 95, 285, 1669 (1954); F. F. 

and JW S e Buys. Rev. 35, 286 (1954); see oi L. Gam- 
: . 95, 209 (1954). ; i 

Eea Baie, and Dyson (to be published) hav e shorn 

that the Low Equation for a (a ena pa init 

A i S : 
ae aa as cae btedly deserves further study. I am 
tioned authors for advance communica- 


tion of their results. 
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(b) 


Fic. 10. Excluded graphs with self-energy parts on the end lines, 


the fourfold level Æ, is remarkably simple, namely 


(1/o_) rf <B| Vly > <7|Vpla> 
—<p|V,|y><y|V*la>}, (6.4) 


which, owing to the proportionality relation (2.15), is 
nothing but the Born approximation (4.1) and (4.2), 
with f replaced by f,. 

In the remaining terms of the sum (6.3), Low chooses 
as states |n>, with energy E,+w,, the scattering states 
Eq. (5.32) with the ingoing convention, i.e. |ky—>. 
For the higher states he would use similarly states 
|khy—>, +++ etc. but no detailed calculations have 
really been made including these states.” Clearly it is 
possible to consider a first, second, ---, nth approxima- 
tion in which one keeps only the amplitudes involving 
no more than one, two, ---, mesons. The mesons 
counted, however, are real (incident or outgoing) meson 
not virtual mesons, as in the Tamm-Dancoff scheme. 
This seems a much better way to achieve a classification 
of terms in order of importance, better, that is, than 
the less well-defined scheme earlier suggested by Chew. 
It seems quite reasonable to assume, that for low-energy 
scattering the first approximation should be adequate 
(estimates of the next term made by Chew and Low 
confirm this). 

Let us write, therefore, the Low integral equation in 
this approximation; to this end we notice.that in ex- 
panding, for example the first term on the right of 
(5.39) there occurs the product of <8! V lky=> 
and of <ky—|V,|a>. The latter, and the complex 
conjugate of the former, are just the kind of expression 
considered in (5.35), except that the state & is not, in 
general, on the energy shell with the state p: wswp. 
Nothing prevents us, however, from assuming (5.35) 
as a definition of the symbol on the left, without restric- 


3 This would involve setting up separate equations similar to 


(5.35) and (5.39) for higher-order processes. Although tisis seems 
to be feasible in principle, it gets quite involved., 


a 


od 
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tions. One then has, finally, using B for the Born 
approximation 


: Rpa(g8)= B pa(qB) 


= SE ARa” (ky)R palky)/ (or =w in) 
HR" (ky) Roalky)/(wxtoa)}. 


We are, of course, interested in the case wp=o but 
again, if we follow the steps of the proof carefully, we 
can see that (6.5) holds for the quantity defined by 
(5.35) without the foregoing restriction; one has only 
to be careful in writing E= 2,4, (and not E,tw,) in 
expressions such as (5.39). As Low points out this is 
only possible because the dependence of (5.35) on the 
energy of the state p is trivial, namely just a factor 
pwp tv(p) present in the operator Vp; exactly the same 
dependence appears™ on the right-hand side of (6.5). 
On the other hand the dependence on wa is nontrivial, 
as one sees from the right-hand side of both (5.35) 
and (6.5). (This is why in the energy denominators of 
(6.5) one must write w; and not Wp.) 

Thus (6.5) constitutes an inhomogeneous, nonlinear, 
integral equation for the scattering amplitude. In order 
to discuss its properties it is convenient to assume that 
the states p and q are not plane waves, but spherical 
waves like those considered at the end of Sec. 4. 
Nothing of what we have said is altered, except the 
meaning of the symbols p and g. Now B pa(qß) is pre- 
cisely the matrix element, between ġg and ġa, of (4.16), 
except that f is replaced by fr. Low also saves some 
writing, by omitting the nucleon indices a, 8, y system- 
atically; i.e., he treats Rpa(ky) as the ya element of a 
matrix Rp(k); Rast (ky) is then the By element of the 
Hermitean conjugate of the matrix R(k). This Her- 
mitean conjugate is designated by R,* (k). As a result 
one can simply rewrite (6.5) omitting the Greek indices 
throughout; the products R,*()Rp(?) and R,*(&)Ra(k) 
are now, of course, matrix products. 

One is naturally tempted to go further, and treat 
also p and q in Rp(g) as matrix indices; more specifically 
p stands for |p|, 7, à and g for |a], j, u where, for the 
sake of brevity, we shall replace the two indices 7 and A 
by a single index à (=1, ---, 9). This index specifies the 
angular momentum and charge state of the meson, just 
like a does for the nucleon. Similarly q will now be 
replaced by |q|u, and k by |k], v». This forces some 
changes in notation, like replacing oi7 by (cr), OjTp 
by (cr), but causes otherwise no trouble. We now write 
R,(q) as the pd element of a matrix 7, namely 


R,(Q=—3Lp(h)3(Q)/Pm«)* a. (6.6) 


Some of the factors introduced on the right will be 
convenient later; the p dependence, however, has been 
specifically brought out, in such a way that r depends 
on w only. The matrix rt with the matrix elements 


(6.5) 


(n= (ra) (6.7) 
i 3 M. Gell-Mann and M. L. 1433 
= ae n ea bra) VG eA (1954); an L. Goldberger, Phys. Rev. 96, 
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is now the Hermitean conjugate of r, when regarded 
as a matrix in a space of 4X9=36 dimensions (this 
being the number of possible angular momentum and 
charge states of the nucleon--one-meson system). In 
(6.5) we can now perform symbolically the summation 
over the intermediate index, v, the first term between 
braces gives (rtr) and the second (rtr),,. We can 
relate Bp(q), Eq. (4.16), to a matrix b(w,) just like 
R,(q) is related to r(w,): j 


Dyn lwa) = Gee) (P/a) 
XL(or)u(o7)a— (or) (or)u], 


where a previous remark”! has been duly considered. 
Equation (6.5) finally becomes Tin summing over |k| 
use rule (1.16) ] 


Tun lw a) =b lw a) 
@ pelkjdo (rir) (rr) xn 
ma iI | m | (6.9) 
T G ees 


wg in wrw 


(6.8) 


The rtr under the integral sign is, of course, a function 
of wz; owing to the fact that in the last term the indices 
ud are crossed, we cannot write (6.9) as a matrix 
equation simply by dropping the indexes altogether. 
(One should also notice that while the meson indices 
are crossed, the nucleon indices are not.) We may just 
mention in passing that this pecular structure is related 
to the Gell-Mann-Goldberger crossing theorem® accord- 
ing to which 

raw) =~). , (6.10) 


Without discussing this relationship in general, we 
may see what it means in terms of Eq. (6.9). One sees 
at once that (6.8) satisfies the condition (6.10). In 
order for the whole expression (6.9) to satisfy (6.10), 
we must simply rewrite the second denominator as 
w-+w,—7n. This, of course, makes no difference as long 
as wą is positive (/ dw, runs from the meson mass 4 
to +œ). It does, however, make a difference when we 
try to extend the value of 7(w,) to negative values of 
wq by using the right-hand side of (6.9) as a definition. 
We now clearly see that (6.10) is a theorem about 
analytic continuation of the function r(w). Integrals 
of the type appearing on the right-hand side of (6.9) 
are called Stieltjes transforms; they are known to 
define analytic functions of the variable w,, in the 
whole complex plane of this variable except for certain 
cuts. In the case of (6.9) the cuts run from —% 
to —u and from u to +. There is, therefore, a gap 
from —p to +u, through which one can pass from the 
positive to the negative imaginary half-plane. If one 
thinks the foregoing relationships through, one sees 
that (6.10) is valid if the analytic ‘continuation from 
the positive to the negative real axis is made along 4 
path which starts just above the positive real axis and 


“ for an instructive conversation about a paper they 
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goes, through the gap, 
negative real axis. 

Special attention must be paid in this consideration 
to the 7 factor in (6.8) and (6.9). In fact q= 
is also an analytic function of w 
cuts defined previously ; when continued along the path 
described earlier, q is real and positive on both the 
positive and the negative real axis, when ley) >u and 
is positive imaginary in the interval —p p <p, The 
sign of g is, of course, essential in Eq. (6.10). 

Equation (6.9) has many other remarkable proper- 
ties, for which we must refer the reader to the work of 
Chew and Low.$:36 We only wish to perform the trans- 
formation of (6.9) to Low’s set of three equations”? for 
the phase shifts. 

As we know, one can pass by means of a unitary 
transformation, from the states defined by meson and 
nucleon indices à and a, to states with quantum num- 
bers J, J., T, Tz. In this representation, the matrix r 
must be diagonal, with eigenvalues depending only on 
J and T. Furthermore, dwing to the symmetry already 
mentioned," the eigenvalues for the states 13 and 31 
must be equal. Let us designate with g, (u=1, 2, 3) the 
eigenvalues of r for the states 11, 13 (or 31), and 33, 
respectively. Remembering Table II we see that the 
matrices é and ¢ defined by 


to a point just below the 


(or — u") 
«1n the plane with the 


(6.11) 


Ep = hu = (07), (o7),, 


have, respectively, the eigenvalues 9, 0, 0 and 1, —2, 4 
for the same'states. Hence introducing three new 
matrices Ay (#=1, 2, 3) such that 
e=9Aj, 
c= Ay— 2At 4a, 
= A+ Aot As, 


(6.12) 


the eigenvalues of these A matrices will be 1, 0, 0 for 
Ay, 0, 1, 0 for As, and 0, 0, 1 for A . We then have 


3 
(fi x Subu (6.13) 
u=l 
and since A„t=Au= Au we have also 
3 
> rtr= Ð |gu| Au. (6.14) 
u=] 


In order to cross the indices \ and p, we may consider 
three other matrices Au’ such that (Aw’)a= (Au) rue 
Crossing indices, however, simply interchange «e and ¢; 
one finds then > h 

3 
Ax = Dy AyA ru 

v= 


(6.15) 


__ 36 The author is also indebted to Dr. R. Serber and Dr. T. D. Lee 


are preparing 


on this subject. 


m * Reference 7, Eq. (3.11). 
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where the matrix A is 


1 ~—$ fő 


f 
| 
A=1/9 | anit} 7 4 (6.16) 


i 4 1 


Equation (6.15) allows us to express the last term in 
(6.9) as a matrix element with uncrossed indices. The 
whole equation then becomes a matrix equation, of the 
form ZuXusu=0 where X, is not a matrix; clearly 
this gives three equations X„=0. 

In the foregoing statement, we have made use of the 
fact that the Born matrix (6.8) is itself of the form 


b (oq) = (P/W) Auu (6.8') 
u= (— 8/3, =2/3, 4/3), 
where one can point out, in addition, that since b(w,) 


satisfies, as it must, the Gell-Mann-Goldberger cross- 
ing theorem, one must have 


(~u) E Auha =o Daly, 
which reduces to *8 
DON (6.17) 


À is therefore an eigenvector of the matrix A. 

To sum up, one inserts (6.13), (6.14), and (6.8) 
into (6.9), and taking into account Eq. (6.15) one finds 
Low’s three equations: 


Bu (w1) = Aug / Wwa 


AE  Avel go(eon)|? 
za f eT ca ol Jee | (6.18) 
T k? 


wr— wg in wr Hwa 


where a `, has been tacitly understood. To complete 
the discussion, we have only to establish the connection 
between the functions g4(w) and the phase shifts ĝiu 
613631, and 633, which, with the abbreviation 46, 
(u=1, 2, 3) for 61:@w,), etc., is 


es sind. = | vlg) | gu lw) (6.19) 
i.e., apart from the cut-off factor, which Low omits, 
g is just e” sind. This comes about as follows: 
Adapting Eq. (5.19) to the present situation, let us 
write the relation between incident and scattered wave 


packet in the form 3 


C43 = — ri) pad Wy—Wp)RpalGB)Cpa- (6.20) 


We notice that (5.39) and (1.20) indicate that R is 
proportional to V°”; this cancels against the inverse 
factor in the density of states (1.16), as expected. 
Transforming to states of definite J and T, Cpa becomes 
Cu(wp) and using (6.6) and (6.13) the relation (6.20) 


. 
3 G. F. Low, Proceedings of the Fifth Annua} Rochester 
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becomes a simple proportionality relation 


Cul = 2iv®(q) Buln (6.21) 


both sides are taken at the same energy wq. Now the 
wave packet existing after the process is equal to inci- 
dent+ scattered wave packet, i.e cto’; this must 
have the same norm as the incident packet, hence the 
ratio of cyte,’ to Cu must be merely a phase factor, 
which is usually written in the form e°., where ôu is 
called the phase shift. Hence 


Cu = iei sinducu 
which together with (6.21) yields immediately (6.19). 


7. SOME CONCLUSIONS AND REMARKS 


We shall not attempt to discuss the problem of 
solving Eq. (6.18). If the “crossed” term is left out, 
the three functions g, become uncoupled, and it is 
rather easy to find a solution. This turns out to be 
identical with an approximate solution of the earlier 
formulation by Chew, in terms of a linear integral 
equation. This earlier formulation was able to reproduce 
the data for the large phase shift 63; quite accurately. 
One has good reason to believe therefore, that also the 
newer and a priori better formulation will possess this 
feature. By means of an extrapolation to zero meson 
energy® it now seems that the best values for the 
coupling constant is 


f{?=0.08 (7.1) 


in reasonable agreement with previous estimates. These 
earlier estimates also gave a cut-off momentum of the 
order 


Rmax~M, (7.2) 


where M is the nucleon mass. As is well known, the 
phase shifts 631:=613 and 6,; are very small, and a com- 
parison between calculated and measured values has 
not yet been possible. It would provide a very interest- 
ing test of the theory. It will also be quite interesting 
to see, how different these phase shifts turn out to be 
in the new as against the older formulation. A plausible 
guess is that they will be affected by the change much 
more than 633. 

There are many other problems to which methods 
similar to those described here can be applied. One is, 
of course, photomeson production; this problem too 
has been formulated by Low in terms of an integral 
equation,’ and work is being done on its solution.t This 
report, however, is already too long to allow us to 
include a discussion of this, except for one point, which 
is too essential to be overlooked. 

Kroll and Ruderman® have proved an important 


2 G. F. Chew (private communication). 

t The reader will find a simple discussion of photomeson pro- 
duction in; E. Fermi, “Lectures on Pions and Nucleons,” repro- 
duced in Nuovo cimento Suppl. 17 (1955). 

. “WN, M. Kroll and M. A. Ruderman, Phys. Rev. 93, 233 (1954). 
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theorem on photomeson production at threshold, in 
the relativistic theory. A similar relationship holds® in 
the cut-off theory. It may be explained as follows. 

In order to discuss photomeson production we must 
add to the Hamiltonian (1.10) terms representing the 
interaction with an incident electromagnetic wave. Of 
these terms, the only one that matters near threshold, 
has the form 


(Urle f oA rdd (7.3) 


where A is the vector potential of the electromagnetic 
wave. It is a term one must introduce in order to 
satisfy the continuity equation of charge and current 
(the finite size of the source function gives rise to some 
complications which we cannot discuss here). 

An interesting feature of (7.3) is that it contains the 
coupling constants f and e simultaneously. Owing to 
the smallness of e, it is much more reasonable to apply 
perturbation theory to (7.3) than to the pure meson 
term. We then calculate the matrix element of (7.3) 
between the initial state which is a real nucleon state 
|a> (we do not apply perturbation theory to J!) and 
a final state consisting of a real nucleon and a_-real 
meson. The interesting point arises, that in this case 
the meson is in an s state and can be treated as free 
[this is because the wavelength of the incident quantum 
is large compared to the size of the source function p, 
hence 4 may be pulled out of the integral, which then 
involves f@pdx and expanding ¢ in spherical waves, 
only s states will contribute; for the sake of comparison 
(1.10) involves V@ instead of ¢ !]. 

Owing to the fact that mesons in s states do not 
interact [except through (7.3)] the final state is not 
one of the complicated states of the previous para- 
graphs, but just of the form dq.*|@> where aqs“ is a 
creation operator for an s state. One then sees that 
(7.3) involves precisely a matrix element of the form 
(2.13). The result is that the cross section at threshold 
can be expressed directly in terms of the renormalized 
coupling constant, and gives another, completely inde- 
pendent determination of this constant.’ Comparison 
with experiment*! gives a value in tolerable agreement 
with (7.1)! 

Other problems to which similar methods can be 
applied are the anomalous magnetic moments of the 
nucleon,” and probably nuclear forces and meson pro- 
duction in the nucleon-nucleon encounters. 
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APPENDIX A 


> 
We summarize here Kemmer’s classical argument” 
for the isotopic-spin dependence of the Hamiltonian 
(1.10). We assume the following: 


(a) Nuclear forces, as observed in low-energy nucleon 
phenomena, are due to virtual emissions and reabsorp- 
tions of m mesons, according to the Yukawa scheme. 

(b) Nuclear forces (as distinct from Coulomb inter- 
actions and other minor electromagnetic interactions) 
are strictly charge independent, as first postulated by 
Breit and Feenberg. 

Assumption (b), which is now much strengthened 
by the comparison of levels in “mirror nuclei,” asserts, 
roughly speaking, that the energy of a nucleon level is 
not altered (apart from Coulomb corrections) if some 
of the neutrons are changed into protons or vice versa. 

It is necessary, however, to formulate this more 
precisely. To this end we avail ourselves of the isotopic 
spin formalism. If we did not do this, the state of the 
nucleus would be described by a wave function 


(A.1) 


antisymmetrio in the proton coordinates xı to +m, 
and again in the neutron coordinates Xm+ı tO Xn (x 
summarizes space and ordinary spin coordinates). A 
discussion of the relationship between states with the 
same number 7 of nucleons, but different neutron- 
proton ratios then involves rather complicated con- 
siderations about permutation operators, similar to 
those which are necessary when one deals with the 
n-electron problem without introducing the spin coordi- 
nates explicitly. As is well known, this procedure, al- 
though the most natural one when one neglects spin 
interactions, leads one to consider a wave function 
which is separately antisymmetric in the electrons with 
“spin up” and those with “spin down.” It is then 
rather complicated to discuss the relationship between 
states with different up/down ratios. 

If we-introduce the spin coordinates explicitly, a 
change in the relative numbers of spin-up and spin- 
down electrons may be achieved very simply by a 
simultaneous rotation of all electron spins, or more 
precisely by repeated application of the operators 
S,tiS,, where Sz, Sy, and S: are the components of 
total spin (which correspond to infinitesimal rotations 
of the aforementioned kind). i 

A rotation of the spin of an electron, however, is 
described in quantum mechanics by a unitary trans- 
o-dimensional vector space sub- 


Y (aida: + "EmEmt1"" Xn); 


formation in the tw 


43 N. Kemmer, Proc. Cambridge Phil. Soc. 34, 354 (1938). 
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tended by the two basic spin states a and 8. On this 
rests the possibility of an analogous treatment of our 
problem. We assume that the reader is familiar with 
the elements of the isotopic spin formalism? The 
analogies noted previously will explain its usefulness 
in the present context. Regarding proton and neutron 
as two states of a single particle, two states, that is, 
of some internal degree of freedom, distinct from trans- 
lational motion and ordinary spin, we may treat these 
states, P and .V, on the same footing as a and 8. One 
introduces then operators 7, Ta, 73 defined by 


mP=N, tN=P, 
mP=iN, nN=—iP, (A.2) 
rP = P, nN = —N, 


in analogy with Pauli’s spin operators oz, ay, oz Of 
course, P and V may be regarded as functions P(i) 
and N (i) of an isotopic spin variable. The wave function 
of an n-nucleon system is now obtained multiplying 
(A1) by spin factors P(i:)-++Plin)N (ms) N (ip), 
where i, is the isotopic spin of the Ath nucleon, and 
antisymmetizing the whole, so as to satisfy the Pauli 
principle for all nucleons. 

All these formal manipulations are completely anal- 
ogous to those normally carried out with electrons; 
the result at which we are aiming is the following. If 
we introduce the components of the total isotopic spin 


T=} un (A=1,2,3) (A,3) 


where 7“) is the operator ma, Eq. (A.2), for the kth 
nucleon, we notice that the operators Tu, Ta, Ta repre- 
sent simultaneous infinitesimal unitary transformations 
of the spin states of each nucleon and may thus be 
used to alter the neutron-proton ratio (in analogy to 
what we said about S+, Sp, 5:). In fact noting that the 
linear combinations 


T= (riı&ir2)/ V2 , (AA) 


enjoy the properties 


7-P=V2N, 4N=V2P, N= P=0 (A5) 


one sees at once that (T:+i7:)y is a state with one 
proton more (less) than y; it satisfies the Pauli prin- 
ciple if y does. The charge independence assumption 
then states that the foregoing state (if not identically 
zero) satisfies the Schrödinger equation if y does, and 
with the same energy eigenvalue. This shows that 
T:+iT2 commutes with H, so that finally we can write 


[T,H4]=0 (A.6) 


for \=1, 2. This is the formulation of charge inde- 
pendence which lends itself to a discussion of meson 
theory. Before we proceed, let us take note of the com- 
mutation relations ` 


[Tx Ta] ifn 
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where (Auv) is any cyclical permutation of (1 2 3). As 
a consequence, (A.6) is valid also for A=3. 

We now ask, what requirements a meson theory 
must satisfy, in order to yield (after elimination of the 
degrees of freedom of the meson field) a Hamiltonian 
possessing the three Hermitean integrals of the motion 
Ty. It is hard to see how this can happen, unless the 
theory possesses quite generally three such hermitean 
integrals (in the same way as, say, conservation of 
energy, momentum, etc. in the interaction between 
charged particles is the result of the validity of more 
detailed conservation theorems in the interaction be- 
tween each particle and the electromagnetic field). The 
quantity 7) which is conserved in meson theory, how- 
ever, need not be the expression on the right-hand side 
of (A.3), but may contain in addition to (A.3) a con- 
tribution ô of the meson field. In fact the assumption 
that @=0 leads, as one can see by an analysis similar 
to the following, to the conclusion that only neutral 
mesons are involved. In view of the facts now known, 
this is not an interesting possibility and we shall not 
pursue it further. 

Let us then write, considering for simplicity the case 
of one nucleon only 
Ty=in+h, (A.8) 


where is a Hermitean operator acting on the states 
of a meson or more generally on the state of the meson 
field.“ We may assume, for example, that mt, a, 
and 7° are three different states of the same particle, 
forming as it were a counterpart of the two states V 
and P of the nucleon. We might, however, also consider 
a theory in which only two states, say rt and 77, play 
a role. For this reason we shall leave the number of 
charge states of the meson undetermined for the 
moment and designate an arbitrary orthogonal set of 
them by wm, where m may take 3 or possibly 2 values. 
For the sake of simplicity we shall forget all dynamical 
variables other than the 7 spin of the particles involved. 

According to our basic assumption, the Hamiltonian 
Je is a linear combination of absorption and emission 
operators bm and b,,*, where bm absorbs mesons in the 
state wm. The coefficients are then operators on the 
nucleon only, i.e., linear combinations of 71, 72, 73, and 
the unit matrix 1. Hence the absorption part A of the 
Hamiltonian will be of the form 


A= mbm(A mol-+ yA mata) (A.9) 
with numerical coefficients A mo, Ama. We set 
fa0= di mbnA mo} f= nbmA m, (A.10) 


where f is a proportionality constant to be determined 
later. We shall see that the a) are themselves absorption 
operators from a set of orthogonal meson states. Now 


44 Tt is customary, in discussing ô), to rely heavily on the analogy 
with angular momenta. This allows one to short cut some of the 
mathematical detail. The following more abstract approach is 
mainly for those readers who do not find the analogy with angular 


_ momeria and rotations in ordinary space particularly convincing. 


WICK 

5¢= A-+-A*, and we shall satisfy (A.6) bythe assump- 

tion that 7, commutes with A and A®* separately; 
Ona Lt Enna] rut Dendy 72,7. ]=0. (A.11) 

Using [71,72 ]=77s, etc., one sees that (A.11) reduces to 


[01,40 |= [1,41 ]=0, 


[61,¢2 |=7a3; [01,43 |= — iaz (A.12) 
plus the Hermitean conjugates thereof 
[61,a2* ]= ia"; [01,a.* ]=0, (A.13) 


etc. We now notice that according to the well-known 
commutation laws 


[OnjOn J=0; Pon So (A.14) 
and Eq. (A.10) one has 
[ay,@.J=0; i EmA ma (A.15) 


By using (A.12) to express a» and the identity 
[Euo], w]= Lu Cow ]]+ Co, Cw,u]] 
we find 


—iLas,ay* |= [0a], 0] On asa] ]+ Cas La, J]. 


Now using the fact that [@s,a:*] is a c number, see Eq. 
(A.15) and (A.13), we see that the foregoing expression 
vanishes. Similarly 


— ilasa" ]=[[61,0s],a2* ]=[01,L45,02*]] 
+[a3,[a2*,01 | |= —iLaa,a3* ]. 


More generally one sees that, by an appropriate choice 
of the arbitrary constant f, we can set 


[a,a,* ] = Onn . 


We thus see that the a’s satisfy the canonical commuta- 
tion laws. This has immediately several important con- 
sequences. First we see from (A.15) that the matrix 
{Am is unitary. This implies that the states wm can- 
not be less than three (a charged-meson theory, with no 
m is ruled out; in this way the existence of a neutral 
meson was correctly predicted before 1940!). If we 
take the states, 


=f mA mom; (A.17) 


as basic states for the meson, instead of w1, w2, ws, We 
see’ that the variables a, Eq. (A.10), represent absorp- 
tion operators for these states. We also see that do, 
Eq. (A.10), can be expressed linearly in the a’s; no 
such linear combination can satisfy [[,,¢0]=0, how- 
ever, unless it is identically zero. The unit matrix term 
in (A.9) therefore vanishes.‘® In conclusion we have 


4 Tt is actually even easier to see that a,* when applied to the 
zero meson state creates a meson in a state (A.17). From that, 
however, the inverse statement follows. 

4 An alternative possibility exists: f=0, and only the 1 is 
present. This leads, however, to the already discarded theory 


with neutral mesons only. => 
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a real orthogonal matrix in 
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reduced A to the form 
3 
A=f an, (A.18) 
d=1 k 


where @,, az az are absorption operators from the 
states (A.17). This is, however, just the i-spin de- 
pendence postulated in (1.10); all one has to do is to 
add momeirttum indices and ordinary spin variables to 
the nucleon. K 

For the sake of completeness, however, let us still 
consider the structure of the operators 0. As operators 
on the meson field they must be expressible in terms 
of the a and a* variables. The linearity of (A.12) in 
the a variables, indicates that 0 is a bilinear form in 
a and m”. One then easily finds by trial that 


9, =1(a2a3*— aza:*) (A.19) 


etc. In order to get the general expression of 0, for the 
whole meson field we must append a momentum index 
to a thus ay, and write , 


O1= 120 x (dy20 3 — Ayu”). (A.20) 


On the other hand we may wish to specialize (A.19) 
even further, namely to apply to a one-meson state 
only. Applying (A.19) to the one-meson states 91, 22, 2; 
we find 


6,2,=0, 6,92= 103, O= — ih. (A.21) 


Thus the matrjx (4),.= (@,,0:2) which represents 4; in 
the case of a single meson is of the form 


00 O 
Pad @ =i. (A.22) 
0i 0 
Similarly one finds 
0 0 7} 0 => (0) 
b= | 0 0 0l; =f 0 0j. 
—i 0 0 0 00 


One should also point out that these matrices, as well 
as the operators (A.19) or (A.20) satisfy the usual 


commutation rules 
: [1,62 ]= 10s (A.23) 


and that therefore the total operators (A.8) do also. 
All this boils down to the fact, then, that the Hamil- 
tonian (A.18) or (1.10) is invariant against a unitary 
transformation in the nucleon (N,P) space, and a 
simultaneous unitary transformation of the meson 
states Q1, 22, 2; the infinitesimal unitary TE 
being in one case of the form 1 Gans ae 

infinitesimal angles gq) and in the other of the ora 
1—igph. In the latter case the matrix turns out to be 
3 three dimensions, but the 

? 
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corresponding geometrical interpretation as a rotation 
in a mystical three-dimensional “isotopic-spin space” 
is of doubtful pedagogical value. 

Finally we should explain the relation of the states 
Qi, 23, 2, to the physically more significant rt and z’ 
states. By means of the inear combinations (A.4) we 
can write 


PaT = Tilp t 7 a_+ Tsaa 

aa (A.24) 
a= (a;-Fia2)/V2. 
The passage from aja; to dz, @_, a; is a unitary trans- 
formation similar to (A.10) and corresponds to using a 
new set of meson states [compare (A.17) J, 


Os, = (QyIN2)/VZ; Oy, (A.25) 


From (A.5) we sce that destruction of a meson in a 
state 2, (by means of a+) accompanies a nucleon 
transition from neutron to proton, etc. Taking charge 
conservation into account, we are thus led to identify 
the states 2;, Q, 2; with m4, T, mo states in the same 
order. 

It is clear that in calculations we are free to use 
either the states 2), 22, 9; or the states (A.25); as 
(A.24) shows even the symbol Jaara can be inter- 
preted to fit either scheme, one has only to remember 
that in the second scheme two matrices ra are not 
Hermitean. 


APPENDIX B 


If the three operators (A.8) are constants of the 
motion, then another important integral is 


T?= aT = 3+ 3d 62+ 74). (B.1) 


The eigenvalues of this operator are of the form 
T(T-+1) where T is a half-integer. This follows from 
the commutation laws (A.7) in the usual way. Here 
the analogy with angular momenta is really useful. 

The value of T is used as a quantum number, the 
total isotopic-spin quantum number; the latter is, of 
course, well known from nuclear spectroscopy. If, in 
particular, we have only one meson, we see from (A.22 
that 


T= =? (B.2) 
and (B.1) becomes 
aE oe (B.3) 


meaning, of course, that the number on the left is an 
eigenvalue of the operator on the right. By means of 
(A.22) it is an easy matter to find which linear com- 
binations of the states 


PQ, PQs, +++, ND (B.4) 


are eigenstates of T?. One finds for example that PQ. 
is such an eigenstate, with T=}. On the other hand 
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N94 is not, etc. We need not go into details, as the 
subject is treated in detail elsewhere.” 


Y APPENDIX C 


One proves (5.33) as follows: Using (5.32) to repre- 
sent |g@-+-> we find 


<9B+|pat > = <La pat > 
+ <B|Vq8(E,—in—H)7| pot >. 
The second term on the right may be written as 
(gain) <E] V," | pat > 
which contains a quantity similar to (5.35) and may 
be similarly transformed to 
(oa wp in) { <B| V (E= p— H) V la> + 
<B| Vv," (Eyt+in—H)7V p|a> }. 


(C.1) 


(C.2) 
The first term on the right of (C.1) can be transformed 
as follows, by means of (5.32) 


<Blaq|po+ >= <6] a,a,*|a¢>+ 
<B|a,(Ep+in—H)“V,|a>. 


In the first term we use a,@,"=6,p+¢a,*a, and then 


(C.3) 


47 See for example Bethe and de Hoffman.? 
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use (5.37) twice, getting 
Sq pSap-t <B| ape (E,— 0-H) Vt la> = 
Sqpbap-t <B|Vp(E.—op—H) 1" (E — wH) la>. 
(C.4) 
In order to evaluate the second term of (C.3) we notice 
that, owing to (5.27) 
a(EptHin— H) 
= (Eye in— H) {art V (Ert in-H). 
The second term of (C.3) then becomes 
(op watin {<l aV p|a> 
+ <| V" (Ep+in- HV la> 
or, after interchanging a, and V, and using again, (5.37), 
(wp — watin) <6| Vp(E—w— H) V * | a> 
+ <B|V 7 (Eptin— HV la>. 
We must now add (C.2), (C.4), and (C.5). This gives 
<B+| pot > =Sopda6t <B|VpLV,*la>, 
where 
L= (wg—wp— in) { (E,W p— H)!— (E, —w— H) 
a Go ea e a a A, 


(C.5) 


Hence (5.33) 


LED è 
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REVIEWS OF MODERN PHYSICS 


Analysis of Variance of the 1952 Data on the 
Atomic Constants and a New Adjustment, 1955* 


E. Ricnarp Conen, North American Aviation I nc., Downey, California 
AND 


Jesse W. M. DuMonp, Tuomas W. LAYTON, AND JouN S. RoLLETT, California Institute of Technology, 
Pasadena, California 


The 1952 data used by DuMond and Cohen in an evaluation of 
the atomic constants are analyzed for the presence of systematic 
errors by a variance analysis performed by an electronic digital 
computer. For simplicity the velocity of light is treated as a fixed 
constant of known value and there remain then eleven linear equa- 
tions in four unknowns subject to least-squares adjustment. 
Least-squares adjustments of 219 over-determined subsets of 
these equations have been made and x? has been evaluated for 
cach such subset. An analysis of these data indicates that small 
systematic errors are most likely to exist in the following input 
data: (1) The determination of the Faraday by the silver volt- 
ameter. (2) The determination of the cyclotron resonance fre- 
quency of the proton by th@inverse cyclotron method of Bloch 
and Jeffreys. (3) Certain of the higher voltage determinations 
of /e by the continuous x-ray quantum limit. In descending order 
of magnitude of discrepancy from the remaining data on the 
constants are the determinations of (a) Felt, Harris, and DuMond 
made at ‘24 500 volts, (b) Bearden and Schwarz at 19 600 volts, 


I. INTRODUCTION 


N an article! published in 1953 (herein designated 
as DC 53)’two of us, from carefully selected high 
precision measurements of quantities related to the 
atomic constants, formed an over-determined set of 13 
equations in the following five unknowns; the fine- 
structure constant, a, the velocity of light, c, the 
electronic charge, e, Avogadro’s number N, and the con- 
version factor, \,/s, from x-units (Siegbahn) to milli- 
angstroms. By the method of least squares this set was 
then solved for the “best” values of these unknowns. 
In that paper it was noted that x*, the measure of the 
incompatibility of such an over-determined set of 
equations, turned out to be considerably larger than 
its expected value which is just the number of degrees 
of freedom of the set, namely eight. This could as 
stated at that time, be ascribed either to the estimated 
error measures of the input data having been chosen 
too smal? or to the presence of unsuspected systematic 
errors in certain of the input data. In DC 53, in addi- 
tion to the complete set of 13 equations, three subsets 
called Cases II, HI, and IV (see Table III, page 702 
of that paper) were also subjected to least-squares 
analysis. In these subsets different groups of the equa- 
tions were omitted to see if such omissions might yield 


values of x? more nearly in line with statistical expecta- 


* This study was supported by the U. S. Atomic Energy Com- 


OW M. DuMond and E. R. Cohen, Revs. Modern Phys. 25, 


691 (1953). 


VOLUME 27, NUMBER 4 


OCTOBER, 1955 


(c) Bearden and Schwarz and also Bearden, Johnson, and Watts in 
the region between about 10 kv and about 6 kv. An analysis of the 
various observations taken by these observers at different voltages 
reveals a possible systematic trend when discrepancy is plotted 
against either voltage or window width in volts. Conjectures to 
account for the effect are discussed. 

The modifications called for by this analysis yield a new 1955 
adjustment in which x? is smaller than it was for the November, 
1952 adjustment. The new x?=3.25 is satisfactorily close to its 
expected value, 3. Thanks to the fact that the error measures 
adopted in the November, 1952 adjustment for the output values 
were conservatively based on the criterion of external consistency, 
the changes in the values occasioned as a result of the present 
analysis are all well within those estimated limits. A welcome 
effect of this new adjustment is that the adjusted output value of 
g/s, the conversion factor from Sieghahn’s x-units to milli- 
angstroms, now lies much closer to the input value, A new table of 
constants and conversion factors is presented. 


tion. No very conclusive evidence of systematic error 
could be drawn from these groups, but it was realized 
that this exploration of the consistency measures of 
subgroups was very far from being exhaustive. At that 
time the labor of performing a completely exhaustive 
survey of x? for all possible over-determined subsets of 
the 13 equations would have delayed the publication of 
the results unduly. For this reason the measure used 
to fix the standard deviations of all of the output values 
in DC 53 was based on external rather than internal 
consistency so as to reflect in the error measures the 
incompatibility of the 13 equations on which the ad- 
justed values were based. 
We have not been satisfied however with this com- 
promise and have felt that further efforts should be 
made to uncover, if possible, evidence for systematic 
errors in some of the data of the 13 equations which 
were used in DC 53. It was decided therefore to attack 
the problem by a form of analysis of variance. In 
such an analysis a large number of overdetermined 7 
subsets are subjected to least-squares Adjustment and E 
x is determined for each. fnspection of these data may 
then lead to the desired clues regarding the w 
abouts of systematic errors. i 
Least-squares adjustments are very laborious, p 
ticularly if they are carried through to an evalu: 
of x2. The task of making an exhaustive explo 
all of the ‘significant subsets of the 13 0 
equations of DC 53 and obtaining a value of x 
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364 COHEN, DUMOND, 
would be prohibitive by hand. Digital computers are 
on the other hand well adapted to just such work. 

In order to minimize the expense both for program- 
ming And for time on the computer it was decided to 
eliminate one unknown, the velocity of light, c, from 
the least-squares adjustment treating this quantity 
instead as a fixed known auxiliary constant. Reference 
to Table IV, page 704 of DC 53, giving the error matrix 
and the matrix of correlation coefficients, shows that ¢ 
is much more weakly correlated to the other unknowns 
than any of the rest and its retention in the variance 
analysis would therefore have been expensive and 
practically valueless. 


Il. THE MEASURE OF CONSISTENCY, x? 


As explained in DC 53 the experimental data com- 
prising the measured values of some 14 physical quanti- 
ties, six of which are so accurately known as to be 
appropriately treated as auxiliary fixed constants, are 
combined to form 13 equations [ (8.1) to (8.13), page 701 ] 
in the five primary unknownsa, c, e, N, and À. A= A/o 
the conversion factor between 2-units in Siegbahn’s 
scale and milliangstrom units.) Each of these equations 
is of the form of a simple product of powers of the 
primary unknowns 


aicie®..- =A (1—r) (1) 


equated to A, the numeric which results from the physi- 
cal measurement. If we were to put true values of the 
variables, a, c, e, etc., into the left-hand side of this 
equation we would, in general, not obtain the number, 
A, since it is the result of measurement and therefore 
subject to error. We must include the factor (1—r) in 
order that the equation shall be valid. The quantity, 7, 
is the actual relative error in the measured quantity, A, 
and is, of course, beyond our ken. Let there be 7 equa- 
tions of the type (1) above, each with its exponents 
in, Ju, etc., its numeric A,, and its relative error 7,. As 
explained in DC 53, the theory of least squares leads 
to the cenclusion that if the error distributions for the 
various physical measurements, A,, are Gaussian and 
have estimated standard deviations (root-mean-square 
deviations), cą, the best choice for the set of 7,, i.e., 
that set whose probability of occurrence is maximum, 
is the choice which minimizes the sum 


Os ae (2) 


This is the “least-squares” <ondition: The n equations 
(1) can be thought of as solved for the residues, r,,, 
written as functions of the unknowns a, c, e, etc., and 
we seek that set of values for the unknowns which 
minimizes the sum of the squares of the quotients of 
each residual divided by its standard deviation. 
If the distributions of the errors of measurement are 
other than Gaussian the condition of Maximum Likeli- 
hood as expressed above is tvo specialized, but it has 
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been shown? that independent of any assumptions 
about the shapes of the distribution curves of errors 
save only that the standard deviations, o,, exist (i.e., 
are finite) in each case, the least-squares condition (2) 
above is equivalent to the condition that the resultant 
solution shall be that set which has minimum root- 
mean-square deviations. 

For convenience in effecting the least-squares adjust- 
ment, the system of equations of type (1) is “linearized.” 
We adopt origin values ao, ¢o, €o, etc., which have been 
chosen sufficiently close to our expected solution that 
any set of values a, c, e, etc., in which we are likely to 
be interested will differ from these individual origin 
values by only small relative amounts; that is ao, Co, eo 
are so chosen that “«=(a—ao)/ao, %e=C—Co/Co"+* are 
always small, and we then express the experimental 
measurements in terms of these dimensionless variables 
Xa, Ve, Xe, etc. To each type of experimental determina- 
tion of a function such as Eq. (1) there corresponds a 
hyperplane 


ipat juve Rute} "= au Tp (3) 


which is tangent to the curved surface typified by 
Eq. (1). The constant, ap, is defined as (A „— A yo)/A no, 
in which A,o is the value of the function (1) when the 
origin values ao, Co, €o, etc., are inserted in the left 
member. The orientation of this plane in the hyperspace 
depends on the exponents ty, jy, Ru: ++ of (1) i.e., on the 
coefficients ip, fp, Ry» of (3), while the origin distance of 
the plane depends on a,. The entire status of our knowl- 
edge regarding the “best” values of a, v, e, etc., to fit 
the data is contained in the way these various planes 
intersect each other to define some compromise point, 
taking into consideration the relative reliability of the 
positioning of each plane, i.e., the magnitude of the 
standardard error o,. The function, Q, of Eq. (2) which 
is to be minimized is an expression of the second degree 
in the variables £a, Xe, Xe, etc., and the minimum condi- 
tion is obtained by equating to zero each of the partial 
derivatives of Q taken with respect to each variable in 
turn. It is easy to show that this process is equivalent 
to the following simple recipe for forming the so-called 
‘normal equations”: Write down the system of n 
observational equations, (1), omitting the residue, 7p, 
in each case. Assign to each equation a weight 


Pu= C/o,?, ' (4) 


where the constant C may have any convenient numeri- 
cal value. To obtain the normal equation for a given 
variable (i.e., the one which expresses the condition 
that the partial derivative of Q with respect to that 
variable shall vanish), multiply each of the linearized 
observational equations (with r, omitted) by the coeffi- 

2E. R. Cohen, Revs. Modern Phys. 25, 709 (1953); E. Whit- 
taker and G. Robinson, Calculus of Observations (Blackie and 
Sons, Ltd.), fourth edition, p. 224; R. L. Plackett [Biometrika 
36, 458 (1949) ] has traced the origins of the generalized theorem 


of least squares for non-normal distributions to Gauss, Laplace, 
and Markoff. “ 
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cient of the variable in question in that eq 
by the weight assigned to that equation 
individual expressions together to give a single “normal” 
equation. Repeat for each variable, and tt this way 
construct a set of q simultaneous equations fox he 
unknown quantities, ta, £e, 2, etc, ; 

The solution is completed by inserting the values 
obtained for the x’s into the original set of observational 
equations and finding the associated residuals: the 
values of the residuals computed using the solution of 
the normal equations we shall denote as R, (a number) 
to be distinguished from r, which is a function of the 
unknowns Xa, Xe, Xert. The minimum value of the 
quadratic expression Q, Eq. (2), is x2, the measure of 
consistency of the set of 1 observational equations. x? 
now can be computed. 


[L= (R,?/o,") : (5) 


The expectation value of x? is »—g. This difference 
n—q is called the number of “degrees of freedom” of 
the over-determined set of observational equations. 
Thus, taking as an example the set of 13 linearized 
observational equations in five unknowns which formed 
the basic data of DC 53 [designated as Eqs. (8.14) to 
(8.26), page 702 of that paper ] the value of x? for the set 
was 52.1 while the expected value was 13—5=8. The 
ratio r-/ri=[x?/(n—q) }! is the generalization to multi- 
dimensional space of R. T. Birge’s? ratio of the measures 
of external and internal consistency. In the above ex- 
ample 7,/r; was 2.55. This means that the error measures 
to be associated with the least-squares adjusted output 
values of Xa, Ve, Ve etc., must be scaled up, from the 
expected a priori values which one would assign to them 
calculated on the basis of the estimated c,’s of the input 
quantities, a,, by this factor r./r; in order to reflect the 
incompatibility of the observational equations with 
each other. If re/r: is significantly greater than unity 
this can mean either that the estimated o,’s of the in- 
put data were chosen too small or that the input data 
contain systematic errors. 

The method of computing the error measures and 
correlation coefficients of the output x’s from the inverse 
of the normal equation matrix is explained in detail in 
DC 53 (Sec. 4) and will therefore not be repeated here. 

R. A. Fisher has tabulated! the values for which the 
probability, e, that x° shall be greater than, or equal to, 
xo? takes, particular values c=0.99, 0.98---0.001 for 
different numbers of degrees of freedom, m=1, 2- - +30. 
This table is reproduced here for the first eight degrees 
of freedom (see Table T). e? 

Reference to this table shows that the probability of 
getting x? as great as or greater than 52.1 for the case 
of 8 degrees of freedom 1s less than 0,001. poem 

The object of our analysis was then to determine x“ 
f significant subsets of the equations 


uation and 
and add these 


for a large number 0 


it ss. Rev. 40, 213 (1932). 
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Taste [. Values of y? as function of e and m." 


‘ 

0.90 0.80 670. 050 
ware E sinc E 
0.000 0.001 0.004 O016 0064 O148 0455 
0.020 0.040 0.103 0.211 OM6 0.713 1.386 
0.115 0.185 0.352 O584 1.005 1424 7366 
0.297 0.429 0.711 1,064 1.649 2195 3.357 
0.554 0.752 1.145 1.610 2343 3.000 4351 
0.872 1.134 1.635 2204 3.070 3.828 538 
1.239 1.564 2.167 2.833 3.822 
1.646 2.032 2.733 3.490 : 


0.98 0.95 


, 


1 
t 
3 
6 
3 


0.30 0.20 0.10 0.05 


002 AOI 0.001 


1.074 1642 2.706 3.841 5412 6635 10.827 
2.408 3.219 4605 5.991 7824 9.210 13.815 
3.665 4.642 6251 7.815 9837 11.341 16.268 
4.878 5.989 7.779 9488 11.668 13.277 18.465 
6.064 7.289 9.236 11.070 13.38% 15.086 20.517 
7.231 8.558 10.645 12.592 15.033 16.812 22.457 
8.3383 9.803 12.017 14.067 16.622 18475 24322 
9.524 11.030 13.362 15.507 18.168 20.090 26.125 


I from Table [I of Fisher 1 Methada for 
published by Oliver and Boyd, Lt idinburgh. by 
author and publishers 
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of observation. Since the number of subsets is large, it 
seemed desirable to plan the work in such a way that 
the digital computer would automatically explore the 
subsets according to prearranged rules. We shall now 
explain briefly what equations were used, how they 
were coded, and the manner in which the computer 
program was carried out. 


III. SETUP OF THE SYSTEM OF EQUATIONS 
FOR THE ANALYSIS OF VARIANCE 
The 13 linearized equations of observation (8.14) to 
(8.26) in DC 53 (page 702) are written in five un- 
knowns xı to x; which correspond in the order named 
to the primitive variables, a, c, e, V, and A{A=h,/A,). 
We drop Eqs. (8.14) and (8.15) and substitute a 
fixed value for x2, namely 
4.=9.98+1.0 (ppm). á (6) 
This is the unknown which corresponds to the velocity 
of light and the value here chosen corresponds to fixing 
that velocity at 
c= 299793.0-+0.3 km sec™ (7) 
in accord with results obtained by Bergstrand and also 
by Froome. This leaves us with a set of 11 equations in 
4 unknowns corresponding to the primitive variables 
a, e, N, and À. In the present paper we shall designate 
the linearized unknowns with symbols x to x4 defined 
as follows: 
a=ay(1+ 10x) ; ay = 0.007 297000 (8) 
e=e(1+10-8x2); eg =4.802200% 10-4 (9) 
N=No(i+10-*xs); No=0.6025000X 10 (10) 
A=) (1410x); M= 1.0020200. * (11) 
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Tani IT. Observational equations, sources of data and coding numbers. 
Kina WARK i 7 l aa 
anes Kitabe (The x's AEN, in 105) Weight Description of experimental source DC 53 
0 1 0.11 Ag/^s = 1.002020 (6.8) 
1 1 0.07 NX (Birge’s average) | ; (6.12) 
2 1 4.92 Dayhoff fine structure splitting in deuterium | (6.13) 
3 1 0.19 Thomas, Driscoll, and Hipple, gyromagn. ratio of 
proton a 
4 1 0.58 Iodine \ 7, : ; 
{4 2 0.25 Silver BERS by electrochemistry (6.4) 
5 1 0.08 Bloch and Jefrics, (6.5) 
inverse cycl. i 
5 2 —3x 1-4 2x- rs SSS 0.83 Sommer, Thomas, and Mag. mom. of proton (6.6) 
Hipple, omegatron 
6 1 — xt x 0.04 Felt, Harris, and DuMond | Short wavelength (6.9) 
6 2 — arb x 0.06 Bearden, Johnson, and Watts ;limit of continuous (6.10) 
6 3 — xt xe 0.015 Bearden and Schwarz J x-ray spectrum (6.11) 


These a’s must not be confused with those of DC 53 
both because the subscripts are differently associated 
with the primitive unknowns, and because these new 
x’s represent relative deviations from the origin values 
in parts per hundred thousand instead of parts per 
million. 

In Table II we show the 11 equations in four un- 
knowns which form the object of our analysis of vari- 
ance. The equations are grouped into “kinds,” each 
kind of equation determining a particular linear com- 
bination of the unknowns. At the left appear two code 
numbers which label these equations as follows: The 
first column gives the code number designating the 
“kind” of equation. When there is more than one equa- 
tion of a given kind these are distinguished from each 
other by a second code number appearing in the second 
column. It will be noted that there is only one equation 
of each of the first four kinds. There are two equations 
belonging to each of the next two kinds; and the last 
kind has three equations. In Table II on the right of 
each observational equation the experiment which 
yielded the corresponding information is briefly de- 
scribed. (In almost every equation other experimental 
measurements of much higher accuracy than the one 
named may also have been involved in determining 
the numeric, but the one named is in each case the only 
one which contributes materially to the estimated 
uncertainty of that equation.) On the extreme right 
in Table II appears the equation number as it appeared 
in DC 53. The reader is referred to these equation 
numbers in that article for a fuller discussion of the 
sources of data and references to the original articles. 

The primary purpose of the entire investigation is, 
of course, to try to select a-subset of the equations of 
Table II from which the most reliable adjusted values 

of the four unknowns can be deduced. Let us call this 
our A objective. A common sense approach would then 
be to search for a subset, which determines all four 
unknowns and whose x? is the smallest. Care must be 
taken in evaluating the significance of these values of 
x? since they are computed from sets which have been 
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specially selected and not randomly chosen. ‘Thus, it is 
not strictly correct to use Fisher’s table (Table I) to 
evaluate the probability of an observed value of x? for 
these sets, but one would tend to give more weight to 
the sets with the greatest number of degrees of freedom 
because such sets repose on a broader experimental base. 

A closely related secondary objective is to try to 
locate which, if any, of the 11 equations of Table IT 
are likely to have systematic errors. We shall call this 
our B objective. One way of doing this is to examine 
those subsets whose x?’s are much larger than the ex- 
pected value to see if the presence of certain equations 
in such sets and their absence in other sets with smaller 
x?s may be used as evidence that those equations have 
systematic errors. A still better way (which we shall 
call the “method of increments to x”) is to make an 
exhaustive investigation of the effect on x? of adding a 
specified equation to each and every subset not already 
containing that equation. The new subset will have one 
more degree of freedom than the original one but, if the 
new x? exceeds that of the old subset by much more 
than unity, one has an indication that the equation so 
added may contain a systematic error, or that its ac- 
curacy has in some other way been overestimated. 

For the present analysis of variance, 219 subsets of 
the equations listed in Table II were selected and the 
x was determined for each. The selection rules for the 
first 171 subsets were chosen with the A objective 
chiefly in mind and with the idea of economizing on 
computing machine time. The following were the rules 
for selection of these subsets: 

(a) Not more than one equation of any one kind 
should be included in any one set, and (b) with this 
restriction, all sets should be examined which over- 
determine all the unknowns. The first 171 subsets 
examined, Tables III, IV, and V exhaust all the possi- 
bilities under these rules of selection. Admittedly this 
is not an exhaustive or complete analysis of variance 
however since (a) it omits all over-determined subsets 
containing two or more equations of the same kind 
which were excluded under the (a) selection rule, and 


"se 


| 
| 
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~ though admittedly only parti 
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(b) iie form of the equations in Table II is such that 
ae over; determined subsets exist which do not 
18 cobscl aE A four unknowns. [Table V(a) lists 

i i £ latter type.] Before discussing the 
reason for adopting these selection rules we shall first 
explain the symbols used to identify particular subsets 
because this terminology facilitates the discussion. 

Fhe computer symbol used for identifying a particu- 
lar subset of the equations of Table II isa number with 
one digit for each kind of equation. Each digit is the 
serial number of the equation of the corresponding kind 
included in the set, the position of the digit indicating 
the kind number. A zero in any position means that that 
kind has been omitted altogether. The set symbol 
1110203 thus indicatessinclusion in the set of the first 
equations of kinds 0, 1, and 2, omission of kind 3, in- 
clusion of the second equation of kind 4, omission of 
kind 5, and inclusion of the third equation of kind 6. 
It is also convenient to designate groups of sets accord- 
ing to the kinds of equations excluded or included with- 
out specific reference to which equation of any given 
kind is to be selected. Thus the set with symbol 1110203 
belongs to the group designated by the group symbol 
[1110101]. 

Consider now the rule that not more than one equa- 
tion. of any one kind should be included in the subsets 
to be examined. We can examine all the subsets satis- 
fying this rule and belonging to a specified group, for 
example the group [1110111]. There will surely be one 
of these subsets whose x? is the smallest, say the set 
1110123. As fgr as our A objective is concerned, once 
we have found this subset there seems little information 
of value likely to be gained by considering subsets in 
which one or more additional equations of any one of the 
kinds are added since such subsets will surely have 
greater x’. Such subsets must however be considered if 
we are to apply the method of increments to x ex- 
haustively. 

The restriction (b) to examine (subject to the first 
selection rule) only those sets which determine all of 
the unknowns eliminates from consideration certain 
degenerate subsets. This restriction was also obviously 
framed with reference to our A objective. 

After the calculations of x”s for the 171 selected 
subsets were completed, Professor John Tukey of 
Princeton University (to whom we are much indebted 
for valuable advice and guidance) pointed out that the 
B objective would be best served by accumulating as 
much statistical evidence as possible for or against the 
liklihood of systematic error in any given equation of 
Table II. To do this quite exhaustively by examining 
the increments in x? resulting from the addition of a 
given equation to all subsets, we shoùld have evaluated 
x? for many subsets which we had not considered. 

Fortunately however the evidence we have obtained, 
jal, seems quite sufficiently 


clear-cut to satisfy both A and B objectives. 


<> a 


IV. METHODS WITH DIGITAL COMPUTER 


The major calculations described in this paper were 
carried out on the Electrodata Datatron digital com- 
puter. The program was designed to carry out least- 
squares adjustments on sets of linear equations in 
either four or five unknowns, and to derive x* for the 
adjusted values. The normal equations were solved by 
inverting the (real symmetric) normal matrix, C, using 
the Gauss-Doolittle method given by Dwyer (1951).5 


The unknowns x- -x4 which we shall symbolize with 
the vector, x, were then obtained by solving 
C-k=Xo, (12) 


where k is the “normal vector” consisting of the numeri- 
cal constants forming the right-hand members of the 
normal equations. The precision of xa was improved by 
iterating : 

Ak,;=-C:-x; (13) 


Xur X= C -L Ak; (14) 


In general two cycles of iteration reduced all elements 
of Ak; to less than the tolerance of 0.0001. (For both 
the x’s and the 4’s one unit represents a relative devia- 
tion from the origin value of 1 part in 10°.) 

It was found to be economical to compute x* from 
xè, the sum of normalized residues for x=0, by using 


=xi—k-x. (15) 


Provision was made so that all of C, k, C+, and x 
could, if desired, be printed as they were obtained so 
that the adjusted values, variances and covariances 
of the unknowns could be found for any set of equa- 
tions. In ‘a survey of a large number of sets, however, 
this printing was omitted because it would have re- 
quired 30 seconds for each set as against 12 seconds for 
the calculations and for printing x? with the group of 
digits identifying the set. 

A great deal of preliminary labor was eliminated by 
preparing, not a detailed list of equations for each set, 
but the single list of 11 equations of Table IL and 
building into the program means by which sets would 
be selected automatically according to the prearranged 
selection rules (a) and (b) explained in the last section. 

A list was prepared by hand, giving all sets of kinds 
(i.e., all “groups”) of equations satisfying the selection 
rule (b), namely the sets must over-determine all the 
unknowns. For each set of kinds the machine generated 
all the sets of equations required and calculated x’ for 
each set as it was generated, printing only the set 
symbol and the value of x°. The more interesting of 
these cases, as judged by examining the x’’s were then 
reexamined in greater detail. Provision was made fora 
slight modification in the program so that the machine 
would calculate for a hand-prepared list of particular 
sets of equations (the interesting ones to be reexamined) 


5P. S. Dwyer, Linear Computations (John Wiley and Sons, 
Inc., New York, 1951), p. 191, . 
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stor TO READ IN SET Ke Ko 
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CLEAR READ IN su eeek Nag Kes pe Nee 
MEMORY ORDERS ORDERS READIN OBS. EQNS B 


@ © 
(Us)= SET wô? AND 
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pt TO C AND k 


(A) < 0 ||COLLECT GROUPI| (A)>O [EXTRACT Nx WITH 
-3 GROUP SYMBOL TO 
GROUP) GIVE Ne 


SET THE SIGNAL 
66669966066 
IN A AND STOP 


ld 


6666 


(33) 


SET THE SIGNAL 
IN B AND STOP 


A< o 


ITERATE 
ag ty = CN (k- Cay) 
UNTIL k= Cap 


©) 


SET (t9= 
CONTRIBUTIONS 
OF EQN, K, Ng 


[CAND K] AND 
ADD KN, TO 
Swat 


A= Ng 
% CHECK 


RESET (L6) 
E AND k] 


BFO |SKIPSET 


PRINT C 


FROM (L6) 


SKIP SET 


PRINT x 
FROM (L6) 


SIGNAL INDICES 
OF WRONG. & 


ELEMENT 


REARRANGE C 
IN (t5) ano (L6) @ 


FORM AUXILIARY 
MATAIX IN (L5)| 63) 
AND (L6) 


CHECK ALL 
ELEMENTS 
OF c,c"! 


FORM CIN (L6) (2) 


SKIP SET 


Fic. 1. Block diagram of the steps in machine computations of x?. It will be noted that certain operations of printing can be bypassed, 
see blocks 50, 51, 57, 61. This was done to save computer time on the first exploratory run in which only the set symbol and x? were 
printed for each set. Later the more interesting sets, as judged by the x”s, could be printed out in full detail as regards the C and C~! 


matrices and the k and x vectors. 


all the pertinent data and print the complete results: 
normal matrix, normal vector, inverse matrix, adjusted 
values of the unknowns, x?, and set symbol. 

As already stated, x? was computed and printed for 
171 sets, covering all cases under selection rules (a) and 
(b). From these, 26 sets were selected for recomputation 
with complete printing out of all details. At a later date 
the 48 additional sets of Table V(a) were computed. 
These are the sets which, though over-determined, fail 
to over-determine all four unknowns. 

The entire work of checking the code to be fed into 
the machine, as well as performing the calculations for 
the first 171 sets, examining the x?”s for the interesting 


Tagle III. Equations in 3 degrees of freedom. 


x? Designation of Eq. set 
13.71 1111223 
19.34 1111222 
26.91 1111221 
25.21 E 1111213 
30.66 1111212 
38.14 1111211 
103.25 1111123} 
07.71 1111122 
15.01 1111121 
11.29 1111113 
17.33 1111112 
24.92 1111111 - 


cases, and recalculating and printing the details occupied 
somewhat less than a “block” of time (6 hours). The 
net time expended in calculation and printing was only 
a little over one hour. 

A full account of the computer program, including the 
complete code and a detailed flow sheet is given in a 
Special Technical (A.E.C) Report [No. 18, Contract 
AT(04-3)-63] now available for private circulation. 
We reproduce here in Fig. 1 a block diagram of the 
steps in machine computation. The code is designed for 
the use of anyone wishing to perform similar variance 
analyses in as many as 50 equations of 10 kinds in 5 
unknowns. 


V. RESULTS OF THE VARIANCE ANALYSIS 


Subject to the selection rules (a) and (b) the equa- 
tions of Table II yielded 12 subsets of 3 degrees of 
freedom, 64 subsets of 2 degrees of freedom, and 95 
subsets of 1 degree of freedom. We list here, in Tables 
III, IV, and V, the sets in 3, 2, and 1 degrees of freedom, 
respectively. 

We shall abbreviate the words “degrees of freedom” 
with the symbol d.f. and shall refer to the x”’s of a whole 
class of those sets containing a given kind of equation 
and a given equation of that kind by writing the kind 
number in parenthesis after the symbol x? with the 
equation number as its subscript. Thus x2(63) refers to 
the class of all x”’s of sets containing equation number (3) 


Se is 
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“4 F 
09.34 1110223 A EA 
16.29 1110222 146 k no 
24.05 1110221 19.20 1111011 
16.99 1110213 03.96 1111203 
27.10 1110212 05.91 1111202 
35.04 1110211 12.04 1111201 
03.22 1110123 03.08 1111103 
07.55 1110122 06.78 1111102 
14.81 1110121 13.72 1111101 
08.50 1110113 10.93 0111223 
16.80 1110112 16.17 0111222 
24.57 1110111 23.66 0111221 
03.09 1111023 22.43 0111213 
07.33 1111022 27.55 0111212 
14.55 1111021 ° 34.95 0111211 
10.33 1111220 
21.75 1111210 


Taste IV. Equations in 2 degrees of freedom. 


> x? 

02.72 0111123 11.27 1011113 
06.99 O111122 16.07 1011112 
14.24 0111121 22.92 1011111 
10.47 0111113 08.78 1101223 
16.35 0111112 09,28 1101222 
23.90 0111111 13.45 1101221 
13.00 1011223 20.23 1101213 
19,11 1011222 20.77 1101212 
26.34 1011221 24.96 1101211 
24.50 1011213 01.59 1101123 
30.48 1011212 02.65 1101122 
37.63 1011211 07.48 1101121 
03.24 1011123 08.86 1101113 
06.65 1011122 10,41 1101112 
13.11 1011121 15.65 1101111 
00.44 1111120 


07.52 1111110 


of the sixth kind (the last equation in Table II). Inspec- 
tion of Table III shows that, in every 3 d.f. case, 


x (63) <x? (62) <x? (61), 


there being four such triplets for comparison. Exactly 
the same statement is true in every 2 d.f. case, there 
being among these 20 triplets for comparison. In the 1 
d.f. cases the same statement can be made for 27 out 
of the 29 triplets. The remaining two exceptional 
triplets are listed in Table VI. : 

The evidence seems strong, therefore, that the third 
equation of kind 6 yields better consistency in nearly 
every instance, than the second equation of this kind, 
and that the second equation of this kind yields better 
consistency than the first. 

A similar inquiry as regards the two equations of 
kind 5 leads to the conclusion that, in all six cases of 3 


d.f., all 29 cases of 2 d.f., and in 29 out of a total of 33 
cases of 1 d.f., Eq. (52) yields lower x? than Eq. (51). 
Also, as regards the two equations of kind (4), it appears 
that (4,) gives lower x? than (42) in all six cases of 
3 d.f., in 24 cases out of 26 of 2 d.f., and in 15 cases out 
of 33 of 1 d.f. Tables VIL and VIII show the few excep- 
tions to the very strong trends we have described. 
The evidence for the trend, x Ch) <x (4:), is ad- 
mittedly not quite as strong as for the other two trends 
because of the rather large number of exceptions (18) 
in the 1 d.f. cases. Of these 1 d.f. exceptions 15 are actual 
reversals of the trend and three are stalemates in which 
x? is unchanged by going from Eq. (4;) to Eq. Ch). 
The trend is so strongly indicated in the 3 d.f. and 
2 df. cases, to which we attach much more significance 
however, that we believe it is quite clearly indicated. 
Having now arrived at the conclusion that, among 


Taste V. Equations in 1 degree of freedom, 


ignati esignatio Designation Designation 
X ae £ PARE x£ prie £ of Ba. set 
7 2 07.58 1010223 05.25 1110220 00.36 1111020 
0473 1110022 16.21 1010222 10.25 1110210 00.17 2111010 
11.08 1110021 23.80 1010221 00.43 1110120 01.60 0111203 
04.42 1110013 14.15 1010213 02.82 1110110 03.40 0111202 
10.09 1110012 26.99 1010212 03.08 1111003 09.42 0111201 
17.35 1110011 34.80 1010211 03.89 1111002 02.63 0111103 
00.00 1110203 03.22 1010123 11.94 1111001 06.26 0111102 
00.05 1110202 06.45 1010122 0.271 0111023 13.19 o111101 
00.49 1110201 12.79 1010121 06.89 0111022 03.08 1011203 
00.70 1110103 08.26 1010113 14.10 0111021 05.89 1011202 
00.70 > 1110102 15.56 1010112 04.34 0111013 „1194 > 1011201 
07 1110101 22.70 1010111 11.45 0111012 03.08 1011103 
AG 0110223 00.87 1100223 19.19 0111011 05.89 1011902 
oaol 110222 01.21 1100222 03.08 1011023 1194 + 1011101 
ioe Cina 05.12 1100221 05.89 1011022» 00.35 1101203 
1833 0213 01.80 1100213 11.94 1011021 00.31 1101202 
0733 10212 03.00 1100212 03.08 1011013 03.03 1101201 
Gs 0110211 » 07.92 1100211 05.89 1011012 01.56 1101103 
aa 0119123 01.59 1100123 11.94 1011011 02.54 1101102 - 
02:1 O 02.64 1100122 01.58 1101023 07.27 1101101 
rete 0110121 07.48 1100121 02.60 1101022 02.27 1111200 
wae 9110113 02.78 1100113 07.36 1101021 00.42 1111100 
0613 . 110112 05.06 1100112 04.13 1101013 
152 1 10.76 1100111 07.46 + 1101012 
22.96 onon 13.52 1101011 ‘ 
. "e e f x 


<=> 3 
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Tapte V(a). x? for additional sets violating selection rule (b). 


07.73 0011223 07.73 1001223 
07.73 0011222 07.73 1001222 
07.73 ° 0011221 07.73 1001221 
19.12 0011213 19.12 1001213 
19.12 0011212 19.12 1001212 
19.12 0011211 19.12 1001211 
00.00 0011123 00.00 1001123 
00.00 0011122 00.00 1001122 
00.00 0011121 00.00 1001121 
06.78 0011113 06.78 1001113 
06.78 0011112 06.78 1001112 
06.78 0011111 06.78 1001111 
07.73 0101223 07.73 0111220 
07.73 0101222 19.12 0111210 
07.73 0101221 00.00 0111120 
19.12 0101213 06.78 0111110 
19.12 0101212 07.73 1011220 
19.12 0101211 19.12 1011210 
00.00 0101123 00.00 1011120 
00.00 0101122 06.78 1011110 
00.00 0101121 07.73 1101220 
06.78 0101113 19.12 1101210 
06.78 0101112 00.00 1101120 
06.78 0101111 06.78 1101110 


the equations of kinds 4, 5, and 6, those which give almost 
invariably the lowest x?’s are (41), (52), and (63), we 
now call attention to perhaps the most significant result 
of all which is the fact that, among the 12 sets of 3 d.f. 
in Table III (no kinds omitted), by far the best x? is 
associated with the set 1111123. The expected value of 
x for all these sets is 3 and x? (1111123)=3.25 is in 
strikingly good agreement with this, whereas the x? 
of all other 11 sets is much larger ranging from 7.71 
to 38.14. Interpolation in Table I shows that the odds 
are about 35 to 65 for getting x? in excess of this value, 
3.25. It seems to us that this set, 1111123, is therefore a 
satisfactory answer to our A objective. 

We cannot claim in this discussion to have made the 
most exhaustive possible efforts in the direction of our 
B objective, the accumulation of all possible evidence 
for or against the liklihood of systematic error being 
present in each of the 11 equations. To make such a 
complete search it would be desirable to explore the 
effect on x° of adding each of the 11 equations in turn 
to all sets not already containing that equation, and 
our present computer program has not covered enough 
cases to permit this. 

Table V(a) gives x? for 48 interesting subsets of 
equations which are exceptions to selection rule (b) 
in that, although they are over-determined by 1 degree 
of freedom, none of them over-determines all the un- 
knowns. This situation arises because the 48 sets of 
Table V(a) all contain equations of kinds 3, 4, and 5 
which are not algebraically independent. These three 


Taste VI. Exceptions to the rule, x?(63) <x?(62)<x2(61). 


x?(63) x?(62) 
OE 20.70 IET 
X 


x? (61) 


x?(1191201)=3.08 


1110102)=0.70 x2(1110102)=0.70 x?(1110101) = 2.97 
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equations are of the following form: 


3xı— Xo = 3 


xot x= da4 
— 323+ 2x: 4+ x= Gs. 


Tf we add the first and third equations and subtract 
the second the result is 


0= a3— a+ a5, 


and the numerical values of the a’s will in general fail 
to satisfy this relationship. 

We see from Table V(a) that the 48 sets can be 
grouped into four classes each characterized by a 
different value of x2, which is determined by the choice 
of the equations of kinds 4 and 5. Those sets which con- 
tain (4;) and (51) yield x?= 6.78 ; those which contain (41) 
and (52) yield x2=00.00 (more accurately this value is 
0.00122); those which contain (42) and (51) yield 
x2= 19.12; and (42,) (52) yield x?=7.73. The explanation 
of this lies in the fact that from equations of the kinds 3, 
4, and 5 we can only evaluate 3xı— x2 and #2+24 and we 
still need two other equations to completea just-determi- 
nate solution for our four unknowns. Thus in all these 


TaBLe VII. Exceptions to the rule, x°(52) <x2(51). 


x2 (S2) x? (51) 
x2(1011023)= 3.08 x2(1011013)= 3.08 
x2(1011022)= 5.89 x2(1011012)= 5.89 
x2(1011021)= 11.94 a er aa 
x?(1111020) = 00.36 x2(1111010) =00.17 


sets, which are 1 degree of freedom systems, the fourth 
and fifth equations (the ones beside 3, 4, and 5 which we 
choose) do not contribute to the over-determinateness. 
The x? value is determined entirely by the consistency 
of the over-determined set of 3, 4, 5. If this triplet 
of equations were consistent, @;—4@4-++as should be zero 
and hence the inconsistency is measured by the square 
of the residue (a3—a4+<as)?. If the standard deviations 
of the a’s are os, o4, and os then we have 


(@3—as+as)? 
os toy os 


The very low value, x? (41,52) =0.00122 is a result of 
the excellent agreement between the following ex- 
perimental values: The proton magnetic moment of 
Sommer, Thomas, and Hipple 


p’ = 2.792685 +-0.000030 
(before diamagnetic correction), 


the gyromagnetic ratio of the proton of Thomas, 
Driscoll, and Hipple 


2 


r= 


= 26752.3-£0.6 sec gauss, 


the atomic mass of the proton (regarded in our analysis 
= 


of 


of 


sis 
e 


| 


| 


} 
j 
5 


tions (42), (51), (61), 
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as a fixed auxiliary constant) 


M p=1.007593, 


and the Faraday constant measured 


f with the iodin 
coulometer of Vinal and Bates ge 


F=9652.15+0.13 emu (g mole) (physical scale). 


The iodine value of F is to be compared with the value 
computed from y, M, and p’ 


F=yM /p! =9652.16-£0.25. 


We thus have strong corroborative evidence that the 
iodine Faraday is more likely to be correct than the 
silver Faraday, and hence that Eq. (4) is to be 
preferred to Eq. (42). ° 

If we now utilize our selected best set of equations 
1111123, we obtain the least-squares solution given in 


TABLE VIII. Exceptions to the rule x2(41) <x? (4). 


x?(42) x2(41) 
2 degrees of freedom 
x2(1111202)= 5.91 x°(1111102)= 6.78 
x?(1111201) = 12.04 x2(1111101) = 13.72 
1 degree of freedom 
x°(1110203) = 0.00 ane 0.70 
x2(1110202)= 0.05 x2(1110102)= 0.70 
x2(1110201)= 0.49 x2(1110101)= 2.97 
x7(1100223)= 0.87 x2(1100123)= 1.59 
x2(1100222)= 1.21 nibs 2.64 
x2(1100221)= 5.12 x2(1100121)= 7.48 
x2(1100213)= 1.80 x2(1100113)= 2.78 
x2(1100212)= 3.00 x2(1100112)= 5.06 
x2(1100211)= 7.92 2(1100111)= 10.76 
x2(0111203)= 1.60 x2(0111103)= 2.63 
x2(0111202)= 3.40 x°(0111102)= 6.26 
x2(0111201)= 9.42 2(0111101)=13.19 
x2(1101203)= 0.35 x2(1101103)= 1.56 
x2(1101202)= 0.31 x2(1101102)= 2.54 
x2(1101201)= 3.08 2(1101101)= 7.27 
x2(1011203)= 3.08 x2(1011103)= 3.08 
2(1011202)= 5.89 AONE 5.89 
x-(1011201)=11.94 x 1011101) = 11.94 


Table IX. The result, x°=3.25, yields for the ratio of 
the scales of error by external and internal consistency 


the very satisfactory value 
re[ri=Le/ (n—D P= 1.04. 
Table X lists the error elements by external con- 
sistency (which now differ insignificantly from those by 


internal consistency). ; 
Tt is to be noted that the adjusted values of the un- 


knowns yielded by this new adjustment differ from 
those of DC 53 by amounts which are entirely inside 
the standard error ranges of the 1953 adjustment. 
The new standard errors are, however considerably 
smaller than those of DC 53- 9 

VI. DISCUSSION OF THE REJECTED EQUATIONS 


Possible Sources of Systematic Error 
m our new adjustment the equa- 
and (6s) because of the strong 


a 


` We have rejected fro 


a 


371 


Tante IX. Least-squares solution of the set 1111123. 


Change in 


unknowns L.S. adjusted values of ppm from 
lin ppm) primitive variables DC 53 
m= 392 = 7.29728 XK 1073 3 
a 0373 -3 
x= 137.2 $.80286X 10 esu —49 
x= — 23.7 N = 6.02486X 108 (g mole)! Phys. 22.3 
x= 194 d,/d,= 1.002039 —23.5 


probability, which our variance analysis has raised, 
of their containing systematic errors. These equations 
correspond respectively to (42), the determination of 
the Faraday by the silver voltameter, (51), the measure- 
ment of the magnetic moment of the proton by the 
inverse cyclotron method of Bloch and Jeffreys, (61) 
the x-ray continuous spectrum quantum limit determi- 
nation of the conversion constant from wavelengths in 
x-units (Siegbahn scale) to quantum energy in electron 
volts (“h/e” experiment) at 24.5 kv by Felt, Harris, 
and DuMond, and (6+) the same experiment at 10 and 
6 kev by Bearden, Johnson, and Watts. 

The fact that the Faraday determinations by the 
electrolysis of iodine are in better accord with the con- 
sensus of the data on the atomic constants than are the 
Faraday determinations using silver has been realized 
and pointed out now for many years. The reason is 
still obscure. Both determinations were made long be- 
fore the existence of isotopes was suspected. The fact 
that natural iodine is an isotopically pure substance 
while silver has two isotopes in nearly equal abundance 
has been often suggested as possibly having a bearing 
on this but the work at the U. S. National Bureau of 
Standards undertaken by Craig and Hoffman to clear 
up these questions seems so far to be inclusive. The 
discrepancy here is nearly four times the value to be 
expected from the estimated standard errors of the 
measurements. 

The results of our analysis of variance indicate that 
the discrepancy of 114 ppm between the inverse cyclo- 
tron measurements at Stanford University by Bloch 
and Jeffreys and the ““omegatron” measurements at the 


Tague X. Error elements by external consistency for 1111123. 
Sr 
Error matrix, vij =gis;rij (in ppm?) 


a e N Aa/he 
a 21.52 62.50 — 60.60 17,70 
e 62.50 373.2 — 480.0 139.6 
N — 60.60 — 480.0 726.0 —210.6 
Ag/As 17.70 139.6 —210.6 204.2 
Standard errors si (in ppm) 

4.65 19.30 26.9 14.5 
ee 
Correlation coefficients rij 

1.000 0.697 —0.485 0.263 

E 0.697 ` 1.000 —0.922 0.500 

N e —0.485 —0.922 1.000 — 0.533 

Ao/ds 0.263 0.500 — 0.538 - 1.000 
a 
nn 
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Tapir XI. X-ray quantum limit determinations. i 
Yomi c Discrepancy Discrepancy 
Experimenters Motoa (volts) het/ (eXo/ds) : Gated (volts) Remarks Í 
BJW" 6112 0.27 12370.8 —116 —0.7+0.4 W target 
BSb 8050 0.37 12371.9 —28 —0.2+0.5 Cu target — 
BS» 8050 0.37 12371.0 —101 —0.8+0.5 3 observations | 
Mo, Ta, Au targets / 
BS> 8050 0.37 12370.1 —174 —14+0.5 W target | i 
BSL 9860 0.47 12370.1 —174 —1.7+0.5 2 observatiors 
Cu, Ni targets | 
BJWs 10168 0.48 12371.2 —80 —0.8+0.6 W target | 
BS» 19600 0.96 12371.9 —28 —0.6+1.1 W target, W fil. | 
BS» 19600 0.96 12370.1 —174 —3.4+1.1 W target, Ox. Cath. i 
FHDs 24500 6.0 12370.0 —177 —4.341.3 W target, W fil. 


a Bearden, Johnson, and Watts, Phys. Rev. 81, 70 (1951). 
b J. A. Bearden and Guenter Schwarz, Phys. Rev. 79, 674 (1950). 
e Felt, Harris, and DuMond, Phys. Rev. 92, 1160 (1953). 


United States National Bureau of Standards by 
Sommer, Thomas, and Hipple on the magnetic moment 
of the proton, should be resolved in favor of the omega- 
tron results. This conclusion has recently been corrobo- 
rated at the Clarendon Laboratory, Oxford University, 
England, where D. J. Collington, A. N. Dellis, J. H. 
Sanders, and K. C. Turberfield have repeated the 
inverse cyclotron measurement, using instead of “dees” 
a straight-sided central conductor connected to the rf 
supply and two grounded outer segmental conductors. 
Their result (uncorrected for the diamagnetism of the 
proton resonance sample) is 


up =2.79273+0.00004 nuclear magnetons,° 


which differs from the omegatron value by only 16 ppm, 
an amount which is well within the probable error of 
the difference. Great care was taken to eliminate from 
the construction of the apparatus all material which was 
sufficiently magnetic to disturb the field in the region 
of the proton orbit by as much as 5 ppm. These workers 
state that most samples of brass tested had an unac- 
ceptably high susceptibility. The presence of slightly 
magnetic materials might well account for the low re- 
sult obtained in the Stanford experiments. 

The measurements of the short wavelength limit of 
the continuous x-ray spectrum have been made over a 
sufficiently wide range of voltages to permit a con- 
jecture as to the cause of discrepancy. The measure- 
ments published by Bearden, Johnson, and Watts, 
hereafter designated as BJW were performed at two 
different voltages and those of Bearden and Schwarz, 
hereafter designated 2s BS were made at four different 
voltages. As a result the voltage resolving power of the 
two-crystal spectrometer these workers used was differ- 
ent for these different voltages, the half-width at half- 
maximum height of the spectral “window” expressed 

-in volts being greater at the higher voltages. The work 
of Felt, Harris, and DuMond (FHD) done at the highest 
voltage of all had a still wider spectral window width in 
volts. : 

¢ J. H. Sanders (private communication, May 4, 1955); see also 


Letter to the Editor by Collington, Dellis, Sanders, and Turber- 
field, Phys. Rev. 99, 1622 (1955). ` 


-r 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Table XI lists these different experimental results in 
order of increasing voltage. The first point of maximum 
bending of the observed isochromat (or spectral in- 
tensity curve) was assumed in all this work as the true 
location of the quantum limit. As we shall see, the 
results cast some doubt on the exact validity of this | 
assumption. Each experimental determination is ac- 
tually aimed at measuring the quantum energy-wave- 
length conversion factor, Voo, relating energy, Vo, in 
volts to the corresponding x-ray wavelength ^o on the 
Siegbahn scale in x-units; 


Vodo=he?/ (eAg/ds) (16) 


then permits calculation of the universal constant, 
h/e, provided the constants c and ),/Astare given. (In 
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_ Fic. 2. Eleven determinations of the quantum limit of the con- 
tinuous x-ray spectrum measured at six different applied voltages. 
If we assume as an arbitrary reference a constant value of the 
voltage-wavelength conversion factor (the value of DC 53) the 
ordinates in this figure represent in volts how much the position 
of the observed first point of maximum bending in the intensity Í 
curve differed from the calculated position, while the abscissae i 
give the applied voltages at which the determinations were made. | 
The choice of a straight line for the least-squares fit to the points 

is purely hypothetical. 
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a 
DISCREPANCY IN VOLTS 


| 
| 


} 
B25, I960OV 
OXIDE COATED CATH, 


ə 


Fic. 3. Eleven determinations of 
the quantum limit of the continuous 


e 
| 


x-ray spectrum measured at six differ- 
ent applied voltages. Here the same 
data as those of Fig. 2 are plotted as 
ordinates, but the abscissae are the 
half-widths at half maximum height 
of the spectrometer windowecurve in 
cach case. The choice of a straight 
line for the least-squares fit to the 
points is purely hypothetical. 
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this equation e is expressed in absolute electrostatic 
units, V in absolute electromagnetic units.) 

The column labeled “a (volts)” gives the half-width 
at half-maximum height of the spectrometer “window” 
curve. The column labeled “‘hc*/(ed,/A;) gives the 
observed value of the voltage-wavelength conversion 
factor. The point of maximum bending is used as the 
criterion to determine the threshold. The columns 
labelled “discrepancy” give the relative and absolute 
deviations of each experimentally determined voltage 
threshold from the value required to give the DC 53 
least-squares adjusted value for /c*/(ed/ds) = 12372.2 
kilovolts «-units. 

Because the different experimenters listed in Table 
XI used different values for the constants c and Ag/As 
to calculate their results as to h/e we have recomputed 
and listed what they actually measured before these 
constants were ntr ioduced, namely the conversion 
factor Voo from volts to x-units. If no systematic 
or random errors were present the result as to this 
conversion factor should be the same at all voltages, 
but instead there seems to be a systematic trend toward 
higher values of VoAo as one goes toward higher voltages. 
We take our least-squares adjusted value (DC 53) 


hef (eħo/hs)=12372.2 kv x-units (17) 


as an arbitrary reference against which to compare 

these results. The columns in Table XI marked dis- 

crepancy” give the difference, expressed in parts per 

‘million and also in volts, between the observed Ser 

threshold position (first point of maximum bending 

and the calculated position based on Eq. (17). The 
3 
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IN VOLTS AT HALF MAX. HEIGHT OF SPECTROMETER WINDOW 
7 8 9 10 WW 12 


column marked a gives the calculated half-width at 
half-maximum height, expressed in volts, of the spectra- 
“window” curve or acceptance band of the spectroml 
eter. Figure 2 exhibits the discrepancy from the refer- 
ence value as a function of voltage and Fig. 3 shows the 
same quantity as a function of spectral window half- 
width, a. 

Our present conjecture is that the cause of systematic 
error may reside in the method used for locating the true 
threshold from the experimentally observed x-ray in- 
tensity curves. This supposition is based on the fact 
that the systematic trend exhibited in Fig. 2 and Fig. 3 
seems to indicate an increasing discrepancy or system- 
atic error either with increasing voltage at which the ex- 
periment is performed or with increasing spectral window 
width. In Fig. 4 we reproduce one of the isochromats 
from the FHD determination on which we have marked 
at A, the point of maximum bending and at B, the 
point at which the quantum threshold should occur to 
accord with the value of Eq. (17). The variation of about 
3.7 volts exhibited by the least-squares adjusted line 
of Fig. 3 is the same order as this separation between 
points A and B of Fig. 4. So much care was taken in 
the matter of both voltage and wavelength measure- 
ments in the FHD determination (about which one of 


us had direct knowledge) that it is very hard to | 


believe so large a discrepancy could be the result of 
errors in these measurements. 

Figure 5 shows how the point A of maximum bending 
in Fig. 4 was determined by plotting the third deriva- 
tive of the isochromat and locating its zero point. The 
third derivative was obtained by means-of a numerical 
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Fic. 4. One of the isochromats taken at 24.5 kv by FHD. The 
position, A, marks the point of maximum bending, determined by 
the method of Fig. 5, and the arrows indicate the estimated un- 
certainty in locating this position while the position B is the cal- 
culated position of the quantum limit based on the least-squares- 
adjusted best value of the voltage-wavelength conversion factor 
from the DC 53 adjustment. 


interpolation method using points at equidistant 
abscissa spacings on the smoothed curve of Fig. 4. The 
method is of course no more precise than the smoothing 
of the curve is reliable, and the assigned error in Fig. 5 
was based on variations in the location when the curve 
was smoothed by different people. This same error 
spread is shown in Fig. 4 by the arrows on either side of 
the abscissa location A. 

In the case of all the experiments listed in Table XI, 
save those of BJW, the method of isochromats was used. 
This consists ideally in monochromatizing an x-ray 
beam and measuring the continuous spectrum intensity 
associated with the selected wavelength as a function 
of the voltage applied to the x-ray tube. The resulting 
curve is called an isochromat. (In the case of BJW the 
voltage was fixed and the wavelength variable, but this 
difference is of no consequence.) In practice the “‘mono- 
chromatization” will actually consist of selection from 
the continuous spectrum by a monochromator whose 
band pass curve has a sharp peak at some determined 
wavelength with asymptotic decay on either side. The 
dynamical theory of x-ray reflection in perfect crystals 
leads one to expect an inverse square law of asymptotic 
decay at large distances from the peak of this mono- 
chromator window curve, even when the two-crystal 
spectrometer is used. As a result of this unavoidable 
experimental impurity the sharpness of definition of the 
continuous spectrum threshold is blurred because the 
observed intensity curve is the fold of the true spectrum 
into the window curve of the monochromator. It has 
been shown in an earlier paper’ that if F(z) is the ob- 
served intensity curve, where z stands for a variable 

proportional to the voltage if the curve is an isochromat, 


‘and f(z) is the curve expressing the ideal shape of the 


continuous x-ray spectrum, that 


rO=f sead 


7 Panofsky, Green, and DuMond, Phys. Rev. 62, 215 (1942). 


(18) 


LAYTON, 
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Herein we choose z=0 to be the quantum limit of the 
ideal spectrum so that f(z) =0 for z<0. Here the mono- 
chromator window curve is the function g(x) and the 
upper limit of the integral is «=z, since the ideal spec- 
trum vanishes there, i.e., f(2—«) =0 for x> z. 

The continuous x-ray spectrum from a thick target 
descends to meet the background at the quantum 
threshold in a more-or-less linear way with however 
certain localized breaks in slope or “knees” so that the 
curve becomes increasingly steep as the threshold is 
approached. Quite close to the threshold (within 10 or 
20 volts) certain irregularities first reported by Ohlin® 
appear in the continuous spectrum which are believed 
to be related to the banded structure of (1) the incom- 
pletely filled energy levels of structure electrons in the 
solid target material, and (2) the losses sustained by the 
bombarding electrons due to energy communicated by 
them to structure electrons.® 

An earlier method of locating the true threshold from 
the observed intensity curve was to project a tangent 
to that curve, from some pot on it just above the 
threshold region where a stable slope seemed to have 
been attained, downward to the background level, 
taking the intersection of this tangent therewith as the 
true threshold point. If the window curve had wings 
whose asymptotic approach to the background was 
faster than the inverse square law this method might be 
made reliable, but it is not so as the case stands. Even 
a strictly linear spgctrum without irregularities or 
breaks in slope, wien folded into such a window curve 
with inverse square “‘tails,” will give a curve whose slope 
increases logarithmically without limit as we go up from 
the threshold so that there is no definite correct point 
from which to draw the tangent. Window curves of the 
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Fic. 5. The third derivative of the isochromat of Fig. 4. The 
mean Crossover or zero point of the two curves was used to locate 
position A in Fig. 4. The assigned error in this position, which 
corresponds to the uncertainty indicated with the arrows in 
Fig. 4, was based on the spread in results obtained when the curve 
Es ae 4 was smoothed through the observed points by different 


8P. Ohlin, Arkiv Mat. Astron. Fysik.29A, No. 3 (1942); 29B, 
No. 4 (1942); 31A, No. 9 (1944); 33A. No. 23 (rose) f 


B. R. A. Nijboer, Physica 12, 461 (1946). = 
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type here described have infinite second moments, and 
even their fizst moments are logarithmically divergent 
It is this circumstance which makes the difficult i. 
Consequently the following alternative method ee 
been adopted. an 
If we differentiate Eq. (18) twice and assume that the 
ideal spectrum can be approximated by a series of 
straight-line segments with abrupt breaks in slope 
between tlem, i.e., let ve 


S@)=4y tier, i=1,2---0 


then it has been shown’ that 


z (9 


F") = g2)f' OHE g(e—2i) lai a;). 
> . e i=l 

This means that the second derivative, F” (z), of the 
observéd intensity curve will consist of a series of 
images of the window curve, each centered at one of the 
breaks in slope of the ideal spectrum and each pro- 
portional in height to the magnitude, (a:ı—a;), of the 
discontinuity in slope with which it is associated. The 
leading term g(z)f’(0) is no exception to the rule since 
in deriving (20) the quantum threshold itself is as- 
sumed to be a second order discontinuity at z=0, i.e., 
a break in slope from zero slope to slope f’(0). 

In the case of all the determinations listed in Table X 
the method used for locating the true threshold position 
was to take it at the first point of maximum bending, 
i.e., at the maximum value of F’’(z). In the FHD de- 
termination this was done by finding the zero point of 
the third derivative. In some of the other curves taken 
by BS the bending was so sharp that the point of 
maximum bending could be estimated by inspection of 
the curve. 

Now the point of maximum bending will fail to give 
the correct threshold position if there are adjacent 


(20) 


wy 
` 
Fg’ idi 


7 


ipi ~ discontinuity) whose 
Fic. 6. Postulated precipice (first-order 3 ose 
PIRE at the foot of ae continuous ray en fram ami 
t may account for the systematic trend (w1 i s 
ed r of the discrepancy, between the observed points 
of maximum bending and the positions computed from the least- 
squares adjusted value (DC 53) of the jolase mayd or co 
-version factor. The iacha R Geh would be eoi i E ike 

vindow-curve g(5) 1m y 

T a then consist, as shown, of the sum of two 


components g(s) and hg’ (). 
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Fic. 7. Calculated curves of F”, the second derivative of the 
isochromat assuming a witch-like g-curve for the spectrometer 
window profile if the true spectrum has the postulated precipice 
of Fig. 6. The abscissae, z, are plotted in units of the window 
half-width, a. The numer value, ô, attached to each curve is 
the ratio of the precipice overshoot za to the window half-width, a. 
The origin, z=0, is the quantum limit position and the shift of the 
maximum point away from this position with increasing 6 is 
evident. 


strong breaks in slope in the true spectrum which lie 
too close to the threshold break. This is because then 
the corresponding images of the window curve will lie so 
close together as to be only partially resolved or perhaps 
even totally unresolved. Such proximity of adjacent 
g-curves may produce a shifting of the peak correspond- 
ing to the quantum limit of the spectrum. We have 
made an investigation to see if any of the known Ohlin 
structures could have such an effect, but on no reason- 
able assumptions regarding their shape have we been 
able to conclude that they do so. 

There remains the possibility that the assumption 
of a mere second-order discontinuity in the ideal spec- 
trum at the quantum limit is too simple. We believe 
that the systematic trend indicated by the shifts of 
Table XI and Figs. 2 and 3 might be explained however 
if the ideal quantum limit consisted of a superposition 
of a first- and second-order discontinuity as shown in 
the upper diagram, Fig. 6. The first-order discontinuity 
or step function considered by itself may be seen to be 
the limiting case of two equal breaks in slope of oppo- 
site sign which have approached infinitely close to each 
other and have become infinitely large. Referring to 
Fig. 6, if % is the height of the precipice, s= tanĝ the 
slope above it, and s,=//s is the overshoot as shown, 
then it is easy to show that 

F" (2) =sg(2)-thg’ (2) =sg(2)+seag'(2). (20) 
Thus it should be clear that, whereas the second-order 
discontinuity generates a g-curve, the first-order dis- 
continuity generates a curve which is the first deriva- 
tive of the g-curve. The superposition of these two 
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Fic. 8. Calculated curves of F’”, the third derivative of the 
isochromat assuming a witch-like g-curve and a true spectrum 
with a precipice as well as a second-order discontinuity at the 
quantum limit. The parameter ô attached to each curve is the 
ratio of the overshoot, sa, to the window half-width, a. The shift 
with increasing 6 in the crossover or zero point of these curves 
away from the quantum limit at s=0 is clearly evident. These 
curves are to be compared with the curve of Fig. 5. The sym- 
metry of the upper and lower portions of the latter curve indicate 
that a value of ô can hardly be ascribed to it much larger than 0.5. 


curves, the primitive and its first derivative, may in- 
deed result in a peak in the F” (z) which is considerably 
shifted from the true quantum limit in just the direction 
which the observations show. It remains to investigate 
whether this explanation is quantitatively plausible as 
regards the magnitude of the shift. To test this we have 
assumed the window or g curve to be a “witch” of the 
form: 


g(x) =A (1407/0?) 7. (22) 


This is the simplest curve which has the appropriate 
inverse square behavior in its approaches. It may fail 
to represent the true window curve somewhat by hav- 
ing its points of inflection occur too near the peak 
(at x=a/V3 in fact). Substitution of (22) into (21) 
gives us the curves shown in Fig. 7 in which the shifts 
of the peak of the F” curve are clearly evident. One 
more differentiation gives us the F” curves of Fig. 8, 
the shifts of whose zero points can be seen. 
Let Az represent the shift in the position of the 
maximum of the F” (z) away from the true quantum 
limit, shifts to the left in Fig. 4 being counted as positive 
for convenience. Let a be the half-width at half-maxi- 
mum height of the g-ctirve [ witch, Eq. (22) ], and z the 
overshoot as defined in Fig. 6. We show in Fig. 9 a plot 
of Az/z;, as a function of a/%, for the witch. Points on 
this curve corresponding to different values of Az/a 
are also shown. The largest value of Az/a, the slope of 
the curve of Fig. 9 at the origin, is Az/a=1/v3, and we 
see that in this region Az/z,, behaves almost linearly with 
afz». According to Fig. 9 then, when the window half- 
width is much larger than the overshoot, the shift in 
the point of maximum bending away from the true 
quantum threshold is of the ozder of the overshoot and 
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Az/z, cannot exceed unity. When the window half- 
width is of the order of or smaller than the overshoot, 
the shift is proportional to the window half-width and 
for a witch-shaped window the ratio Az/a cannot 
exceed 37}. 

Now if we assume that the overshoot, Zm expressed 
in volts is independent of the voltage at which the ex- 
periment is performed and that the shift Az is pro- 
portional to window half-width, a reference to the 
experimental data of Fig. 3 shows from the slope of the 
line that Az/a must be of the order-of 0.6. Not only is 
this ratio uncomfortably large but, much worse, this 
hypothesis requires us to suppose that at the small 
window half-widths below 0.5 volt, such as that of 
BJW at 6 ky, the overshoot is so much larger than the 
window half-width that the presence of the precipice 
should be manifest in the experimental intensity curve. 
Nothing of this sort however appears there. We must 
therefore abandon this assumption. 

Thus, if we are to explain the shift it appears neces- 
sary to make the hypothesis_that the overshoot, Az, 
varies with the voltage on the x-ray tube, becoming 
smaller at lower voltage, and that the window half- 
width, a, is large compared to the overshoot 2, so that 
we are working in the upper right-hand region of the 
curve of Fig. 9 where Az=z;. 

It may not be implausible that the magnitude of the 
overshoot of this conjectured precipice should vary with 
applied voltage. Other features of the isochromat curve, 
such as the apparent distance of the Ohlin peaks from 
the first point of maximum bending seer to vary con- 
siderably with voltage. If the overshoot, Zm and there- 
fore the shift Az were known to be approximately 
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Tic. 9. The ordinates show the ratio of shift, Az, to overshoot, 
Za. The shift, Az, means the amount by which the point of maxi- 
mum bending of the isochromat is shifted away from the true 
quantum limit assuming a witch-like window curve and a true 
spectral profile incorporating both first- and second-order dis- 
continuities at the quantum limit after the manner of Fig. 6. 
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proportional to the applied voltage then an extrapola- 
tion of the Straight line of Fig. 2 to zero voltage might 
be expected to give a better value of he?/(edg/d.) than 
any of the observations. No comprehensive - 
the radiative interaction between high-voltage bom- 
barding electrons and a real crystal lattice taking into 
account modern knowledge of the solid state has yet 
been attempted however and help from such a theory 
is probably needed before further progress can be made. 
From the experimental side there is little hope of 
improving the spectral resolving power attainable with 


crystal diffraction much beyond what has been so far 
attained. 


theory of 


Note added August 10, 4955.—We are indebted to Professor 
John Tukey of Princeton University for certain privately com- 
municafed comments received after this paper had been accepted 
by the editors. Professor Tukey feels that it is wrong in principle 
to select the equation set (1111123) merely on the grounds that its 
x°(=3.25) falls very much closer than that of any of the other 
3-degree-of-freedom sets to the expected value x?=3. He much 
prefers the method of increments to x? because it is a more con- 
clusive method than the one we have used for the purpose of 
pinning suspicion of systematic error on specific equations. He 
therefore urges us still to make a complete exhaustive analysis 
involving determinations of x? for equation sets containing 
more than one equation of each kind. 

This would involve the additional evaluation of x? for 4 times as 
many equation sets as we have already done, if only one equation 
were added to a single kind in each set, or 19 times as many if all 
possible combinations were explored! Our original restriction to 
sets with only one equation of each kind (selection rule, a) was 
to economize on the heavy expense for programming and for 
computer, time. Neither funds nor other facilities are at present 
available to thé authors of this paper to carry out Professor 
Tukey’s suggestions. 

We admit the cogency of Professor Tukey’s contention, if the 
problem had to be solved by the variance analysis alone without sup- 
port from physical considerations. However, we do not claim 
that the value, x?=3.25, by itself consitutes a compelling reason 
for the choice of the set (1111123). Rather we regard it merely asa 
clue pointing out which equations to suspect in order that we may 
examine these more closely from the point of view of physics. 
In experimental physics most error estimates are at best very 
rough, and there is therefore wide room for honest differences of 
opinion as to how much money and effort it is wise to spend on 
the analysis of data whose weights must necessarily reflect this 
roughness. We understand Professor Tukey’s eagerness as a 
mathematical statistician to see this variance analysis pursued 
exhaustively and regret that it is beyond our present means to do 
so. We believe however that even in its present fragmentary state 
it has served very usefully. 

In discussing our choles e 5 

x important and valuable A 
TPE F7 among the 3 d.f. sets, comsspondng a me i 
(1111122) is “still quite bearable. according to a a 
chance of x? exceeding 7.71 is about 5%.) If it is a permissible = 

. this set uses Eq. (62) of weight 0.06 in place o 
at ath Sari eight only 0.015, it should be a stronger set, 
See Lee If both equation sets are acceptable, we 
DIT should combine them susing ciie weights, he quite 


ae ee "Tukey’s remarks were made, he had not been 
informed as yet about the voltage-dependent eee or ue 
(Table XI and Figs. 2 and 3) which appear when the in x pal 
measurements constituting the data of (62) and (6:) are moe ey 
examined. In the light of these trends and our eet as aoe 5 
origin we believe that a still better procedure g 


f the set (1111123), Professor Tukey 
point. He notices that the next 
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Taste XIT. Auxiliary constants. 


Rydberg wave number for infinite mass* 
R= 109737.309-+0.012 cm=* 
Rydberg wave numbers for the light nuclei 
Ru = 109677 .57640.012 cm 
Rp=109707.4194-0.012 cm 
Rite? = 109717.345+0.012 cm 
Ryot= 109722.267 40.012 cm 
Velocity of light 
¢=299793.0-0.3 km sec 
Atomic mass of neutron (physical scale) 
n= 1.008982+-0,000003 
Atomic mass of hydrogen (physical scale) 
H = 1.008142-£0,000003 
Atomic mass ratio of hydrogen to proton* 
H/M,=1.00054461 (Computed using atomic mass of electron, 
Nm =0.00054875) 
Atomic mass of the proton* (physical scale) 
M p= 1.0075930,000003 
Atomic mass of deuterium* (physical scale) 
D=2.014735-+0.000006 
Atomic mass ratio of deuterium to deuteron” 
D/M a= 1.00027244 (Computed using atomic mass of electron, 
Nm=0.00054875) (physical scale) 


Ratio of electron magnetic moment to proton magnetic momenf 
of Koenig, Prodell, and Kusch* without diamagnetic correction” 


CM p/(Nmp'))(1-+a/27— 2.973/1?) = 658.2288-+0.0004 


Correction factor pe/po for anomalous magnetic moment ot 
electron* 


Hef po= (1+a/24— 2.97302 /x*) = 1.0011453 
(Computed using the value 1/a= 137.04) 
Gas constant per mole (physical scale) 
Ro= (8.31696+9.00034) X 107 erg mole deg 
Standard volume of a perfect gas (physical scale) 
Vo=22420.7+0.6 cm? atmos mole 


a Koenig, Prodell, and Kusch, Phys. Rev. 88, 191 (1952). 


weighted average of the results, (62) and (63), is to select from both 
of them those individual determinations performed at the lowest 
voltages. Nine of these, 2 from (62) and 7 from (6s), form a quite ho- 
mogeneous cluster in the voltage range 6112 to 10168. No deter- 
minations have been made at voltages below this range. The 
weighted mean results of this cluster is . 


kef (€Xg/ds) = 12370.8-£0.2 kilovolt x-units, 


wherein the standard deviation, +0.2 is merely an index of the 
disagreement among the results of the 9 different measurements 
and does not reflect any allowance for the now-very-likely sys- 
tematic error of the group mean, (because of the use of the criterion 
of maximum bending). From the slope of the straight line in Fig. 2 
this systematic error could well be of the order of a volt at the 
group-average voltage of 8000 volts. This corresponds to a shife 
of 1.5 kilovolt x-units in /c?/(ed,z/A.). Thus the numerical value 
obtained as the mean from the selection of the individual data of 
Eqs. (62) and (63) does not differ significantly either as to value 
or probable weight from the one we have actually adopted using 
the results of Eq. (62) namely : 


he?/ (g/h) = 12370.77E1.03 kilovolt x-units. 
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Tagit XIII. Least-squares adjusted output values, The quantity following each Æ sign is the standard error, In the present 
adjustment, unlike that of DC 53, there is no significant difference between the scales of error by internal and by external consistency, 
Attention is called to the fact that the quantities in this table are observationally correlated so that in the computation of the error 
measures of derived values dependent on two or more of the values in this table the method explained in DC 53, Secs. 4and 10 must be 
used, Table X of this article gives the elements of the error matrix to be used in such computations. 


Avogadro’s constant (physical scale) 
N = (6.02486+-0.00016)X 10 (g mole) 
Loschmidt’s constant (physical scale) 
Lo=N/Vo= (2.6871924-0.00010)X 10 cm 
Electronic charge 
e= (4.80286-£0.00009) X 107" esu 
e' = e/c = (1.60206+0.00003)X 10- emu 
Electron rest mass 
m= (9.1083+0.0003)X 10-78 g 
Proton rest mass 
mp=Mp/N = (1.67239+0.00004)X 10 g 
Neutron rest mass 
mna=n/N = (1.674704-0.00004) X10! g 
Planck’s constant 
h= (6.62517 +0.00023)X 10?" erg sec 
h= h/27= (1.05443 +0.00004)X 10" erg sec 
Conversion factor from Siegbahn «-units to milliangstroms 
Ao/^s= 1.002039-0.000014 
Faraday constant (physical scale) 
F= Ne= (2.89366--0.00003) X 10" esu (g mole) 
F’=Ne/c= (9652.19+0.11) emu (g mole) 
Charge-to-mass ratio of the electron 
e/m= (5.27305-+0.00007)X 107 esu gm™ 
e [m= e] (mc)= (1.75890=-0.00002) X 107 emu gnr™ 
Ratio h/e 
h/e= (1.37942+-0.00002) X 107" erg sec (esu) 
Fine structure constant 
a=e/(hc)= (7.29729=-0.00003) X 10? 
1/a= 137.0373+0.0006 
a/2m = (1.1613983-0.000005) X 10? 
: a= (5.32504+-0.00005)X 10-5 
: 1—(1—a?)'= (0.2662523-0.000002) x 107 
Atomic mass of the electron (physical scale) 
Nm= (5.487633-0.00006) X 1074 
Ratio of mass of hydrogen to mass of proton* 


First Bohr radius 
a=? / (me?) =a/ (4rRao) 
= (5.29172+-0.00002) X 10-° cm 
Radius of electron orbit in normal #7’, referred to center of mass 
do! =a(1—a2)#= (5.29158-£0.00002) X 10 cm 
Separation of proton and electron in normal M! 
do! =a’ Raf Ru = (5.29446-+-0.00002) X 10- cm 


Compton wavelength of the electron 
Nee=h/ (nc) =02/(2Ra) = (24.2626--0.0002) X 107"! cm 
Ree=Ace/ (Zr) = (3.86151+0.00004) X 1071! cm 
Compton wayelength of the proton 
Nep=hh/(mpe) = (13.2141-£0.0002) 10-4 cm 
Kep=Aep/ (2) = (2.10308--0.00003) X 107" cm 
Compton wavelength of the neutron 
Nen=h/ (mnc) = (13.1959 0.0002)X 10-4 cm 
Xen=Nen/ (2r) = (2.10019 0.00003) X 1071 cm 
Classical electron radius 
ro= e&/ (mc?) =03/ (47 Reo) 
= (2.81785+0.00004) X 1073 cm 
r= (7.94030+0.00021) 10-76 cm 
Thomson cross section 
(8/3) mre? = (6.65205-£0.00018)X 1075 cm? 
Fine structure doublet separation in hydrogen ý 
AEn= (1/16)Rnæ[1+a/m+ (5/8—5.9+6/7r)] 
=0.365871+0.000003 cm7! 
= 10968.56+0.10 Mc sec 
Fine structure separation in deuterium 
AEp=AEnRpd/Ru=0.365970+0.000003 cm 
= 10971.54-0.10 Mc sec 
Zeeman displacement per gauss 
(e/mc)/ (4c) = (4.66885+0.00006) X 1075 cm™ gauss 
Boltzmann’s constant 


k= Ro/N = (1.38044-£0.00007) X 1071S erg deg 
k= (8.61670.0004) X 10~* ev deg 


7: 3h 
H/Myp= [4ra] 1/k=11605.4-40.5 deg ev- 
= 1,000544613-0.000000006 First radiation constant 


Atomic mass of proton (physical scale) 
M,=H—Nm=1.007593=.0.000003 
Ratio proton mass to electron mass 
3 Mp/(Nm)=1836.12+0.02 
Reduced mass of electron in hydrogen atom 
u=mM p/H = (9.1034+0.0003)X 108 g 
«  _ Schrödinger constant for a fixed nucleus 
£ 2m/h?= (1.63836+0.00007)X 107 erg cm 
Schrédinger constant for the hydrogen atom 
2u/42= (1.637483-0.00007) 107 erg cm 


¢1= 8mhc= (4.9918+0.0002)X 10715 erg cm 
Second radiation constant 
ca= hc/k=1.43880-£0.00007 cm deg 
Atomic specific heat constant 
¢2/c= (4.79931+-0.00023) X10-"! sec deg 
Wien displacement law constant® 
Xmax 1 = c2/ (4.96511423)=0.289782+0.000013 cm deg 
Stefan-Boltzmann constant 
a= (n7/60) (k4/7i8c?) = (0.56687-E0.00010) 


10-4 erg cm7? deg~! sec 


a The binding energy of the electron in the hydrogen atom has been included in th i 
atom is nae m, but more correctly m(1 —ja?-+--+:). à e quantity. The mass of the el 
> The numerical constant 4.96511423 “is the root of the transcendental equation, + =5(1—e-=). 
Casi . R j ‘ e = i 
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; Taste XII. — Continued. 


Sackur-Tetrode constant (physical scale of at, wt ) 
(So/Ro)pr= 5/2-FIn{ (2r Ro) N-14} 
= —5.57324+0,00007 
(So)px= — (46.35244+0.0020)x 107 


i : erg mole deg! 
Sackur-Tetrode constant (chemic 


al scale of at. wt.) 
(So/Ro)on= —5.57256-£0,00007 
(SoJon=— (46.3467-£0.0020) x 107 erg mole™ deg 
Bohr magneton 
o= he/(4irmc) = her, 
= (0.92731-+0.00002) x 10~™ erg gauss“ 
Anomalous electron moment correction 
Cl-ba/(2x)—2.973a4/a*)= e/uo= 1.001145358-+0.000000005 
(Computed using adjusted value a= (7.29729 -£0,00003) x 107) 
Magnetic moment of the electron 
Me= (0.92837 +0.00002)X 10 erg gauss! 
Nuclear magneton 
Bn=he/ (4armyc)=poNm/H* 
= (0.505038--0.000018) x 10- erg gauss-! 
Proton moment e 
u=2.79275+0.00003 nuclear magnetons 
= (1.41044-40.00004) x 10-72 erg gauss! 


Gyromagnetic ratio of the proton in hydrogen, uncorrected for 
diamagnetism 
y'= (2.675230.00004) X 10! radians sec“! gauss“! 
Gyromagnetic ratio of the proton (corrected) 
y= (2.67530=£0,00004) X 104 radians sec™! gauss“! 
Multiplier of (Curie constant)? to give magnetic moment per 
molecule a 
(3k/N) = (2.62178+-0.00010)X10- (erg mole deg=!)3 
Mass-Energy conversion factors 
1g= (5.61000+0.00011)X 107° Mev 
1 electron mass=0.510976-£0.000007 Mev 
1 atomic mass unit=931.1410.010 Mev 
1 proton mass=938,211+0.010 Mev 
1 neutron mass=939,505-£0.010 Mev 


Quantum energy conversion factors 
1 ev = (1.60206+.0.00003)X 10° erg 
E/i= he = (1.98618 +0.00007 ) x 10718 erg cm 
Ex,= (12397.67 40.22) 107° ev em 
FEX.= 12372.444:0.16 kilovolt x-units 
E/ v= (6.62517 +0.00023)X 10 erg sec 
E/v= (4.13541 -£0,00007) % 107 ev sec 
#/E= (5,03479-0,00017)% 10" cm~ ergs 
5/ E= 8066.03-+0.14 cm=! ey 
»/E= (1.50940 +0.00005)X 105 sec erg“ 
v/ f= (2.41814-+0.00004)X 104 sec ev~! 
de Broglie wavelengths, Ap, of elementary particles? 
Electrons 
Ape= (7.27377 0.00006) cm? sec/y 
= (1.552257+0.000016)X 10-" cm (erg)t/ (£) 
= (1.226378-£0,000010) ¥ 10-7 cm (ev)i/(E)s 
Protons 
ADp= (3.96149-+0.00005) X 1073 cm? sec™/v 
= (3.62253 +-0.00008) X 107 cm (erg)i/ (E) 
= (2,862020.00004) X 10 cm (ev)t/ (Eji 
Neutrons 
A pa = (3.95603-+0.00005)X 10-4 cm? sec™/v 
= (3.62004+0.00008)x 10™* cm (erg)i/ (Ej 
= (2.86005-+0.00004)X 10? cm (ev)/ (E)! 
Energy of 2200 m/sec neutron 
Ez29= 0,0252973 -+0.0000003 ev 
Velocity of 1/40 ev neutron 
9,025 = 2187.036+0.012 m/sec 
The Rydberg and related derived constants 
R.z= 109737.309-+0.012 cm 
Rac = (3.289843+0.000003 ) X 10" sect 
Rhe= (2.179580,00007 ) X10 erg 
Rhee XK 10 = 13.60483+0.00022 ev 
Hydrogen ionization potential 
To= Rut (hee) (Att /4-+ +++) K10- 
= 13.59765+0.00022 ev 


e These formulas apply only to nonrelativistic velocities. If the velocity of the particle is not negligible compared to the velocity of light. c, or the 
energy not negligible compared to the rest mass energy, we must use Av =AeLe(e+2) 14, where Ae is the appropriate Compton wavelength for the 
particle in question and e is the kinetic energy measured in units of the particle rest-mass. 


No significant modification whatever in any of our present output 
values would therefore have resulted. 


VII. CALCULATION OF THE ATOMIC CONSTANTS 
AND CONVERSION FACTORS AS OF 1955 


In order to calculate a useful table of atomic con- 
stants and conversion factors, from the adjusted values 
of the unknowns, œ, e, N, and A,/A., and their error 
elements, as given in Tables IX and X, it is necessary 
to combine these with certain auxiliary constants whose 
numerical values are in most éases known much more 
accurately than are the output values of the four un- 
knowns. The less accurately known auxiliary constants 
are quantities whose methods of measurement render 
‘them in a practical sense statistically independent of 
the above four unknowns. Table XTT lists these auxiliary 


D ? 


constants most of which are identical with those used 
in DC 53. Two of them however merit brief discussion. 


Velocity of Light 


We have adopted the value and its standard error 
given by K. D. Froome” as the result of a set of ten 
measurements with a symmetrical free-space microwave 
interferometer with four electromagnetic horns operat- 
ing at 24005 Mc/sec. Because of diffraction, the ve- 
locity thus obtained was higher than the true free-space | 
value by an amount dependent on the size of the horns 
and the spacing between them. The experiment was a 
preliminary one in preparation for a more accurate 
definitive experiment to be performed at 72 000 Mc/sec. 


2K. D. Froome, Proc. Roy. Soc. (London) A22% 195-215 
(1954). $ K 2 p 
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Its purpose was primarily concerned with the use of 
the instrument under conditions of large diffraction 
correction (up to the equivalent of 100 km/sec), so 
that the accuracy obtainable in eliminating this effect 
could be assessed. Although even better accuracy can 
be anticipated for the final definitive experiment, it was 
possible in even these preliminary measurements to 
correct very satisfactorily for diffraction effects with 
highly consistent results for the different types of wave- 
front. The standard error of +0.3 km/sec is stated by 
Froome to be the estimated error of a single observation, 
although the value obtained for the velocity depended on 
ten measurements, a number considerably in excess of 
that required to make the corrections for the diffraction 
effects. Hence this standard error of only one part per 
million is probably somewhat overestimated. The 
value obtained by Froome is in excellent agreement with 
the values obtained over long light paths with visible 
light waves by Bergstrand"! in Norway and by Macken- 
zic? in Scotland. Both of these observers used Berg- 
strand’s “‘Geodimeter,” a method employing Kerr cell- 
modulated light beams. The relative uncertainty in the 
velocity of light, c, is probably now so small compared 
to the uncertainties in the output values of the present 
least-squares adjustment that it appears very unlikely 
that further improvements in our knowledge of it will 
make any significant change in those output values. 


Rp, the Gas Constant per Mole (Physical Scale) 


This is the least accurately known of the auxiliary 
constants (+41 ppm). Its numerical value is changed 
from the value given in DC 53 by reason of the recent 
change" in the definition of the Kelvin scale of tempera- 
ture. The Tenth General Conference on Weights and 
Measures which met in Paris and Sevres during October, 
1954 decided: “to define the thermodynamic scale of 
temperature by means of the triple point of water as 
fixed fundamental point, by assigning to it the tem- 
perature 273.16°K, exactly.” This yields a new slightly 
different thermodynamic scale of temperature such 


1 Eric Bergstrand, Arkiv Fysik 2, 119 (1950); 3, 479 (1951). 

2I. C. C. Mackenzie, “The Geodimeter Measurement of the 
Ridgeway and Caithness Bases, 1953.” Her Majesty’s Stationary 
Office, 1954. 

43 E, C. Crittenden, Science 120, 1007 (1954). 
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that the best value of the ice point is 273.15°K (instead 
of 273.16°+-0.01°K as formerly). y 

All the other auxiliary fixed constants used in the 
present adjustment are unmodified, as to values or 
sources of the data, from those in DC 53 to which the 
reader should refer for such information. 

The seven auxiliary constants in Table XII, to each 
of which a star is affixed, are the ones used in setting 
up the basic observational equations of Table IT above. 
The remaining eight either enter the computations only 
for the purpose of computing the derived values of 
certain constants and conversion factors in Table XIII, 
after the least-squares adjusted values of the unknowns 
have been obtained, or are giyen simply for their in- 
trinsic utility. 

Throughout these tables, unless explicitly stated to 
the contrary, the physical scale of atomic weights is 
used because its definition is less ambiguous than that 
of the chemical scale. The chemical scale could be 
rendered not ambiguous by specifying a standard 


abundance ratio for the oxygen isotopes but no such “ 


action has, to our knowledge, been officially taken. 
R. T. Birge" has given the conversion factor r= 1.000272 
+0.000005 as the ratio of physical to chemical scales 
based on the assumed abundance ratio O':O!8:0" 
= (506-10) : 1: (0.204+0.008), and this we have adopted 
in the present calculations. Alfred O. Nier has dis- 
cussed the variations in isotopic abundance of oxygen 
from various natural sources and the various resulting 
values of r which range from r= 1.000268 to r=.1.000278. 
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INTRODUCTION 


HE problem of deriving the thermodynamic prop- 
erties of conduction electrons in metals can be 
nangen (o that of computing the number of states 
N (£) with energy less than Æ for the case of interest. 
The primary- purpose of this paper is the present 
methods for calculating N (Æ). Its secondary purpose is 
to apply these methods to various problems in the 
theory of metals and in particular to the case of elec- 
trons in a magnetic field. 

Before outlining the contents of the present paper, 
we shall sketch the historical background of the methods 
discussed in the following pages. 

The problem of computing N (E) for systems com- 
posed of identical and noninteracting parts can be re- 
duced to that of calculating traces of certain exponential 
operators. This last problem is formally identical to the 
evaluation of the partition function in quantum statis- 
tics (see Sec. 1.1). 

The first author to recognize and exploit this fact 
was Peierls,! in connection with the magnetic properties 
of conduċĉtion electrons in metals. The main advantage 
of computing N (E) in terms of traces is that one need 
not have any knowledge of the eigenfunctions and 
eigenvalues of the Hamiltonian of the system of interest. 
The recent work of Sondheimer and Wilson? on the 
magnetic properties of free electrons is based on ideas 
of this type. An important contribution of this paper is 
the fact that powerful complex variable techniques are 
introduced for evaluating N (Æ) in terms of traces of 
exponential operators. 

Perturbation-theoretic methods for calculating such 
traces for systems so complicated that it would be out 
of the question to evaluate them exactly have been 
developed by many workers in the quantum-statistical 
theory of nonideal gases and the theory of metals. 

In the first category, the papers of Kirkwood,’ Uhlen- 
beck and Beth,* and Goldberger and Adams’ are basic 


for our present purposes. Reference 5 is particularly 


= h ‘A.W.S.) was in the Applied Mathematics 
Rees E HON Naval Research Laboratory, during 
a major portion of the time required to complete this paper. 

1R. Peierls, Z. Physik 80, 763 (1933). ne 

2E H. Sondheimer and A. H. Wilsôn, Proc, Roy. Soc. (London 
A270, 173 (1951). `^ T PAN 

S Jent review of J. de Boer, Repts. Progr. in Phys. 
12 305 O81049) in complete references on the work of J. 

; oe 
G. Kirkwood in quantum statistics., DOT 

ie E. Beth, Physica 3, 729 ( 

$ ‘C Untenbe so and E. N. Adams, I, J. Chem. Phys. 20, 
240 (1952). 
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relevant, because it introduces a modern way for per- 
forming trace computations, inspired by a procedure of 
Schwinger’ in quantum electrodynamics. Method I of 
our article is based on certain results of reference 5 
(see Sec. 1.2). 

In the second category, we mention the beautiful 
study of Peierls' alluded to earlier, which contains a 
procedure for calculating traces of exponential operators 
which is related in a formal manner to the work in 
references 3, 4, and 5, and to our Method IL (see 
Seen L2): 

The most important application of the foregoing 
methods, as far as the present paper is concerned, is to 
the study of the magnetic properties of metals. A com- 
plete historical and critical summary of the older papers 
on the subject up to the year 1931 is contained in the 
excellent text of Van Vleck.” The more modern aspects 
of the problem can be found in numerous publications, 
such as Sondheimer and Wilson, Dingle,’ Osborne and 
Steele.’ More specific references, as well as the relation 
of the calculational procedures used in previous work 
to our own, will be given in the body of this paper 
(see Sec. 2.4). 

We shall now give an outline of the present article. 

In Sec. 1.1 we employ arguments essentially due to 
Dingle? to reduce the problem of computing .V(Z) to 
that of evaluating N(E), the number of orbital states 
with energy less than Æ, thus eliminating the difficulties 
introduced by the spin from future calculations. The 
problem of obtaining N(E) is virtually solved if we 
can calculate traces of suitable exponential operators, 
involving the spin-independent Hamiltonian» [see 
Eq. (1.1.6) ], as would be expected in view of our earlier 
remarks. 

In Sec. 1.2 we present two methods for evaluating 
such traces. Method I rests on the so-called Schwinger 
trace formula [see Eq. (1.2.8)], stated in the paper of 
Goldberger and Adams.’ Method If is based on a pro- 
cedure for expanding exponentia! operators similar to 
those in references 1, 4, and 5. We show that Methad IL 
is better than Method I in the senses that the Nth 
approximation in Method Fis properly contained in the 
corresponding Vth approximation in Method IL As 
far as practical applications are concerned, there are 


6 J. Schwinger, Phys. Rev. $2, 664 (1951), Sec. VI. 2D 
7? Van Vleck, Electric ard Mamai Susceplibilities (Oxford Uni- 
versity Press, London, 1932), first edition. 
R B. Dingle, Proc. Roy. Soc. (London) A211 500 (1952). 
9M. F. M. Osborne, Phys. Rev. 88, 438 (1952); M. C. Steele, 
Phys. Rev. 88, 451 (1952). , 
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382 A. W. SAENZ 
important differences between I and II. The main 
difference is that I appears to be particularly suitable 
for the discussion of properties of weakly bound elec- 
trons, while II would seem to be applicable not only 
to this situation, but also to the case when the per- 
turbing potentials are slowly varying over certain micro- 
scopic intervals, but are not necessarily “small” com- 
pared to the kinetic energy of the particles of the system 
of interest. 

In Sec. 1.3, using I and IJ, we present asymp- 
totic formulas for calculating the number of states 
for systems confined to the interior of “Jarge” con- 
tainers. In this discussion, the notion of wall potential 
plays a central role. The advantage of dealing with 
such large systems is that N(ÆE), and thus N (E), can 
be replaced by smooth functions of Æ. 

In Sec. 1.4 we formulate the problem of calculating 
the magnetic moment of an electron gas in a periodic 
potential supposing that the system is in a large con- 
tainer, in the sense of inequality (2.2.1). We find that 
the free energy, thermodynamic potential, and, a for- 
tiori, the magnetic moment, have the correct depend- 
ence on the volume of the container. 

In Part 2, we apply the foregoing methods to two 
examples. 

In Sec. 2.1 we use Method I to calculate N (Æ) for 
weakly bound electrons in a periodic potential to within 
terms with <2 in the sense of Eq. (1.3.15). 

In Sec. 2.2 we compute N (E) for a gas of free elec- 
trons in a magnetic field and discuss the corresponding 
magnetic moment. This case is one of the few problems 
in quantum statistics where an exact solution can be 
found, and a great deal of attention has been given to 
it, as can be seen from the number of references in 
Van Vleck’ and in Sec. 2.2. The basic tool which we 
employ in obtaining N (E) is a celebrated formula due 
to Mehler and applied very early to quantum-statistical 
problems by Uhlenbeck” and Husimi." We believe 
that our approach, although yielding no new results, 
represents a fresh and simple way of discussing this 
question. ; 

It would be most interesting to treat the effect of the 
walls on the magnetic properties of free electrons by 
means of Method II. We hope that the procedures 
presented here will be of aid in solving the difficult 
problem of the magnetic properties of electrons in 
periodic potentials. q 

In the Appendix we present a proof of Eq. (1.2.9), 
which is virtually the Schwinger trace formula, for 
terms with <2. In this proof we employ only elemen- 
tary results from degenerate perturbation theory. 

In this paper we are chiefly concerned with obtaining 

' formal results. We do not investigate questions of con- 
vergence or the legitimacy of interchanging limiting 


10 G. E. Uhlenbeck, J. Math. and Phys. 14, 10 (1935). A proof 
of the Mehler formula may be found in A. Erdélyi, Math. Z. 
44, 201 (1938). 

u K. Husimi, Proc. Phys. Math. -Soc. Japan 22, 264 (1940). 


r 
on 


AND 


CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


R. C. O'ROURKE 


processes. However, we have been very careful to point 
out such interchanges when they occurs The mathe- 
matical problems left open are fascinating ones in their 
own right and we hope that they will stimulate the 
attention which they deserve. 


PART 1: GENERAL METHODS 


1.1. General Relations between N(E) and Traces : 
of Exponential Operators | 


In the present paper, we shall deal with systems | 
composed of a large number of identical parts, whose 
mutual interactions we shall suppose to be negligible. 
Because of this circumstance, their total Hamiltonians 
are equal to a sum of identical one-particle Hamil- 
tonians. We shall suppose that the latter possess pure 
point spectra, that the degeneracy of the energy eigen- 
values is finite, and that there is a minimum eigenvalue. | 
In discussing the statistical-mechanical theory of such | 
assemblies, we shall encounter sums of the types 


S=} Ep), (1.1.1) 


involving the values of functions F(E) at the eigen- 
values Æp of one of the aforementioned one-particle 
Hamiltonians, say Ér." In (1.1.1), p specifies a one- 
particle state of the system and the sum runs over all 
these states. 

The evaluation of such sums is greatly facilitated by 
the introduction of function N (E) defined as 


N(£)=>_,U (E-E,), (1.1.2) 
where 
0 «<0, | 
ues x=0, (1.1.3) | 
1 «a>0. | 


From (1.1.2) we can interpret N (E) as the number | 
of states with energy smaller than Æ, provided that E | 
does not coincide with any of the Æp. i 

Let F(E) be differentiable in Æ, and let it vanish for 
E>, Then j> 


F(E) 


a @ 
s=- f dE 
dE 


—% 


N(B), (1.1.1) 


as the following argument shows: 


dF (E) dF (E) 


— aB——N(B)=— [ dE——— U(E-E 
J 3p VO- f a ZTE 


2 dF(E 
=-5 f az ©) SpE). 
A PIED dE P? 


_ 2 It will not be necessary to use an index to denote the par- 
ticular one-particle Hamiltonian we are considering since only 
their structure plays a role in this paper and since this structure 
is common to all of them. | 
A 13 In this article, we shall always denote abstract operators by 
lightface or boldface German letters, depending on whether we 
are dealing with scalars or vectors. We shall designate the eigen- 


eo 


es 


values of these operators by the co 
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We shall now exploit the fe 5 
ioes act that U(x) 
following integral representation": at U(x) has the 


1 qtio Jy 
UG)=——0 f A 


EI (1.1.3) 
where x 
ytin yhiT 
a f dz=lim f de 
y~ io To 7-iT 


We suppose that the series 


È exp(—zE,)=trace exp(—20r), (1.1.4) 
- f 


where we employ the familiar trace notation, converges 
for Re{z} >0, and furthermore, that we can interchange 


Epand 
ytin 
of dz 


ymn 


in what follows. From (1.7.2), (1.1.3)’, and (1.1.4), we 
are then led to the identity" 


rin zE 


1 
N(E)=0— |. dz— trace exp(—27). 


2ri yin Z 


(1.1.5) 


Now, trace exp(—zr) is nothing more than the 
analytic continuation of the ordinary partition function 


Z(8)=tracaexp(—BH7) => exp(—6E,), 
P 


E (1.1.4) 


in the sense that we have replaced 6 by the complex 
variable z, whose real part is positive. 

In the present paper, we shall deal with Hamiltonians 
which can be separated into two commuting parts 6 


> and &,, where © stands for an orbital and H, for a 


spin Hamiltonian, so that 


Sr=H+H,; HH.=9.9. (1.1.6) 


The eigenvalues Æp are therefore separable into a 
part E, belonging to and ghwm belonging to ©., 
where m takes all integral or half-integral values from 
— j to +j, depending on whether j is integral or half- 
integral, w is the Larmor frequency eH/2moc, and g is 
the Lande factor, which is 2 for electrons. Therefore, 


Fae nie, (GS) 


of Ór are purely discrete, 


E,=Entghom; |m| <i; 
Since the eigenvalues Æp 

so are the eigenvalues En of 9. ‘ 

responding Latin lightface or 

boldface letters. We shall not employ any fixed symbolism for 


Sigs ee Tane G. Doetsch, Theorie id Amea 
Laplace-Transformation (Dover Publications, New , p 
B; OTE earliest applicatioñ of formulas of this type to the theory 


of metals occurs in reference 2. 
T a 
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From (1.1.4), (1.1.6), and (1.1.7), we see that 
trace exp(—2%)7) is factorable: 


trace exp(~z0r) 


(1.1.8) 


= trace exp(—z§9)- E exp(—zghwm), 
Imi gi 
Re{s}>0. 


Combining (1.1.5) and (1.1.8), we arrive at the im- 
portant formula!’ 


N(E)= E E+ għam), (1.1.9) 
Imi <j 
where 
poe rtie exp(zÆ) 
NE) =— of dz——— trace exp(— z0). 
Rri Siyi Z 


This formula eliminates the difficulties introduced 
by the spin and reduces our problem to the computation 
of N(E), which represents the number of orbital states 
with energy less than Æ, provided that Æ does not 
coincide with any of the £,’s. We shall show in the 
following pages that this problem is virtually solved if 
we can construct trace exp(—s3C), since there exist 
powerful complex-variable methods for evaluating the 
contour integral in (1.1.9). 

Consider now an arbitrary representation, say the 
-diagonal representation, whose states are supposed to 
be complete and orthonormal, so that we have, in 
particular, 


SA) (1!) =4(A'—A), 


(1.1.10) 
fawna [n)= fn, ny 


where we have assumed, for convenience in future dis- 
cussions, that the \’s form a purely continuous spectrum. 
It is now advantageous to introduce the density 


matrix , 
NIPONE |n) exp(—2E,) (|); 
Re{z}>0, (1.1.11) 
where 
‘a n factors 
exp(D) = —O-D- + -V. (1.1.12) 
n=0 7! 4 


From now on we shall suppose Re{z}>0 and 7>9, 
without always stating this hypothesis explicitly. 
Using (1.1.10) and (1.1.11), we find 


trace exp(—2) = exp(—zE,) > 
= falsely 0143) 
16 Compare with reference 8, especially p. 501." iy 
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It is well to emphasize the fact that in terms of the 
g-diagonal representation 


trace exp(—s6)= f dq’(q'|exp(—2)|q’), (1.1.13) 


where the integration extends over the entire configu- 
ration space. 

The fact that the trace of an operator is invariant 
with respect to a change of basis is very welcome, since 
it means that we may employ any basis which is con- 
venient for the problem at hand. 

In the present study we shall identify the \ represen- 
tation with the coordinate-diagonal representation and 
shall expand our results in terms of plane waves. 

From (1.1.13), (1.1.13)’, and the fact that 


(q’ |exp(—2)|q’”) 


h 
exp 25 (vena) parar (1.1.14) 


1 
and 


8(q’—a"”) = (27) 7 f dk exp[—ik: (q’—4”)] 


a) 


we conclude that 


(q’|exp(—2) |a”) 


= of dk exp(—7k-q’) 


h 
xexp| =€ Evora’) exp(ik-q’’), 
t 


trace exp(—z69) 


(1.1.15a) 


= af an’ f dk exp(—ik-q’) 


Wie 
xel -26(-vea’) | exp(tk-q’). (1.1.15b) 
i 


We shall call Eq. (1.1.15b) the Kirkwood-Uhlenbeck 
formula, since these authors were the first to recognize 
and exploit the advantages of plane wave bases in con- 
structing partition functions in the theory of nonideal 
gases.2~ 

Our further work rests on Eqs. (1.1.13), (1.1.13)’, 
and (1.1.15). 

It is necessary to state at this point that the formal 
manipulations used in deriving these results cannot be 
justified for unrestricted operators É. Although the 
exact nature of the restrictions is unknown at present, 
it has been noted that one must be extremely careful 
with Hamiltonians whose coordinate dependence in the 

q'-diagonal representation exhibits either finite or infi- 
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nite discontinuities. The case of the rigid-sphere poten- 

tial, for example, introduces complications of this type.!7 
In the following discussion, we shall deal only with 

Hamiltonians which are analytic functions of the co- 

ordinate and momentum operators, in an attempt to 

avoid the aforementioned pitfalls. 


1.2. Two Methods for Calculating Traces of 
Exponential Operators 


We are ready to consider two perturbation-theoretic 
methods for constructing trace{exp(—2))} for systems 
for which it would be out of the question to evaluate 
this trace exactly. 


A. Method I 


Let us begin with the simpler method, Method I. 
Consider the exponential operator 


OQ (s)=exp[—s(a+b)], (1.2.1) 


where a is “large” comparect to b in the conventional 
sense of perturbation calculus and s is a real parameter. 

Following Goldberger and Adams, we define B(s) as 
follows: 


OQ (s)=exp(—sa)¥3(s). (4.2.2) 
Letting 
6(s)=exp(sa)b exp(—sa), (1.2.3) 


we can conclude from (1.2.1) to (1.2.3) that B(s) 
satisfies the initial value problem below: 


ð 

—B(s)=—b()B(S), 

ðs (1.2.4) 
%(0)=1, 


where 1 is the unit operator. 


Equations (1.2.4) are equivalent to the integral 
equation 
B)=1- f ds, (s1)B(s1), (1.2.5) 
0 


whose formal solution, obtained by the usual iterative 
procedure, is 


B= EB), 


Bo(s)=1, 


B=)" f asi fas: 


x f e 


(1.2.6) 


` 


17 See, for example, G. V. Chester, Phys. Rev. 93, 606 (1952): 
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Combining (1.2.2) and (1.2.6). we ota; f 
a identity 2.6), we obtain the formally 
Q(s)=exp(—sa) OB, ls). 


n=0 


(1.2.7) 


It can be shown, from (1.2.6) and (1.2.7), that 
traceQQ(s) = trace exp(—sa)— trace{b exp(—sa)} 


(-) 


; A 5 
i tracefb exp(—sn) f ds f dsz 
1 ; 


xf dbb b]. (1.28) 


+E 


Goldberger and Adams*-'* attribute this result to 
Schwinger. We shall call it the Schwinger trace formula. 

A simple method of proving (1.2.8) for the case when 
a has discrete eigenvalues is to calculate traces in the 
representation in which a is diagonal. 

Placing s=1; a=2zo, b=2,, where Oo is an un- 
perturbed Hamiltonian and §; the corresponding per- 
turbation operator, we get from (1.2.8) 


trace expl —2(©»+1) ]=trace exp(— 20) 


43 2"F,(2), (1.2.9) 


n=l 

where 

Fı(z)=— trace{ 9; exp(—260)}, 

n> 2: 

E n=l a 
F,(z)= trace 9, exp(— a») f dss f dso»: 
n 0 0 
xf ised) Sa, 
0 

and 


H1(s2)=exp(—s2o) D1 exp (sso). 


In the Appendix, we shall prove (1.2.9) for n<2 by 
means of conventional degenerate perturbation theory. 

The circumstance that the mth term of the sum in 
(1.2.9) contains É, exaclly m times suggests that 
Method I is ideally suited to cases in which : is 
“small” compared with Ho, in the usual sense of per- 
turbation theory. We shall discuss a situation of this 
type in Sec. 2.1. 


B. Method II 
In this method, we begin byewriting” 
; Qs) =exp(—sb) exp(—sa) B(s). (1.2.10) 


18 For anosher approach, see reference 17. 


19 This way ding exponential operators ¥ 
THE var o Sree Wem in quantum statistics, as can be 


by many one Weierence 3. See also references 1 and 5 
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From (1.2.10), we conclude that 


ð 
-B(s)=T6)W(s), (1.2.11) 
Os 
I(O) =1, 
where 
T(s)=explsa) exp(sb)Lexp(—sb),a] 
Xexp(—sa), (1.2.12) 


where the square bracket represents a commutator. 

In the cases considered in this paper, a is quadratic 
and 6 is at most linear in the one-particle momentum 
operator p, so that it is possible to work out the com- 
mutator in (1.1.30) in closed form. The final result 
does not involve exponential operators exp(-:s6). This 
result depends critically on the particular ordering of 
exp(—sb) and exp(—sa) in (1.2.10). For example, if 
this ordering were reversed, we would end up with 
commutators which could not be easily evaluated for 
the cases of interest. 

Equations (1.2.11) are equivalent to the integral 
equation 


wi)=1+ f ds\X(s:)W8(s,), (1.2.13) 
0 
whose formal solution, obtained by iteration, is thus 


Ws) = FW, (s), 
n=0 (1.2.14) 


Wıls)=1, 


nzi: 
a 41 inel 

w= f ds f dss f dsn Els) Elsa) e Tlen). 
0 0 o 


From (1.2.10) and (1.2.14), we obtain 


’ 


2 (5) =exp(—sb) exp(—sa) EWals), (1.2.15) 


n=l 


We now show that the series (1.1.15) is better than 
(1.2.7) in the sense of the “dominance” theorem: 
The Mth appréximation, 


Lon 
Oy (s)=exp(—sa)OB.(s), (1.216a) 
n=) . 


to Q(s) in Method I is properly contained in the Mth 
approximation 5 X 
Pie e > 
Qu (s) =exp(—sb) exp(—sa) Z W.(s), (1.2.16b) 
n=O 


to O(s) in Method II, for M=0, 1,2, +++, in the sense 
that all the terms of (1.2.16a) are contained in 41.2.16b) 


see, for example, from reterenc bul nol vice versa. ; 
for expansion procedures of similar type- i 
D $ | 
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Proof: 


We shall prove this theorem for the nontrivial case”? 
where [a,b ]0. 


First, we observe that for n=0, 1, 2, +++ we have 


(i) exp(—sa)%,(s) contains only terms with 
exactly nb’s. 

(ii) exp(—sb) exp(— sa)W, (s) contains an in- (1.2.17) 
finite number of terms with rb’s, 7> n, 
provided [a,b ]+0. 


Second, we notice that the expansions (1.2.7) and 
(1.2.15) represent the same operator © (s). Equating 
these expansions and using (1.2.16a) and (1.2.16b), 
we get: 


Qu (s)=Qu® ()+ Ras), (1.2.18) 


where Na(s) contains solely terms with > (M+1)b’s 
VETOR 

Third, we employ (1.2.17), (1.2.18), and this property 
of Rar(s) to deduce that all the terms of Qu ® (s) are 
contained in Qa (s), for Rahs) consists solely of 
terms having more than Mb’s, while Oy,“(s) has only 
terms with < Mb’s. On the other hand the terms of 
Qu (s) cannot all be contained in Qa (s) because, 
according to (1.2.16b) and (1.2.17), the former consists 
of a sum of terms with 7b’s, where r takes on an infinite 
number of integral values > M, while Qx (s) consists 
of a finite number of terms each one of which contains 
rb’s, where r < M.QED. 

Placing s=1;a=20; b=20u, we obtain from (1.2.1), 
(1.2.14), and (1.2.15), 


trace exp[—2(©0+ 61) ] 
= trace{exp(—z1) exp(—2©o)} 
Hank (e), (1.2.19) 


n=l 


where 


Fn'(z)== trace {exp(-26) exp(—2o) 


1 a Oe 
xf dss | ds: f dSn 
0 0 0 


TO AS |, 
and 7 
ay (sz) =exp(szOo) exp(szD1) 
x [exp(—s2§;),Go]] exp(—se&x). 


- In view of the preceding “dominance” theorem, we 
expect that the series (1.2.19) is more accurate than 
(1.2.9), if an equal number of terms is kept in both 
cases. 

2 Tf [a,4]=0, Eqs. (1.2.12) and (1.1.14) imply that 9%,(s)=0 
for n=1, 2, ---, so that exp[—s(a-+b)]=exp(—sa) exp(—sb) as 
is well known. 


— > 
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It is clear that Method II is closely akin to the usual 
W. K. B. method of quantum mechanics. We thus ex- 
pect that it will prove useful in cases where $y, although 
not necessarily “small” compared to Ho, is nonetheless 
a slowly varying function of position in the q’-diagonal 
representation over distances of the order of the mean 
De Broglie wavelength of the assembly of particles of 
interest. = 


1.3. N (E) for Large Systems 


The functions N(E) in Sec. 1.1 are sums of step 
functions. The aim of the present section is to replace 
them by smooth functions of Æ for systems confined in 
containers which are “large” in a sense ‘to be specified 
presently. The introduction of these smooth functions 
will enormously simplify the calculations in the suc- 
ceeding sections. The results obtained below may be 
viewed as asymptotic expressions for containers whose 
volume Q>. 

In carrying out this program, we shall make use of 
the concept of wall potential, whose main role is to 
simulate the walls of the aforementioned containers. 

We now regard Éo in Sec. 1.2 as composed of two 
parts: 


Ho=K+W, (1.3.1a) 
so that 
H=RK+W+ I, (1.3.1b) 


where Ñ can represent, for example, the kinetic energy 
operator of a particle in 2, while YW stands for the wall 
potential operator. W depends only on q and its eigen- 
values W (q’), defined by 


8 (q)| a’) = Wa’) |’) 


are different from zero only when q’ is close to the walls 
of 9, in a sense which we proceed to make clear. 
Figure 1 shows a sketch of the qualitative shape of 
which we shall adopt in this paper. The wall potential 
W (q’) must effectively vanish in region I, which repre- 


(1.3.2) 


Wig!) 
t 


Wig) %o oe 


` 


Fic. 1. Qualitative shape of W(q’). 


GD 


, approach zero rapid] 
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oe We Bani of Q. In II, which simulates the walls 
K , W(q’) must undergo an abrupt but continuous 
change to a large positive value Wo. In TII, W(q’) must 
. . d : p; 
go to eG in any manner such that the wave func- 
tions (x|q’), corresponding to the eigenvalues £, of © 
. . > pasi amy 
i ) y in this same region III. This 
particular choice of W (q’) also eliminates the possibility 
of continuous spectra of ©. 
We shail now give a plausibility argument, based on 


the above model of X, for the validity of the formula 
below: 


faf das: f dq,F(qi, q2, +++, qr) 


or je -0 o 


(di exp(—2) | q2)- (qo| exp(—z2) |q) 


X (a-| exp(—2 | qr) 


=f d f dar: f dq-F (qi, q2, +, q+) 
[2] 12) 9) 


-(qu] expl—2(S—W)]| q2) 
X (q2| expl—2z(S—W)]] q) + - 
-(a-|expl—z(©—)]]| a1), 


where [Q] indicates an integration over {2 and where 
$(qi,"--q,) is a smooth function of its arguments 
which is slowly varying over distances comparable to 
the thickness òf the walls. 

We shall base our arguments on Method IL. It will 
be sufficient to consider the case r=1: the others can 
be dealt with similarly. 

From (1.2.7), (1.2.12), (1.2.14), and (1.2.15), placing 
a=2(9—YW) ; b=% we find: 


(1.3.3) 


(q’ |exp(—2) |q”)=expl—21V (q’) ] 
Xa’ |expl—2(—X) J] a”) 


expla @)]s2"( al lexol 20-8] 


1 81 8n-1 
x f as f figs f ds Ulsa) Ulss): 
5 O Y 


XU(snz) la), (1.3.4) 


where i 
(sz) =exp[sz(H—-W)] exp(s:®) 
X[exp( 8), H-Wexpl-s2(H-W], 
and where exp (s2¥8)[exp(—s228), H—W] can be ex- 


ici i dinate diagonal repre- 
plicitly worked out in the coordi Baker Ses 
‘sentation for the Hamiltonian in (1.4.7), with the 


result 


(d |exp(s28)exp(—s28), SWI} q”) 


h? 
=--> a Va" (q')—s2( vW (a) 
iA 


y 


2ie 
+ —¥ W (q) Alq’) 
ne 


H27 W (q) Vy laa"). (1.3.5) 


If the volume of region I, say 9, is made arbitrarily 
large with respect to that of region II, keeping 
W (q')&0 for q' in I and letting W take on arbitrarily 
large values, we obtain, for fixed z and Re{z}>0: 


f dq’ 5(q’) expl—zW(q’) } 
x(q’ |exp[—2(—%W) ]/ q/) 
~f saa le-a- 1.3.6) 
(2) 


If it is possible, by an appropriate choice W (q') to 


dominate 
1 n Saml 
expl—2(9-28)] f dsi f ste f dsa 
0 o o 


| 
E ---E(ond) “’) 


kd 
DC 


n=l 


=P(¢,4"l2), (1.3.7) 


whose absolute value differs from zero only in regions 
II and III, by means of exp[—zW (q')], in the sense of 
making the product of these quantities as small as de- 
sired, then we also have, for Re{z}>0 and fixed z: 


f dq’5(q’) expl—2I(q’) ]P(q‘,q’"|2)=0. (1.3.8) 


-o 


In view of the fact that each term of the sum 
(1.3.7) is a polynomial in W(q') and its deriva 
according to (1.3.5), its dominance by the exponel 
exp[—zW (q')] in regions IL and III is quite pla 

If we accept (1.3.6) and (1.3.8), we arrive 
result (1.3.3) forr=1. | 

No attempt will be made to estimate the acci 
(1.3.4) since the failure of this approximation | 
correspond to “surface phenomena.” These p 
are of importance only in small enough 
teria for determining whether a system is la 
sense are trustworthy if and only if the 
actual calculations. A frequently used cri 
trons in a magnetic field is given i 


- 
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From (1.3.3) follow two useful results. 
First, we can combine this equation, for n=1 and 
$(qr)=1, with (1.1.9) and (1.1.13), thus obtaining 


1 vie dz exp(sl) 
N(E)S— -f ayo f PE 
2riv’ im ie s 


x(q’ |expl—s(©—W)]|q’). 


The approximation (1.3.9) is valuable for the follow- 
ing reasons: the right-hand side can be calculated 
without knowing IW and is a smooth function of Æ for 
the case considered in detail in this paper (see Sec. 2.2) ; 
and it leads to interesting conclusions regarding the 
thermodynamic properties of the systems of interest 
(see Sec. 1.4). 

Second, we can use (1.3.3) to construct a smooth 
function 9t*(E) to replace N(E) by a perturbation 
method based on Methods I and II. 

To carry out this construction, we use (1.1.13)’, 
(1.2.9), (1.3.1), and (1.3.3), where we replace © by 
©b, to reduce the integrations over configuration space 
to integrations over 9. The replacement of by Éo in 
(1.3.3) is legitimate, since it merely involves setting 
,=0 in this equation. 

As an example, let us consider the first term on the 
right-hand side, employing the foregoing equations. We 
have 


trace exp(—20o) = trace exp[ —2(%+28) ], 


(1.3.9) 


= f aga Jep- 


= f dq'(a’|exp(—2)|q’). 


[9] 


Treating the remaining terms in a similar way, we 
obtain: 


trace exp(—2)& | dq’(q’|exp(—z8)|q’) 
[9] 


+r >a Bune (z), 
n=l 


eye j K J asl Gal a) aslexp(—28) La, 


ne 2: 
(o 


F,*(¢)=—— 


so=1 |3 81 8n—2 
dsi | dss: G Ji dSn- 
n vo 0 g 0 
xf a f dax: f dan 
[9] {2} [9] 


-(q’| 1/4) exp(—2&) | a2) 
X (q2| D1*(s1z) |as) : 


oy X(dnn| D1*(snz)|q’), (1.3.10) 


AND R. C. 


O'ROURKE 


(a| *(ss) |q2)= f dq’ Í dq’’(qi| exp(—szSt)|q’) 


[9] [2] 
-(q'| Sil a’)(a"" |exp(—s2) |q). 


The expressions for F,” (z) reduce to a much simpler 
form when $Ñ; is diagonal in the q’-representation (see 
Sec. 2.1). In the case considered here, the /’,,*(z) possess 
inverse Laplace transforms, and this will vastly simplify 
our further discussions. 

The factors z” in (1.1.42) require special care; other- 
wise they are bound to cause convergence difficulties. 
To overcome this obstacle, we begin by considering 


integrals of the type 
Í dE 


o 


Í dEg(EN(E), 


| f dge) fou), 
rR (1.3.11) 


where g(Z) has derivatives of all orders for |Z] <% 
and is negligible, together with its derivatives outside 
the interval 


PARLES Ble (1.3.12) 


where KE. In the present paper, devoted to de- 
generate Fermi gases, we shall identify g(Z) with 
(0/0E)fo(E| 8,6) [see Eq. (1.4.8)’], which is very sharp 
in the vicinity of the Fermi energy E=f and has a 
half-width of order kT, so that we may choose £o=¢, 
e=kT. We clearly have eK Zp, since kT&¢ for a degen- 
erate Fermi gas. 

We then write 


2ri 


yt exp(zE) 
xf dz -trace exp(—2), 


Z 


70 


n i) 1 
Í dEg(E)N(E)= lim f dEg(E)— 


(1.3.13) 


PE 


interchanging the orders of 


and f dE. 


} Combining (1.3.10) with (1.3.13), we obtain by par- 
tial integration, using the previously stated assump- 
tions about g(Z), 


f JERES lim f 
= —0 E 


1 fa exp(zE) 


lim 


T= 


EoHe 


dEg(£) 


mé 


f aale zola) 
[9] 


Qn 


y-iT z 
ond r Eote § Qno(E 1 
+m a 
nee Eo—e OE” 2ri 
HiT ex E) 
x Í O) 
y—iT Z 


e 
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where we have interchanged 
à 


w 


Egte 
> and lim f dE. 
Ey- 


n=] T=ro ; 
flg—e 


Taking limr.,,, inside the inte: gral 
partially once more, we get 


3 


f eeatoecyms dEglE)N,* (E), 


n=O 


1 i exp(sk 
Mo’ (H=—o f. dz abe 


2ri 
a 


and integrating 


st-tn > 4 


x f dq’(q’| exp(—2v) | q) 
{2} 


Re 1 san +4 exp(zE) 
at.(E=——e[ a PA*(), 
Pri OE" Vyio z 


(1.3.14a) 


provided that 


1 rtis exp(zE) f 
— dz i dq‘(q’| exp(—290)|q’) 


Ird Uho rs zZ [9] 


1 rt” exp(zE) 
dz Enaz) 


rid yi z 


= y-two 4 


have continuous derivatives of all orders in the interval 
(1.3.12). 

Arguments and assumptions of similar type lead to 
the following conclusion: 


© E 
f dEN(E) f dE'g(E') 


=— f dEg(E) IL: dE'N,*(E!). (1.3.14b) 
n=0 
From (1.1.9), (1.3.14a), and (1.3.14b), we obtain the 
following: 


Rule for large sy »slems.—N (E) may be replaced by 


N(B EN * D; (1.3.15) 
É n=0 
where A 
NDS Z SLE then) 
“imal <: 
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and N,* (E) is given by (1.3.14) in integrals of the type 


if dEglE)N (E), 


"o 


Ea E 
Jf cevcy f dE'g E’), 


n 


(1.3.16) 


where g(E) has the properties assumed in this section. 

Since the study of the magnetic properties of metals 
depends on the computation of integrals of type (1.3.11), 
as shown in Sec. 1.4, we hope that this rule will be 
useful in studying such properties. 

The fact that 9,*(2) contains 9, exactly # times 
and the fact that the H-integrations contribute only 
in the narrow band (1.3.12) lead us to believe that 
(1.3.14a) and (1,3.14b) will fail, for example, in the 
case when $5, depends solely on coordinates and its 
eigenvalues //(q’) in the q’/-representation are much 
larger than Æo, i.e., ¢ in our case. We must clearly de- 
mand that Eoœ>| 11,(q’).. 

For the sake of completeness, we remark that it is 
also possible to approximate N(E) by a series based 
entirely on Method II, involving the operator X’(sz) of 
(1.2.19). This series should be useful under the same 
circumstances which we mentioned in connection with 
this method. However, since we shall not require it in 
this study, we shall not write it down explicitly. 


1.4. General Formulation of the Magnetic 
Properties of an Electron Gas 


The one-electron orbital Hamiltonian for our model 
of an electron gas in a metal is the customary one used 
in the theory of metals: 


Soo (r a) +Blq)+IW(q), (1.4.1) 


2mo 


where p and q are momentum and position operators 
of a particle, e and mp are its charge and mass; $ is 
the triply periodic crystal potential i in which the con- 
duction electrons move; W is the wall potential, dis- 
cussed in detail in Sec. 1.3; YW is the vector potential 
corresponding to a uniform external magnetic field 
acting on the particle; and c is the speed of light. We 
shall only assume that we are dealing with a Bravais 
lattice. The explicit form of B(q) need not be stated 
here, since we shall not require it in the present general 
formulation. 

From (1.3.9), we proceed to prove the following 
fundamental result about 9t(£) for orbital Hamiltonians 
of type (1.4.1): 

1 rte exp(zZ) 
N(E)= see dz 


Rri Vyio a fw} 
a epf- a +200} «+) j- (1.4.2) 


dq’ 


= 


nn 
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where [w] is the volume of a unit cell, » is the number 


of unit cells in Q, so that 
(1.4.2)! 


n=Q/w, 


and the q’-integration extends over an arbitrary unit 
cell in Q. 

For proof of (1.4.2), we begin by showing that the 
periodicity of 8(q) implies 


1 €é\? 
q apl- (+) +20 |j) 
2mo c 
1 ON 
= (a+ exp| -| (+x) 
2mo G 


+30] 41), (1.4.3) 


where l is an arbitrary lattice vector. 
To prove (1.4.3), we write, as in (1.1.15a), 


(eleo) 


= ny f ak exp(—7k-q’) 


1 /n e 2 
| (Zver aan) 
2mo \ i c 


+van |} exp(ik-q’’) (1.4.4) 


Xexp 


and we choose the symmetrical gauge 


A(q’)=3[q" XH] (1.4.5) 


to describe our uniform magnetic field H. 
The assumption that V(q’+1)=V(q’) coupled with 
(1.4.5) leads to the operational identity 


1 s/h 2 
exp| =] —(-verartA(a"+) ) ivan] 


2mo \1 


: Ao. 
=op -Žar axm | i 
he 
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From (1.4.4) and (1.4.5), we get 


1 e \? 
(a+ ap| -d — (r) +2] 
2mo c 


= (2) 


20 


1 /h e 2 $ 
xex] -| (Cveta) + va")|| 
2mo \1 G 


esplik axm fa”. (1.4.7) | 
CAG 


a'+1) 
e 
1 


Zaxm |a] 


dk exp| ~if 
he 


Changing the variable of integration in (1.4.7), and 
using (1.4.4) we obtain the desired result (1.4.3). 

Combining (1.4.3) with (1.3.3), where we place 
f(q’)=1 and n=1, and employing (1.1.9) and (1.1.13)’, 
we arrive at (1.4.2). Q.E.D. 

Let us now formulate the equations for calculating 
the magnetic moment of the conduction electrons in Q. 

The basic definition of the magnetic moment, M, of 
a system of N electrons, whose total Hamiltonian is a 
sum of N one-particle Hamiltonians whose orbital and 
spin parts are given by (1.1.6), (1.1.7), and (1.4.7), for 
a given direction of H, was proposed long ago by Pauli 
in his famous paper on paramagnetism in monatomic 
gases.! The definition is a direct generalization of the 
classical one and reads as follows: 


aE 
u=5(-—) IEB), (1.4.8) 
Pp oH N, T, 9 


where H=|H| and where the direction of H is held | 
constant; E, is given by (1.1.7), with j=1/2, g=2; | 
B has the usual meaning of 1/kT; | 


flE|B,0)= (1.4.8)! 


expe (E-t) ]+1 E 
and the sum in (1.4.8) involves all orbital and spin ; 
states. The thermodynamic potential ¢ is determined | 
by the familiar equation | - 


N= fo(En|8,5) (1.4.9) 


where N is the number of particles in the box Q. 

An equivalent definition of M can be obtained from 1 
that given in the foregoing. It is the well-known thermo- | 
dynamic relation ; 


j z 1 Å e 2 OF 
xep (Evet aan) van] a a) 
2mo\1 c Bia se ogo i | 
ie ; 
: p| a” axm]: (1.4.6) where the Helmholtz free energy for an electron gas 35 
ey. , Lhc 21 W, Pauli, Z. Physik 47, 81 (1927). i 
— 4 ; = 
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siven b : 
£ y a trons in metals. These equations reveal that the heart 
P= NE —— S lori a ae of the problem is the calculation of 9U(2), In real metals 
; z> atl texp[A(—£,)]}. (1.4.11) it is out of the question to compute this function in 
| closed form, so that one must resort to perturbation 

When we combine (1.4.8) and (1.4.10), we get the methods. 

~ useful result f We conclude this section by remarking that the form 


i f of Eqs. (1.4.13), (1.4.14), and (1.4.15), coupled with 
j $ Eq. (1.4.2) allows us to conclude: 
M=- (F-N0 q. (1.4.2) allows us to conclude: 

òH F—N¢, N, and M are proportional lo Q for those sys- 


lems having wall potentials W(q') with the properties 
| = ô f assumed in Sec. 1.3. 

aN Aa log{1+exp[6(t—E)]}, (1.4.12) We are thus led to results in agreement with thermo- 
| dynamic requirements for systems in equilibrium. 


where ô/ôH represents a partial derivative which oper- PART 2: APPLICATIONS 


a ates solely on those terms (coming from the E,) which 
3 L 5 p. i 
contain the magnetic field strength H explicitly; the 2:1. Simple Bandana 
implicit H-dependent terms in ¢ are to be ignored In this section we shall consider the problem of 


| Since the sums in (1.4.9) and (1.4.11) are of the form finding the function 3(*(£) of Sec. 1.3 for conduction 
i (1.1.1), we can write them as follows: using (1.1.5)and electrons moving in a periodic potential with no mag- 


(1.1.9), with j=1/2, g=2, netic field present. We shall suppose that this potential 
«=> p 1 is so weak that it constitutes a perturbation on the 
| -Ne= E T E a free-electron states, so that we may reasonably apply 
3 JE Eq. (1.3.15) to the problem with neglect of terms 
7 having > 2 in this equation. 
Xlog{1+exp[e(¢— £)]) (E+ 2hwm) To this approximation, our final answer for 9t*(Z) 
agrees with that which one would obtain by the per- 
= 3 turbation-theoretic procedures used in the familiar 
== 2 GES E|BS)SUE+ om), treatments of band theory for the case of weak binding.” 
í This result was to be expected in view of the work in 
2 Afo(E|B,s) the Appendix. 
N=- > f pene If H=0, Eq. (1.4.7) reduces to 
m=t}J_ ðE 
: 1 
KN(E+2hwm). (1.4.13) (9,0) = + Bq) + Wa), (2.1.1) 
mo 
By making a partial integration, we can transform where a 
| the last equation for F— Nç to read p= (|p). 
| We place 
: , = afo(E|8,S) pune 
F—-Ng= È ij dE JE 2mo 
am ; , Gey 
J E+2hwm Di=B(q). 
E/N (E’). (1.4.14 
5 x f. LCD) ) We now proceed to calculate No* (E), Na” (E), N” (E). 
| F From (1.3.13) and (2.1.2) we have 
: 4.14) involve the function 
| - Since Eqs. (1.4.13) and (1.4.14) involve the f See 
| i -like behavior for a 1 Ene 
0/dE)fo(E|B,0), which has a delta like be! ee So 
Cm He gas, they are quite convenient to use To*(E) ae FA d p A 
in this case. s hoai ; 
Combining (1.4.12) and (1.4.14), we get d f ala exp (— 0") «). (2.1.3) 
s 1e Aa (2) al 2my /| 
| 
| ree aH zÍ dE zf Ele) Using (1.1.15a), where we place 
i m =I a . pa 
7 E+2kunt 2 É ‘) i ass 
° EIN(E!). (14.15 Sh -vaa )=-—v_? 
x f dE'N(E’). ( ) 9) ; va Fae a> 


| 3 F z 2 See book on the theory of metals, for example, A. H. 

| z 3 (1.4.1 3) and (1.4.15) constitute the basic wilson, Th Theory of Metals (Cambridge University Press, Cam- 

j 3 ; ear z aeng the magnetic properties of elec- bridge, 1953), second editign, Chap. 2, Sec. 2.5. _ 
oundatio: 5. 


. 3 
> 
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we find that 


Cel 
J - 2mo 


= On) f dk exp(—ik-q’) 


h? 
xexn( Tava) exp(7k-q’) 


2mo 
9 sk? 2rmo\! 
=(2r)> | dk exp( =: )- ( ) zi, (2.1.4) 
n 2mo hè 


where k= |k]. 

From (2.1.4), we see that we can forget about @ in 
(2.1.3) since the s-integral converges for y>0, an in- 
equality which we suppose to be satisfied without ex- 
ception in this paper. Combining this remark with the 
formula 


1 tio 
= dze" 0+) = 


2”Er=U (E) 


1:3-5---(2n—1)m! 


2ri y~ io 
n=1,2,:--, (2.1.5) 


where U(x) has been defined in (1.1.3), and using 
(2.1.3) and (2.1.4), we obtain 


4 f2rmo \? 
m= ( ) QEU(E). (2.1.6a) 


T? j 


The usual smoothed-out orbital density of states for 
free electrons is merely 


2 


d Ju*(E) (=): Bt f 
3) = QEU (E). 2.1.6 
y = (E£). (2.1.6) 


mi 


Since H=O, the total density of states including spin 
is obtained by multiplying this answer by a factor two. 
We notice that the factor © in (2.1.6) and (2.1.6) is a 
consequence of the sharp rise of W (q’) at the boundaries 
of the-box, as expressed by (1.3.3). 

In order to proceed, we expand V(q’) in a Fourier 
series for q’ in 2, thus 


V (q’)=2() exp(ix-q’), 


K 


1 
== f dq’ exp(—ik-q’)V(q’)." (2.1.7) 
f sa“ [9] 


GG 
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If we adjust the zero of potential energy so that the 
average of V (q) over the crystal vanishes, i.e., if 


f dq’V (q')=0, 


[9] 


(2.1.8) 
we have” 
Mı” (E)=0. (2.1.6b) 
To prove (2.1.6b), we employ Eqs. (1.3.10), (1.3.13), 
(2.1.2), (2.1.4), and (2.1.8), thus obtaining 


E 


1 ð ytin eE 
m=- o f dz— 
2ri OL Yy—io Z 


x f dq’V (a’)(q' |exp(—20?/2ms) 10"), 
[2] 


since (q'|exp(—zq?/2mo)| q’) is independent of q’ by 
(2.1.4). 

We now turn to 9t2*(£). Equations (1.3.10), (1.1.13), 
and (2.1.2) imply 


9 


1 8 il ytico eE 1 
Mo* (£E)=- — —e f da—f as f a f dg” 
2 OB? 2ri Y y~ix0 Z vo 19] [9] 


XEL (x)ulx’) exp (ix-q’) 


xexplird-a”)(2n)-* f dk exp[— ik- (q'—q’’) | 


0 


Xexp[— (1—s)z(h°k?/2mo) ](27) af w : 


Xexplik’- (q’—q’’) ] exp —s2 (7k? /2m0) ]. 


Since the q’, q” integrations are over a finite volume 
Q, it is convenient to use the following familiar asymp- 
totic result: 


(a’| expl— (1— s) (2p?/2mo) ]|q”’) 


= (on) f dk exp[— ik: (q’—q’’) ] 


=o 


Xexp[— (1—s)z (#2k?/2mp) | 
1 

~-È exp[— ik: (q’—q’) ] 
Ok 


Xexp[— (1—s)(zh?k?/2mo)], (2.1.11) 


where the vectors k of this sum range over the usual 
Born-von Kármán spectrum appropriate to a cubical 


# The fact that the first-order correction to the Schwinger trace 
formula (1.2.9) vanishes under the above conditions has been 
noticed by several authors. See, for example, references 5 and 17. 
In reference 17 it is pointed out that this is not true when we 
deal, for example, with Hamiltonians with binary interactions, 
if we use Fermi or Bose statistics. 


wu? 
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PREN Tho gB 
N (E) ae een S dz EN 
2 OE? Ini J,_,, 


1 
A ds$ Y (x) (x)o(x’) 
v 


0 


ə 
1 
a 2 exp[— (1—s)z(h°k?/2mo) ] 
1 
: F 2 expl—sz(#7k’2/2mp) | 
f dq’ expli(x—k+k’)-q'] 
(21 
f dq” expli(y’+k—k’)-q/7]. 
. k {2} 
Using the basic orthogonality relation 
1 


-f dq’ expliv-q')=ô(x,0), (2.1.12) 


2419} 
we are led to 


À 1 ð? 1 ytin eE 
END — — f dz— 
IDUR Iwi de 2 


. f dZ elkel) expl- (1—s)3(h*k?/2mo) | 
0 K k 


Xexp[—szh?(|k—%|?/2mo) J. 


If we replace the sum over k by an integral 


wo 


(27) 2 f dk, 


=0 


and use the reality condition 
o(=)= 0l)" 
and the fact that 
z v(0)=0, 

as follows from (2.1.7) and (2.1.8), we find 
=F o 
e*(E)= 322m)? E le 
(E) Pel: 


: Sl pre E 
x f ds—9 — f dk, 
n 5 Yo 


£ Ini yin 


Xexpl— (its/2me) ILS) rs) k—«/2]. (2.1.13) 
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We carry out the integrations in (2.1.13) in the 
following order: (i) over z; (ii) over k; (ili) over s. 
Carrying out (i) and (ii), we have 


os 


1 tio gtk pa 
- of dz if dk 
2ri z 


y= in 5 Yn 


Kexp{ — (A3/2m) LO —s)k?+s k—x!?]} 


2 h? he 
=f awt ( £- s1 —s)Ż— r) 
s 2mo 2me 


L 


dap mE y he 
=— | —- -s(1—s)e| u(z- —s(1=s)'), 
3 h? 2ma ; 
(2.1.14) 


where we have placed k'=k— vand x=! x. 
Combining (2.1.13) and (2.1.14), we find 


2ma 
N* (E) =—-r2 (2r)? Z [o(x)l? 
h vo 


ð f 
<S dsExo?—s(1—s)x? PU lk? —s(1— s), (2.1.15) 
ðE a 


where K? = 2mo E| /h*. 

In carrying out (iii) we must consider carefully the 
cases x <2ko, KZ 2xo. We then arrive at the elementary 
formula 


1 
if ds(xot—s(1— s) PU (xo? — 8 1 — 5)x?) 


0 
K[ Ko 1 kê 2ka-bK 
ead 
2L x A Ke | 2ko— x! 


which, in conjunction with (2.1.15), yields 


mèh |o() |? 
N.*(L)= 2 log- 
trh “~0 K 


Qko-tK 


~» (2.1.16) 


2ko- Kk 


The inclusion of spin leads to no complications. In 
fact, from (1.3.34), putting g=2, j=}, w=0, we see 
that the total number of states with energy less than 
Eis bs 

N*(E)= 29" (E)2N6* (E) +27 (£), (2.1.17) 


to the degree of approximation contemplated here. 


2.2. Magnetic Properties of an Ideal Electron Gas 


The properties of free electrons in magnetic fields 
have been of interest in physics since the period where 
Bohr* discovered a special case of a celebrated the- 
orem?’ due to van Leeuwen. As is well known, this 


2 N. Bohr, Dissertation, Copenhagen, 1911. 
23 J. H. van Leeuwen, Dissertation, Leiden, 1919. Á 
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theorem states that a system of charges subject to the 
laws of classical dynamics and statistical mechanics, 
and in a state of thermodynamic equilibrium, has zero 
magnetic moment. In proving this theorem, the fact 
that all momenta are allowed in classical statistics plays 
a decisive role. 

This paradoxical picture prevailed until 1930, when 
Landau?’ proved that an electron gas in equilibrium 
in a container exhibits a diamagnetic behavior, if one 
employs a correct quantum-mechanical and quantum- 
statistical approach. This is a purely quantum-theoreti- 
cal result which is connected with the quantization of 
the “orbits.” In Landau’s work, explicit account was 
taken of the walls by a certain spatial cut-off procedure. 

Peierls! arrived at the same conclusions as Landau 
in an entirely different way using the operator method 
mentioned in the Introduction. He also treated the 
case of strongly bound electrons.”’ 

In 1930 de Haas and van Alphen? discovered that 
the magnetic susceptibility of Bi at low temperatures 
had an oscillating component with period proportional 
to 1/H. The de Haas-van Alphen effect has since been 
observed in other metals.” The work of Landau and 
Peierls only gave the so-called “normal” susceptibility, 
due to the approximations which they used. Peierls” 
was the first to show that the de Haas-van Alphen effect 
could be understood by a more refined approach. The 
whole theory has been further perfected by Blackman,” 
Landau,” Akhieser,® Rumer,* Sondheimer and Wilson,’ 
and Dingle. 

With the exceptions of Peierls and Sondheimer and 
Wilson, these investigators employed the Landau cut-off 
procedure mentioned previously. The studies of Os- 
borne and Steele, Dingle,’ and Ham?” have dealt with 
the validity of such procedures for treating the walls. 
It appears that they are allowable for system such that 


eHR 
—_>m 0UFermiy 
C 


(2.2.1) 


where ?¥ermi 1S the velocity of the electrons at the Fermi 
energy, R is the radius of the container, and the re- 
maining symbols have been defined in Sec. (1.4). 


261. Laudau, Z. Physik 64, 629 (1930). 

27 See also R. H. Wilson [Proc. Cambridge Phil. Soc. 49, 292 
(1953)] for a treatment of steady susceptibility of metals by 
mW eh gaara ppreach. 

WV. J. de Haas and P. M. van Alphen, Proc. Acad. Sci. - 
stgrdam 3 1106 £1930). phen, Proc. Acad. Sci. Am 
ee, for example, D. Shoenberg, Phil. Trans. A245, 1 

2R. Peierls, Z. Physik 81, 186 (1933). ; Dao 

ZM. Backman TELO.. Roy. Soc. Tondon) A166, 1 (1938). 

. Landau, Appendix to D. Shoenberg, Proc. Roy. A 
` (London) A770, 347 (1939). pease es 

33 A. Akhieser, C. R. Acad. Sci. U.S.S.R. 23, 874 (1939). 
(1948) B. Rumer, J. Exptl. Theoret. Phys. U.S.S.R. 18, 1081 

36 R. B. Dingle, Proc. Roy. Soc. (London) A211, 517 (1952). 

3 R. B. Dingle, Proc. Roy. Soc. (London) A21?, 47 (1952); 
A216, 118 (1953); A219, 463 (1953). 

37 F, S. Ham, Phys. Rey. 92, 1113 (1953). 
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We believe that the wall-potential method in Sec, 
(1.3) and (1.4) is equivalent to the foregoing cut-off 
procedure, because our final answers agree with those 
in the references cited previously for the case of per- 
fectly free electrons. 

In this section, we shall adopt the one-electron orbital 
Hamiltonian in (1.4.1) with 8(q)=0, we shall take the 
direction of H along with q;’-axis, and we shall use the 
symmetrical gauge (1.4.5). We then have 


h h 
se(—ve.a') =—— Vy Hamo g +g] 
Y 


2mo 


+oLs-W (q), 


h 0 ð 
r= a- s>] 
iL ðq? ðq 


(2.2.2) 


where 


We shall take (2.2.1) as our criterion for “large” 
systems, so that we shall suppose, in particular, that 
(1.3.9) holds when (2.2.1) does. Since we shall only deal 
with systems for which this is the case, we need not 
mention it explicitly in what follows. 

Employing (1.1.15b), (2.2.2), and (1.3.9), one can 
write for the orbital number of states with energy less 
than £ 


1 ytio gE 
M(B) f dg —P dz— 
my A ae z 


(2x) f Akola) 


A 
xexp| — |- — V +o imw (q+ | | 


2mo 


Xexp(ik-q’). (2.2.3) 


It is clear that the Hamiltonian in (2.2.2) is invariant 
under rotations about the g;’-axis, so that L commutes 
with H. This leads to a happy simplification of the 
exponential operators, i.e., 


h? 
exp | — |- Vg? Hmo lgo”) toh] | 


2mo 
=exp(— zwLs) i 
h? 
Xexp| -d -Zve Imalaia) | | 
mo 
=6. ; e (2.2.4) 


It is convenient to introduce the so-called natural 
units as follows 


h i 
(q1',92',9s') =—(E1, E23), (2.2.5) 
mow 
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so that 
te ô 
" m=-i—, k=1,2,3, 
Ok, 
A [r g]= —16x1, 
and 
a OL3=he(Eyre— Eom) =hwL,, 
Combining Eqs. (2.2.4) to (2.2.6b) we get 
hwz 
O=ex(- 5 D) exp(—ñwz£,) 
hus 
xen] -= mitn] 


hws 
xexo| e+ | 


9 


where we have also employed the fact that the four 
operators £3, m3, mité, T2 +E? commute in virtue 


of (2.2.6a). 


Using (2.2.5), (2.2.7), and the change of variables 


o= (f/m) ik, we obtain 

co} h? 
(on) f dk exp(—ik-q”) epl -d ->v 
—> 


2mo 
A 


tobrtdmas(o+o')] | exp(ik-q’) 


j a] 
= (=) CO do; exp(—iasés”’) 
h —0 


hws /1 ð \? A 
xep- (- —) | exp(ioséd) 
INTI 


Xexp(—wz£s) | (ony f doy exp(— ioii") 


2 


hoz /1 9 ) Jj EPEn 
E aa | fe E exp(iork’) 
Xerp| 7 G an) 


x (on) f dos exp(—io2é2”) 


—0 


; xexp| Coa 


2 
We shall require two iden 


exp (10.23)F (Engages) = F(E), 


tities. The first is 


== ? Zyre) expliass)| 
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where 
t” = cosO&,— sinta, 
&)* = sinbi -H cosb£o, 
E = £5, 


and where 9 is a complex number. Equation (2.2.9) 
expresses the fact that exp(i@£,) is a finite rotation 
operator corresponding to the (complex) angle 8. 

The second identity is 


(2.2.6a) 


(2.2.6b) 


(EIB g.-0| €)= (27) afi da exp(—iat’) 


huz /1 0* 
xexp] -“( -) +e] explisg) 
2 i dé 


= (2r sinh(fw2))—3 
1 fuse 
xexp| — [eter tanh (=) 
4 2 


huss 
+(¢—prenth(—)]], (2.2.10) 


which follows by transformation theory from a cele- . 
brated summation formula discovered by Mehler.” 
Equation (2.2.8) becomes 


(2.2.7) 


mwN? Ă 
( =) (2ahwz) exp(—hwzL;) 


h 


Xlir |D geno! t) (Eo |B s-no) E2) 


moa $ 
-(=) - (2rhwzy 
h 


-(é|B¢.2.0| cosh(hws)t1—1 sinh (wz)£}z 
7 li| P s-u-0! i sinh (wz) £:-+ cosh (ftwz) £2). 


Substituting (2.2.10) into this equation and taking 
the limits &;/—>£1; £2’ —&2 in the final results leads after 
some manipulations to the following: 

- ie 
(ony f dk exp*—ik-q’) ep -d -—vy" 


2mo 


FoLrtHimat (gg) | exp(ik-q’) 


i 2 = ae 
i) aak 


(2.2.8) 
This result for the coordinate diagonal matrix elements 
of the density operator is independent of coordinates as 
(2.2.9) it must be for a “nonlocalized” particle.in a box. 
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Equations (2.2.3) and (2.2.11) imply . 
hes 


(E) (= ) Q prtm dze 
w= — f OY AN 
It 2ri 252 sinh (#ws) 


yt 


(2.2.12) 


where we no longer require the principal value sign. 

As mentioned in Sec. 1.1, once N(ÆE) is known for 
separable systems of the type considered in this paper, 
the question of determining their equilibrium properties 
is virtually solved. 

We now proceed to compute the magnetic moment of 
our electron gas. We shall omit the straightforward but 
lengthy contour integrations since they are available in 
reference 2. We wish to emphasize the circumstance 
that our method of computing the expression for the 
number of orbital states having energy less than Æ in 
(2.2.12) is completely different from the one in this 
reference. 

From (2.1.12) performing the indicated contour inte- 
grals, we find 


i 2mm? Q p eË howz 
Í abo EYE ( ) . f dz 
a I? Qari z7! sinh (hws) 


y— io 
2rmoy * rif EN} 
ESE 
h? 3 \pH 
8 E5 172 
-O 
15 pH TY 


E 1 y 1 
xesp( — y) =-->- | 
uH y 6 sinhy 


je [7 1 


stj ——-p |UE), (2.2. 
m=) (na)! osm pH Al ( ) ( 13) 


where p= eh/2moc. 
We shall now require the asymptotic formula 


i eer) ee 
LS exp] —\L— —J (E ? 
a i pH 20E 8) 


, (2.2.14) 


Bull ml 
sinh Fal 
Bul 


valid when |a|<¢ and B¢>>1. We shall identify a with 
-tyH in the following, so that both inequalities will 
hold for degenerate electron gases subjected to the mag- 
netic fields currently available: 


AND 
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Combining Eqs. (1.4.13), (1.4.14), (2.2.13), and 
(2.2.14), we conclude the following : 


2rmoN ? $ a? {E—uHy 
xy=-( ) oemt f a|- E 
I 6 (uH): 


—o 


r (E+ul > 4 {E-H} 
6 D 


m? 2 
3 (H)?! 


E 
mas 
4 {E+pH}} ae uH 
0 


H-ra - dy 
3 (WH)? muH) 


( 1 1 ) 
X{ -—-y-——_ ) |- 


y 6 sinhy 


yo! 


4r? œ 1 


Irmo  ? S mi {E—pH}} 
paves ( ) Qu H) f arl- “GE 
2 = 6 pH)} 


12 


m {E+pH}} 8 mè{E—uHy. 


6 (uH) 30 (uH). 
8 {E+uH)*2 2 2 
arre + cosnt f dy 
30 (uH)>!? T 0 
( E \(- 1 1 Ofo( Z| 8,5) 
Xexp ey, === )| 
uH y 6 sinhy ðE 
ne il 
A cosr( ——- 
4r » 1 uH 4 
DTR (2.2.16) 
BLH n=1 n? nT? 
sinh] ==] 
Bull 


where {x} =0 for «<0 and {x}=- for x>0, so that the 
Greece powers of E--pH in (2.2.15) cause no diffi- 
culties. 


The results for the magnetic moment are given below: 


M=M1-M», (2.2.17) 
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where the meaning of M, 4 i i 
leaden Mi and Mo is explained in the Let of, denote the energy eigenvalues corresponding 
My, represents the nc i the peretbocied he ae a oe 
resente ənoscillatory, or “ 9 A i Prete 7 OF Du WO A 
ee sclilatory, or “normal” portion the perturbation changes from 0 to some “sisal Ge 


1 
KE, (2.2.18a) 


£ 


1 

uH, 

B 

it dominates M». If inequalities (2.2.18a) hold, then 


MM pati tM bandau, 


P 8 mmo! 
M vandau= a, CONE 


5 > l 


(2.2.19) 


M pawi= —3M tandav, 


where fo is the Fermi energy for T=0 and H=0, 
M Landau is the orbital diamagnetic moment discovered 
Sa i 
by Landau and M paui is the spin paramagnetic mo- 
ment pointed out by Pauli?! 
If 


1 
r /BuH ži, S (2.2.18b) 


then Mo, which is given by 


i 3r [6 \3 » 4 pH 
M x= — —) > ——.,_ (2.2.20) 
BpH \pH 2] 


is appreciable, and may even dominate Mı. From 
(2.2.20), we see that M» exhibits characteristic oscilla- 
tions whose frequency is proportional to 1/H. These 
oscillations are qualitatively of the same type as those 
observed by de Haas and van Alphen.”* 

We shall say nothing concerning the ingenious, but 
rather artificial elaborations of the foregoing work using 
effective mass concepts,” although we could easily 
include such schemes in our calculation of N(E). We 
hope that the perturbation-theoretic procedures in Secs. 
1.2 and 1.3 will aid in approaching the problem of cal- 
culating M on the basis of the Hamiltonian in (1.4.1), 
without the introduction of these concepts. 


APPENDIX 


Let É, ©o, and ©; have the same meanings as in 
Sec. 1.1, so that 
$ (A.1) 
were ©, is “small” compared with ©». We suppose 
that both Go and ©, have pure-point spectra. 


E 
. 


£ =S 


ator, the o£, go to eigenvalues £,* of ©. 
We shall now calculate 


trace exp(—2))= 0 exp(—ck,* (A.2) 
for Re{z} >0 in terms of 
trace exp(— 20o) = expl —0Ep), (A.3) 
p 


which corresponds to the unperturbed problem, and 
AE tS Ep Ep (A.4) 
From (A.4), we have 


exp(—3/,") =exp(—2/,) exp(—zSE,") 


a (—)* 


=exp(—2o£,) ),——a"{AE,*)™. (A.5) 


n= y! 


Let AE,, -“ (r=1, 2, ---) be the contribution to AE,“ 
in (A.4) according to rth-order perturbation theory, so 
that 


AE = EAEn. (A.6) 
rel 


From (A.2), (A.3), (A.5), and (4.6) we obtain 


trace exp(—2)) = trace exp{—2o) 


=F exp(—208,)AE, HE exp(—aoEp) 
p P 


2 
m2 


x -=sAEp + (AEn 1°) +0: (A.7) 


where O; corresponds to terms containing products of 
AE,“ such that the sum of their 7’s is greater than 2. 
It is easy to see that these terms involve products of 
three or more matrix elements of 3. 

It is well known that if we choose a set of eigenkets 
| pa) such that ` s 


(pa! 1! p8) = ĉas pa’ Dıl pa) 


ə 
for every fixed p, which we can always do by appro- 
priate linear combinations of the g(p) linearly inde; 
pendent eigenkets of Dp belonging to «Ep, then . 


AE,, t= (pa D1) pa), 


*(A.8) 


3 5 a | (pal D1! p'B) |? 
Aas xpanding = a = 5 
aS z ive the normal magnetic moment by expanal AE, s= A tidy 3 (A.9) 
ane an Ae a Aee of H up to quadratic terms esing nt Ts a 
(4.15). $ j S 
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In order to complete our derivation, we shall also 
require the following formulas: 


DY exp(—20H,)AE,1*= trace{H1 exp(—2o)}, (A1 0a) 
p 


ad) exp(— Ep] (pal Or] pa) |? 
P 


a expl—2Ep) 
TOE Sareea 


p'tp lop — oltp 


| (pa| Dıl p’a) |? 


1 
=z acel Di exp(—269) | ds exp(sz%o) 
0 
- 91 exp(—szHo) l. (A.10b) 


To prove (A.10a), we merely evaluate trace{ 1 
exp(—zo)} in terms of the basis | pa), using (A.8) 
and (A.9). 

To prove (A.10b), we employ the two last equations 
as follows: 


1 
strace| exp(—20o) f ds expl + szo) Hi exp(— sz%o) 
0 


a, ĝ 1 
=s} (pal $1128) exp(—toEy) | ds exp(szoEp’) 
Pip’ 0 


AND 
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Hil pa) exp(—sz0Ep) 


-(p'B 
=25" | (pa| Kıl pa)|? exp(—20Hp) 


2 exp(—20/2p)— exp(— 20 p’) | 
Ep —oEp i 


+E |(pa| Sil p’) 


p'#p 
which can be readily seen to be equivalent to (A.10b). 
Q.E.D. 

Combining Eqs. (A.7), (A.9), and (A.10), we arrive 
at the following result: 


trace exp(—2) = trace exp(—z%0) 


—z trace{ Ó: exp(—20o)} 


i 
+42? trace| Di exp(—2%9) | ds 
0 


Xexp(sz60) D1 exp(—szHo) | +0;. (A.11) 


Comparing (A.11) with (1.2.9), we see that the series 
for trace exp(—s) computed by means of degenerate 
perturbation theory, neglecting Os lerms, agrees with the 
Schwinger trace formula, throwing away terms with n23 
in the latter. 

It is of interest to compare the methods of this 
appendix with those of Peierls. 
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A number of methods, suitable for com 
lected, ntost of them previously well know 
together cover all Positive values of p and n 
adapted for direct numerical computation. 


putation of Coulomb wave functions with high accuracy, are col- 

n. It is shown that the regions where these methods can be used, 
PS f 

in the case L=0, All formulas are presented in a form well 


a ° 1. INTRODUCTION 


S is well known, Coulomb wave functions appear 
: in many problems of great physical interest, espe- 
cially in Scattering problems involving charged particles. 
It is quite natural that many articles have been pub- 
lished on this subject. In the beginning the interest 
centered around obtaining different representations of 
the functions, and in this connection we only mention 
works by Sexl! and by Yost, Wheeler, and Breit.? (An 
extensive bibliography can be found in the NBS table.*) 
During the last years one has become more and more 
interested in questions concerning numerical problems 
related to these functions. This is not surprising since 
a physicist working in this field is likely to desire an 
answer to the question: With a given set of values of 
p, n, and»L, how can the numerical values of the func- 
tions and their derivatives be obtained? Do tables exist 
for this region, and, if not, which method should be 
used in order to obtain reasonably accurate values with 
a reasonable amount of work? It is the aim of the 
present paper to give an answer to these questions, and 
the given formulas will in general admit at least 5-6 
digits accuracy. 


2. NOTATIONS 


The differential equation, written in standard form, is 
d?y/dp?-+ (1—2n/p—L(L+1)/e’)y=0. (2.1) 


We suppose throughout the paper that p and ņn are 
positive and L a positive integer. The solutions are 
chosen in the following way: Fz(n,p)=0 for p=0; the 
amplitude of Fr>1 when p>. It can be shown that 


L . 
F y~sin [- 7 log2p— are -| =sinĝ z, 


7 j Then the other 
where =argl (in+L+1), when po. The te 
aE (np) is defined by its asymptotic behavior: 
Gi cen when p>. A useful integral repre- 


1p See, Z. Physik 56, 72,1929) 0 yy cesta 
a3 Vase Wine and Breit, Phys. Rev. o 1 ek sai 
3 Tables of Coulomb Wate F anie: Vol. I. NBS, Appl. > 
Series 17 (Washington, D. C., 4952). 
D 


ES 


sentation is the following: 

Fr+iGr=ie"[(2L+1)!C pf! 

f U-in(t-+ Zip) +n etd (2,2) 
9 

with 


Cr= (24/(2L+1)!)- (149?) H 
{Qan/(277—1))5. (2.3) 


Further D; is defined through C,D,=1/ (20-41). 

In what follows, F’ and G’ stand for dF /dp and dG/dp, 
respectively. Further ¢, and ©, are defined through 
Fr=Crp" $1; Gr= Dro "Or. 


3. RECURRENCE RELATIONS 


The Coulomb wave functions depend on three vari- 
ables p, 7, and L. However, by making use of the 
assumption that Z is an integer, it turns out that, e.g., 
Fr+ı can be computed if Fz and Fr’ are known, and 
consequently, it suffices in principle to compute Fo, Fo', 
Go, and Go’. Recurrence formulas, obtained from the 
integral representation, were given by Powell.‘ It was 
pointed out by Infeld, however, that the recurrence 
relations only depend upon the differential equation 
and can be obtained as special cases of a factarization 
method.® 

The following elementary proof may also be of some 
interest. We try the representation 


ur 1=(atb/p)ur-+eur’ 


where wz and u ae are any solutions of the-correspond- 
ing differential equations. Ther? making use of the 
differential equations for wz, and uz, the following 
condition is easily obtained,: 

{aL—cn+ (b—cL)(L+1)/p}uz— (b—cL)uz'=0 ; 


from which we conclude 


utr=const{ (n/L+L/p)ur+ur’}. 


«John L. Powell, Phys. Rev. 72, 626 (1947). 


5L. Infeld, Phys. Rev. 72, 1125 (1947). i 
tL. Infeld and T. E. Hull, Revs. Modern Phys. 23, 21 (1951). 
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The constant is the only thing which depends ex- 
plicitly on the definition of the solution. The following 
relations are the most important ones (wz now stands 
for Fz or Gr): 


(HLY ur L= (n/L+L/p)ur tur 


t (Z+-1)) una E1) 
= (n/ (L+1)4+ (L+1)/p)ur— tr’ (3.2) 


HH uma (LF 1) + L?)hya/L 
= (2L-+1)(n/L(L+1)+1/p)uz. (3-3) 


To these formulas we may add the Wronskian relation 


(3.4) 


(3.1) 


F,/Gi—F 1G1/=1. 


Substituting Fz’, Gz’ expressed in Fr, Fz and Gr, 
G1, respectively, from (3.1) we obtain 


Fr1G1—GriFr=L/P+L)}. (3.5) 


Both relations are useful for checking purposes. 
From (3.1) and (3.2) we can form a pair of equations 
in n= uL Hur; =U — UL; 


{L/p— (+L) —n)/L}ot o =0 
{L/p+((n?-+L*)*+-n)/L}v2-+01'=0 


which integrated numerically will give 4z, “1-1, %# r, 
and uz at the same time. 

As is shown in reference 3 the recurrence relations can 
be used over a large range of values of L without too 
serious accumulation of error. A practical scheme is 
presented in a recent paper by T. Stegun and M. 
Abramowitz." $ 

In what follows we restrict ourselves to the case L=0 
unless otherwise is mentioned. 


4. TABLES 


The most extensive table so far is the NBS table, 
prepared under the direction of M. Abramowitz. This 
table deals with the function Fz for Z=0(1)5, 10, 11, 
20, and 21; 7=—5(1)5 and p=0(0.2)5 with complete 
interpolation facilities in 7. Most of the entries are 
given with an accuracy of seven digits or more. Further 
the volume contains three extremely useful auxiliary 
tables: R.P. (T'(1+in)/T (1+in)); co=argl (1+in); Co 
[defined in (2.3)]. ` 

Recently a small table containing Fo, Fo’, Go, and Go’ 
for p=2n has also been puklished.® 

These tables have partly superseded a previous one?! 

_ which deals with the regular as well as the irregular 
function for L=0(1)4, 0<n <4, and 0<p<6. The accu- 


1T. Stegun and M. Abramowitz, Phys. Rev. 98, 1851 (1955). 

8M. Abramowitz and P. Rabinowitz, Phys. Rev. 96, 77 (1954). 

? Bloch, Hull, Broyles, Bouricius, Freeman, and Breit, Phys. 
Rev. 80, 553 (1950). 

10 Bloch, Hull, Broyles, Bouricius, Freeman, and Breit, Revs. 
Mod. Phys. 23, 147 (1951). 
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racy lies between 0.1 and 2.2%. A small table of Go and 
Gy for 0<p,n <1 is given in a previous work." 

Recently a “skeleton table”? of fx, gz, fx’, and gr! 
[defined in (5.2) and (5.5) ] has been published for 
L=0 and L=5 and for pm=0(1)10. The table also 
contains the five first reduced 7-derivatives. Unfortu- 
nately this table has some drawbacks, mostly due to 
the method of computation [numerical quacrature, for- . 
mulas (5.2) and (5.5) ]. First: for high values of 7 and 
small values of p the accuracy of the regular function 
and its derivative is unacceptable. Second: in many 
cases interpolation in 7 with Ay close to } is not suffi- 
ciently accurate. (Numerical example: L=5, p=9, 
n=5.5: £5=32.23474 as obtained from n=5 and g= 
32.16459 as obtained from 7=6, the correct value prob- 
ably being about 32.249.) Third: the functiona: values 
as a tule are given with lower accuracy for L=0 than 
for L=5. 

The irregular function does not present any special 
difficulties in this respect. Throughout the region it is 
given with an accuracy of 5"-6 digits for L=0 and of 
7-8 digits for L=5. If necessary at least one extra digit 
can be obtained for L=0 by taking the corresponding 
values for L=5 and using the recurrence relations. The 
regular function on the other hand presents a more 
complicated picture, and the scheme of Fig. 1 is rec- 
ommended. 


5. INTEGRAL REPRESENTATION 


From (2.2) and (2.3) we easily find the following 
formulas: 


FL=4p "fL; Gr=Arp™™ gL 


(5.1) 


with 


e] 


fr= Í (1— thE) cos (ptht—2né)dé (5.2a) 
0 


gL= f (14E) Ze-ett2n arctetd £ 
0 


= f (1— dF)“. sin(ptht—2nt)dé (5:2b) 
0 


and 
Ar=2-4(1—e%1)3 


{2a (LHn) (2n) - (+r). (5:3) 
In particular we have for L=0: 
Ag={ (1—e**")/2am}}. (G2) 


Formula (5.2a) can also only be used in the transition re- 
gion, while (5.2b) can be used in the whole region between 
the transition line and the line p=0. The derivatives 


uC. E. Fröberg, Arkiv Fysik 3, 13 (1951). 
2C, E. Fröberg and P. Rabinowitz, Tables of Coulomb Ware 
Functions; NBS, Report 3033 (1954). e 
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= n WwW SMa a No o 


de 3/5415) e Iodo 
Fic. 1. Regions where different methods for obtaini 

function are recommended. a: Use the 

Use the table for L=0 in reference 12 


$ 


pene 12 and the recurrence relations, A: Use 7, 10, 12. C: Use 
wi l d ti 


can be computed directly: 


fi=- f tht: (1— ikeji sin(ptht—2né)dé  (5.5a) 
0 

s=- f ELL 2) Leeks arctegde 
0 


-f thé (1— ih) +- cos(ptht—2nt)dt. (5.5b) 
0 
Note that, e.g. F1’ =A rp ™{ fr'+ (L+1)f1/o}. 


1 
50 


40 


30 
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The great advantage of these expressions is that the 
successive reduced derivatives with respect to q can be 
obtained without difficulty. More extensive computa- 
tions, however, are prohibitive unless a fast electronic 
computing machine is used. [t turns out that (5.2) and 
(5.5) in general only need to be used in region D (Fig. 2), 
and even here one has a choice between these quadra- 
ture formulas and numerical integration of Eq. (2.1). 


6. NUMERICAL INTEGRATION 


If a large number of functional values with the same n 
but different values of p are needed, it might be con- 
venient first to compute a key value and then integrate 
numerically (using the higher derivatives which are 
easily obtained). Otherwise this method should be used 
only for the region D in competition with the quadra- 
ture, mentioned in Sec. 5, and possibly very close to 
the transition line p= 27. 

In this connection it should be observed that both F 
and G vary extremely rapidly to the left of the transition 
line, and this fact strongly affects the errors due to 
truncation and round-off. For example, when integrating 
F in the direction of decreasing values of p, a small error 
means an admixture of G, and since G is rapidly increas- 
ing, the effect will be that F will soon disappear com- 
pared with the error. 

If it is necessary to integrate numerically over wide 
ranges in this region, the following transformations are 


A 30 40 50, 60 70 
¥ 10 ir x 
j . i thods of computation should be used. For closer explanation, _ y 
Fic. 2. Regi ons bere Grok Table I at the end of the paper. 5 oe 
> a ; 5 . à 
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suggested : 
vi=loglk,— (L+1) loge—logC z} 
Wo=pl'1//P,—(L+1) 
s=logG14+-L loge—logD 1, 
Wa=pG1'/Gr+L. 
Then we obtain the equations: 
Y =y2/p 
ee 2n—p—W2?/p— (2L-+-1)y2/p 
eee” 

Wa = 2n— p—We/p+ (2L+ 1)s/p 
with the initial values ¥1(0)=y¥2(0)=¥3(0)=y4(0)=0 
and the Wronskian relation 

(Wo— Wat 2L-+1) et S201. (6.3) 
The functions y vary much more slowly and smoothly 


than do the functions F and G. As an example we give 
the following values for 7=100 and L=0: 


¥(0)=0; yr(1)=23.0092; (4) =50.0988. 


(6.1) 


(6.2) 


7. THE POWER SERIES EXPANSIONS 


The essential formulas for the power series expansions 
were given by Yost, Wheeler, and Breit? and are col- 
lected in a compact form in reference 3. Here we will 
present them arranged in a way which is convenient for 
numerical work. 

First we introduce some notations. 


oe (1+7) (447°) - - (Lr) - 22% a 
0 ny PL (2-41) TCD rp . 
wr=log2p+g1/pr (7.2) 


where the general expression of gz/z can be found in 
reference 3, p. XVI. The first six values are given below. 


qo/ Po= f(n)+2C—1- 

q/ p= f(n) +2C—11/6+1/4(1+7°) 

q2/ po= f(n) +2C— 137/60 

+ (47+ 13)/8 (1+7?) (4+7?) 
363 6ni +69? +150 
140 80E (4+7) (9+7?) 


4 


qs/p:= f(n) +20— 


FRÖBERG 


Here f(n) = R.P. (1 (1-+in)/T (1+ in)) (tabulated in ref- 
erence 3) and C= Euler’s constant =0.5772156649..., 
The expansions then run as follows: 


(n41) (n+2L+2)Bn1= 20 BaP Bn-i (7.4) 


(n41) (n—2L)Pn+1= 2npPn— PP n1 

E Fae Ba (7.5) 
Po=1 
P2141=0. 


Bo=1 
with and 


By=np/(L+1) 
We now define six quantities, B, S, PR, Q, and T: 
B=5 B: (=$1) S=LRBe 
P=) P; R=J kP; 
Q=pw "B+P (=01) 
T=p {O+ 264 lor B+: S}+R 


(7.6) 


and from these values we can obtain the functional 
values directly : 


Fr=C1p-B 

Fr'=C rp"[(L+1)B+S] (7.7) 
Gi=Dip *-Q 
G1'=D1p-*“"[—-LO+T]. 


It is also interesting to note that the functions x, Y2, 
Ws, Ws introduced in Sec. 6 can be written in a very 
simple form: 


yı=logB 
y:=5S/B 
TO (7.8) 
W=T/Q. 
The Wronskian relation has the form 
BQ+(SQ—BT)/(2L+1)=1. (7.9) 


In particular we have for L=0, 


qs/ pa= f(n) +2C— Aan (7.3) | (n-++1)(%+2)Bni1=2npBr—p’Bn-1 (7.10) 
> X. 
r 16744287 +31247?-45637 ninti) Prr= 2m0 Pa— Pra (2n-+1)2npBn (7.11) 
16(1-++7?) (4+7) (9+7) (16+7°) with Bo=1, Bi=np; Po=1; P:=0. Further: 
83711 Q=2np(log2p-+-2C—1+ f(n))B+P 3 
qs/ps= f(n) +2C— T=2 
27720 met B+ (log2p+2C—1+ f(n))(B+S)}+R 
2045+ 1020n9-+16680n!-+981057?-+ 158295 Fy=Cop-B; Fy! =Co(B+S) 3 
eee E ST - .12 
_ 16(1+97) (4+7) (9+7) (16+) (25+) Go=0/Co; GY=T/Cop Ga 
r : c 
3 A CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar 


Digitized by Arya Samaj Foundation Chennai and eGangotri 


COULOMB WAVE FUNCTIONS 403 
| 


It is easy to see that the power series expansions should 
not be used if mp or p? are very large. A reasonable 
choice seems to be 


mpS50; p<10 (7.13) 


° which corresponds to region A in Fig. 2. If p=5, n=10 


about 20 terms are necessary to obtain 7 digits accuracy. 
However, because of the very simple formation law of 


the successive terms, in some cases it may be reasonable 
to go beyond these limits. 


8. THE BESSEL-CLIFFORD EXPANSION 


Expansions cf both regular and irregular Coulomb 
wave functions in terms of Bessel-Clifford functions 
(i.e., essentially modified Bessel functions) are impor- 
tant for large values of 7 and small values of p. Such 
expansions have been obtained by Breit and Hull." 
and by Abramowitz.!*!7 We shall here give these ex- 
pansions in a form more convenient for numerical 
calculations. 

In reference 16 Abramowitz derived an expansion for 
$1(n,p). We shall first give the corresponding expression 
for the irregular function. In Eq. (2.1) we make the 
transformations: «?=8yp; y=Gr=%-T z, from which 
we get z 


rL 1dr (QL+1) æ 
+ (14 


14 )r: 0. (8.1) 
dx x dx x 16n? 


If the term «7/167? is discarded, we obtain an equation 
with the general solution: ¢)Jor;1(%)+¢2Ker41(~). Now 
it has been proved by Yost, Wheeler, and Breit that 
for small values of x 


(2n)” 
(2L)! 


and this leads us to try the following expansion: 


Gi~ 


Dy-%-Kor+1(«) (8.2) 


Tr=Kerpit aa Kors. (8.3) 


=l 


Then we find: 


In reference 17 Abramowitz has obtained expansions 
for d, in terms of I„(x) and for ©, in terms of K,,(x) 
(n=2L+1, 2L+2, ---) with the same coefficients ex- 
cept for a factor (—1)*. It is interesting to note that 
in these formulas ©; must be provided with a numerical 
factor (41), while in the other set of formulas (refer- 
ence 16, Eq. (3.15)) and formula (8.5) which has just 
been proved, such a factor must be appended to ġz. 

From now on we restrict ourselves to the case L=0. 
First we introduce some new notations: 


Fo =CoMo 
| Go= Do 2 ro 


Gol = Do 4n No 


l Fo=Col4n) £- Ag 


(8.6) 


As is easily found we have 


d 1 d 1 i 
M= (+) and No= (—+-)ro 
dx x% dx « 


Putting e= (167) we can write the expansions in the 
following form: 


oe ce x8 
sard- nle I+ r: 
6 10 72 


KH pË x? 
+4 tr n] 
14 60 1296 


2 


A Ti Pat x12 
+e Ia+ I+- Inot Ta |+ +++ (8.7) 
18 4200 720 31104 


w oe aye 
To=Kit {=x.+ d|- Tka] 


10 72 
xT x8 x 
t| Ke Ket K: i! 
14 60 1296 


x2 71x gl al? 
t|- ket Ky Kot Kul +: 
18 4200 720 31104 


ay=L/16n2; a2= 1/9692; a= L(L—1)/512n!; (8.8) 
32n? (s+4)asp3=2(2L—s—2)tsy1+ 4s; 3 4 y 
s= il, 2, 3, eis es (8.4) M= To— qer+n]+ font ernn] 
iS) 
A remaining multiplicative constant is easily found to zi a pe i 
be unity. Thus we have z efer tT E E A 1] 
L 210 360 1296 oe 
OD tm : 5 ; : 
A G= -Dr-x-TÈ. (8.5) 124 299 È 
QE)! teler ent xT 1 
13 G. Breit*and M. H. Hull, Jr., Phys. Rev. 80, 392 (1950). 315 6300 
14 G. Breit and M. H. Hull, Jr. Ee Pe oan (1950). a 7 a 
7 A vi at. Appl. 2 „d, 19 s t è 
i M, Aea E Eye 29, 303 oy Se Siva L =|- <- (8:9) 2 
-| M. Abramowitz, J. Math. "Phys. 33, 111 (1954). 3240 31104 . a 
i CC-0,In Public Domain. Gurukul Kangri Collection, Haridwar > : 
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6 


43 
No= — Kot ESI 


4 an 
+ e|- a AK yt+-—a° Kh 5— r] 
15 7 | 


2 


71 11 a 
+ e| 6 x7K7-+—aSK3 Ko] 
210 360 1296 

124 299 
-+ e|- xSKs+— aKo — XK 10 

315 6300 

7 nl? 

wR Ka FOG (8.10) 
3240 31104 


Here J,, and Kn stand for Zn (x) and K,,(x), respectively. 
As is easily found, the Wronskian relation takes the 
form: 


M(o—AoVo= 1/x. (8.1 1) 


These formulas look very attractive, but there are some 
disadvantages which make them less useful. First: only 
Io, Jı, Ko, and K: are tabulated so far, and all the other 


FROBERG 


functions must be computed by the usual recurrence 
relations. Second: the present table!’ runs from x=0 to 
x=10, and thereby we have the restriction 8np< 100. 
But the region defined by this inequality is completely 
included in region A, where the power series expansion 
can be used conveniently. Thus we conclude that for- 
mulas (8.7)-(8.10) should be used in this form only 
occasionally. 

If +> 10 one must turn to the asymptotic expansions 
for I, and K,, (see, e.g., reference 18, p. 271). Then it is 
possible to derive series expressions containing positive 
and negative powers of x where the coefficients are 
power series in e. The calculations are straightforward 
and we only give the result. 


Put 
Guan 
b=} (—1)'a,x" 
c=} (r—1)a,a" 
d=>>(—1)"(e—1)a,a" 


where the summation is extended over a suitable num- 
ber of terms: 7=0, +1, +2, ---. The coefficients a, are 
given below. 


(8.12) 


a= 1—0.051269531¢e—0.135408014e— 1.234512428— 23.0925e4— - - - 
a= — 0.13671875e—0.250589848e?— 1.977075é—33.8186e'— - - - 
a2=0.3125e+0.469676971e+ 3.550181384- 59.45395e4+ - - - 
a3= —0.166666667e—0.34777832e— 2.66566868— 44.86142¢!— - - - 
i a= 0.1900390625e+-1.433726238+ 24.03583e'+ - - - 
i as= —0.071875e2—0.582816628— 9.88621 &— - - - 
| as=0.0138888889+0.18319626+3.258536-+ - - - 
i ar= — 0.044862454566 — 0.88331286¢‘— - - - 
i ag=0.00792824074e+-0.197876088e'+ - - - 
| a= — 0.000771604938e— 0.03625545e!— - - - 
@o=0.0052594e!-+ - - - 
ay,= —0.00055218e+ - - - 
@32=0.0000321502¢'+ - - - 
a_ı= —0.375—0.052871704e—0.2059647e—2.293358—49.24e'— --- 
a= —0.1171875—0.08591652«—0.4835767e— 6.6801546— - - - 
a_3= —0.1025390625 — 0.187942386e—1.482806e— 25.363988— - - - 
a_4= —0.1441955567 —0.513485447e— 5.512588e— --- 
a_5= —0.27757645—1.6768509e—23.85393e2— --- 
a_¢= —0.67659259— 6.35806e—99.19412—.-.- 
a_7=—1.99353173—27.4191e—--- 
|, a s= — 6.8839143 — 132.422«— - - - 
a_9= —27.248827—707.630e— - - - 
@_19= — 121.59789—4143.717e— --- 
@_11=>— 603.844—.-- 
a_312>= —3302.27—.---. 


(8.13) 


and from these expressions we easily, obtain Fo, Fo’, Go; 
and Gy’ according to (8.6). The Wronskian relation can 
now be written 


a Then we have: 
‘ Ao=e7(2rx)-!-a 
Mo=e7(2rx)*-{(14+-3/2x)a+ c/a} 
| To=e-*(x/2x)}-b 
| p= 6(a/22)!- ((—1-4+3/20)b-+d/ 2) 


n 


(8.14) ab+ (bc—ad)/2x=1. (8.15) 


18 BAAS, Math. Tables VI, Bessel Pusat Pans, Part I (Cam- 
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od just described can be used for values of 
an ¢-p' where c is a constant. A reasonable 


9. THE RICCATI METHOD 
f : In the p-n-plane there are two large regions between 
we obtain region B the transition region and the regions for small p and 
small n, respectively. For these regions the Riccati 


n>2p?; np>50. (8.16) method is well suited. 
i First we consider the case p<2n (region C), which 
Numerical Example has been treated in considerable detail by Abramowitz," 
p. 80. Here we only give the result* with some new 
n=200, p=1 


terms added, and further we give the expressions for 


Thus we have e= (167?)-'=1/640000 and a= (8yo)! Zo Go, and Go’. 


=40. We obtain: 


> 


and ab-+ (bc 


| Fo=(1/2)e?; Fo = (2n) +- Fo-do/dt 
ee 125 lGox ete: Go! = (29) -Go- dp /dt. 


» | b= 1702691879721 Here fof ta an 
= — 1.012954454065 E E E 

d= —0.98364396653 | + (2n)*gs+ 2n) gte: 
J ; (9.2) 


—ad)/2x=0.999999999966. Further we get, | W(t) = —2n- got gi— (2n) “ge 


a €g., do=7.2366047314-10" to be compared with o= 


7.236604731 


3738 -10" as obtained from the power series. | F27) go (29) “ga 


4 


go=[t(1— 4) -Harc siny/t—r/2 
gi= (1/4) log{t/(1—)} 
go= — (8P—121-+9) /4802(1— 192 
g3= (8/—3)/641(1—1)3 
20484— 921 64-+ 16128¢!— 13440—12240P-+-7560!— 1890 
A 9216022 (1—02 (9.3) 


gs=3(10240— 448+ 2081—39)/8192P (1—i)° 
ge= — (2621440°— 1966080P-+ 63897 6018— 1171456013178880 — 9225216113520640% 


— 35884808-+ 2487240:— 8731801+130977)/1032192015/2 (1 —1)*!2 
1105920:5— 55296-314624 — 159552Ľ-+45576t— 5697 
3932166(1—0° Z 
foe Oe 
gi’ =1/4t(1—2) 
g! =— (81—3)/32F(1—1)*!? 


= 


gi=3(8P—41-+-1)/640(1—0)! > 
g= — (1536P — 7042+ 3361— 63)/ 2048152 (1— 1)? 5 l Pe 
ge! =3(25601'—832-+-728—260!-+39)/40968(1—1)? ` 
— 36864008 — 307 20t-+114944°—57792P-+ 166321—2079 . 
8? = pear aa 6553672(1—) "2 . By 


3(8601600-+ 196608/5+-308480#— 177280+-73432P— 177241++1899) 


_ * We take the opportunity to correct an error in reference 15, Eqs. (4.5) and (4.6), where the denominator of ts ould read 
12o (2n—p) instead of 12n(2n—p)- sho id read 
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Tn (9.3) go differs from the corresponding value of refer- 
ence 15, Eq. (4.5) by the term — 7/2. This is due to the 
constant Co= (2mn/(e7"— 1)) 2e". (2am)? which has 
been included. (The approximation above is valid to 
13 digits, if n> 5.) 

It is clear that this method should be useful if 7 is 
sufficiently large and p not too close to 0 or 2n. A closer 
investigation gives the region C (Fig. 2), where the 
limits must not be looked upon as well-defined. How- 
ever it turns out that the accuracy is very sensitive to 
small changes in # close to the line III with the equation: 
n=3p/5+3. For higher values of 7 the line IV 3n=2p+1 
can be used instead. 


Numerical Examples 


I. 7=5, p=10. This point lies on the line I, and there- 
fore we try the power series expansion and the Riccati 
method. 


Fo=1.7207453-10-* (Riccati) 


Fo=1.7207454-10-* (Power series) 


F= 1.7208- 1078 (Table (reference 12), computed 


from Fs). 


II. n=100, p=4. This point is situated in C but 
rather close to region B. 


go=5.7229844- 107! (Riccati) 


go= 5.722993 - 107! (Bessel-Clifford) 


o=5.722985155-107! (Power series). 


Ill. 7=p=10. 


Fo=1.626275-10- (Riccati) 


Fo=1.626-10- (Table (reference 12), Fo) 


Fo=1.6262711-10$ (Table (reference 12), from Fs) 


Fo=1.62627115-10-* (Power series). 

Now we turn to the region G which has also been 
treated by Abramowitz? in some detail. However, we 
will use a slightly modified method. Starting from Eq. 

(2.1) we first perform the transformation: 


a 


= x=2n/p; y= 


from which we obtain: 
e SES 2f at atl —a)/A=0. 


a 


Then we expand f(x) as before: 
f(x) = 2n -goH git (20) g+ (20) gH ++ 
and obtain the following system of equations: 
go? + (1—x)/x'=0 
go + 2go gi + (2/x) - go’ =0 
g Hg H 2go g + (2/2) g =0 (9.5) 
ge + 2go g +261 g2' + (2/x)- go’ =0 


Now we know that when p>~%™, i.e., x0, the ampli- 
tude of Fo and Go—>1 and the phase tends to the value 


o= p—n log2p-+ oo. 


If we use the asymptotic expression for øo (reference 3, _ 


p. XXVI) we get £ 
E 
bo~p—n ez aia logn—n 


—1/12n—1/3607n—1/1260n'—---. (9.6) 


The integration constants must be determined so as to 
meet these conditions. We easily find that g1, gs, gs, --- 
are real, and thus we have: g:(0)=g3(0)=g,(0)= ---=0. 
Zo, 2, 4, ++- turn out to be purely imaginary, and 
(1/2) -(2go-+ (29) “!g2-+ (2n)gs-+ - - -) should go over into 
bo when x0. This condition accounts for the negative 
sign of go’ and for the constant term i/8y in go. Further 
we must have (1/2)(2n)—!go(0) = —1/12n, (1/2) (2n) g4(0) 


= — 1/3607? and so on. These conditions happen to be 
fulfilled without special precautions. 
In this way we obtain 


go=i{ (1—a)"2/x+ (1/2) log{[1— (1-2) 
[1+ (1—a)"]} +-1/8n) 
go = —1(1—2)12/a2 
gi=— (1/4) log(1—x) 
gi =1/4(1—2) 
g= —i(9x?—12%-+8)/48(1—x)?!? 
g2 = —i(Sx—3n2)/32(1—x) 5? 
g= — (81—30) /64 (1—1)? 
83 = —3 (8a2— 4x3 a4) /64(1— x)! 


a ¢ 


We write the result in the following form: 
© 
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f M=(1/(1—2)}14. erz) 
Y(n, x)= — (84—341) /64 (27)? (1 — x)?-+335(1024— 448+ 2082? — 39x) /8192 (27) (1 — x)" 


*7(1105920—55296%-+31462422— 1595524 45576x! — 36972 x’) 


n maid 
393216(2n)*(1—2)? 
Ne ar ee i 1— (1—2) r 
(1,2) =2n +- log ———— ) + —— (922—1244 8)/48 (27) (1—2)! 9.6 
: ‘ame 7 OP 1224 8)/4800n) (I=) ™ (9.6) 


2048—9216x-+ 16128x*— 134402?— 12240x-+7560x°— 1890x" 
wae > 92160(2n)#(1—2)92 


— (130977! — 873180«9+ 2487240x8— 3588480x7-+ 13520640x°—9225216x5 


L +15178880x!— 11714560.3-+6389760x?— 1966080x-+262 144)/10321920(2n)5(1—.x)!5/2@— 


, Of course, the coefficients in this expansion are the same as in (9.3). Further we put: 
» 


A (9,2) = (1—2)!!"/a?-+ (8x—322)/32(2n)?(1— x) 92—23(1536— 7044-+336x*— 6337) /2048(2n) (1 —x) 02 
«5 (368640—30720x-+ 114944%?—5779223-+ 16632x!— 2079x) 
sE 
j 65536- (2n)®: (1— x) 17/2 


4 B(n,2)=1/4(2n)(1—2)— 32° (x°—4x+8)/64 (27) (1— x)‘ (9.7) 
o 3x (2560—832x-+728x?— 26047+3911) /4096 (2n)5(1 22)? 


3x8(1899x5— 17724054 7343204— 177280x3-+-30848022 P POON E 


l 131072 (27) (1— x) 
= 7 2 10. ASYMPTOTIC EXPANSIONS ON THE 
Then we have the final result TRANSITION LINE 
Fo=M -sing In reference 8 Abramowitz and Rabinowitz, starting 
from an integral representation by Newton, have ob- 
Go=M -coso (9.8) tained some very useful expressions for the fanctions 
Fé =—x(BF AE) i Eo, Go, Fo’, Go’ when p=2n, and they also give a small 
OT A ° i table of the functional values for p=0(0.5)20(2)50. In 
Go = —27(AFo+BGo) a more recent work by Biedenharn, Gluckstern, Hull, 
and Breit” these formulas have been generalized to the 
with the Wronskian relation case L>0. The convergence in this case, however, is 
rather slow, unless L is small and 7 large. Here we 
AM?2=1. (9.9) restrict ourselves to L=0 and write down the same 
formulas as in reference 8 with some more terms added, 
Numerical examples will be given later. using the compact notation of reference 19. 
=m 9 $ 
Fo(2n) r(1/38"j _2 T(2/3) 1 32 1 92672 T(2/3) 1 6363008 1 : ee 
x fe. Sere es af 
eena 2a/T | 35 T(1/3) Bt 8100 6° 7371-10' T(1/3) B 3536379-104 gi ; p 


391911498752 T (2/3) 1 


, M — (10.1) 
679377699- 1a T (1/3) 8" - 


eee) S "i 
eo e rt 

ckstern, Hull, and Breit, Phys. Rev. 97, 542 (1955). z : 
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jens S 1 


11488 1(1/3) 1 25739264 1 


ae |= T (2/3) [an 


Go! (2n)/NB)- 2y/n-B'" 


ons, eee -+ asai . 
15 1'(2/3) 8? 56700 $° 


. + . 
18711-10? T (2/3) 65 4179357-105 6! 
1246983424 T(1/3) 1 
—-+- 


ae ; (10.2) 
180355329- 105 T (2/3) 8“ 
Here B= (2n/3)!3; T (1/3)=2.6789385347 and T (2/3) =1.3541179394. : i 
Following reference $ we also give the formulas in a form suitable for numerical computation. 
F a | $ 0.04959570165 0.008888888889 
= EET 
Eo | 1.223404016 | q‘! n? 
0.002455199181 0.0009108958061 0.0002534684115 
=E = 3F -—-+:} (10.3) 
q! n‘ 78/8 
Fo (2n) 0.4086957323 0.1728260369 0.0003174603174 
a 
Go (2n) —0.7078817734 nls 1? 
0.003581214850 0.0003117824680 0.0009073966427 f 
+ + + +f (10.4) - 
q?’ nt qls 


Numerical Example 


Already for 7=1 we obtain an accuracy of 0.1%. 
For p=10, n=5 we get: 


ae (Formula (10.3)) 
Fo=0.9179449 (reference 8, Table I). 
11. EXPANSIONS IN TERMS OF AIRY INTEGRALS 


As is well known it is rather difficult to compute 
Coulomb wave functions in the transition region with 
a fair accuracy. One possibility is to use numerical 
quadrature as indicated in Sec. 5, and another to com- 
pute a key value on the transition line as described in 
Sec. 10 and then integrate Eq. (2.1) numerically. Both 
these methods are in general rather time consuming, 
especially for large values of 7. 

It has been pointed out by Abramowitz and Anto- 
siewics” that it is possible to obtain Fo and Go in terms 
of Airy integrals. They also discuss briefly how to pro- 
ceed when L>0. However, this special method has some 
disadvantages. First, the convergence, even for mod- 
erate values of the argument, is very slow. Second, the 
formula involves a set of constants which must be 
determinec by reference to the functional values on the 
transition line.t 

The paper ‘by Biedenharn, Gluckstern, Hull, and 
Bréit!® which gives much useful information on Coulomb 
wave functions, especially for higher values of L, also 

- contains expansions in terms of usual and modified 
Bessel functions of orders +7/3, and these functions 
appear in such a way as to be expressible in terms of 

20M. Abramowitz and H. A. Antosiewics, Phys. Rev. 96, 75 
Oe take the opportunity to correct a printing error in refer- 


20, p. 76, Eq. (15), where 4 minus sign should be placed 
A the. term H a440) in the expressions for cı and c2. 


- 


Airy integrals. However, the expansions, obtained by 
using a Green’s function for solving a system of non- 
homogeneous differential equations, are rather difficult 
to construct, and it seems practically prohibitive to 
proceed beyond the first two terms. 

As has been shown by Tyson and by Feshbach, 
Shapiro, and Weisskopf,? a straightforward expan- 
sion in terms of Airy integrals can be obtained directly 
from the differential equation in the case L=0, and it 
is easy to see that this method has none of the dis- 
advantages just mentioned. Starting with the Eq. (2.1) 
for L=0, and following reference 20 we put x= (2n—p)/ 
(2n)'8; p= (2n)?8 to obtain 


yy!” —paxy/ (u—4) =0. (11.1) 
Now we try the following expansion: 
y=const{ Ai(x)- (1-+g1(«)/ut-go(x)/n?-+ -- +) 
HAT (8): Gal) /u+ fole)w+-+-)}. (11.2) 
Then we easily find the conditions: 
MOESA O fala) 
(11.3) 


ga (x) + 2a fal (0) + fala) = E argela) 


where fo(«) =0, g0 («)=1. These equations can be written 
in the following more practical way: 


Gal! +2 gn! =2( fra” + 2gn11 + af n1) 


En H2afn tfn=2(gna + 2af pa t+ frat tgn). 
*1 J. K. Tyson, Dissertation, Massachusetts Institute of Tech- 
nology, 1948. 


Isa Eeshbach, Shapiro, and Weisskopf, NYO 3077, NDA Report 
$ e 


(11.4) 
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The first functions are: 
m fila) =22/5 gil(x)=— 2/5 

Jolt) = (22°-+6)/35 gala) = x?(7x9— 30) /350 
Sela) = e (844+ 14803-2320) /63000 gale) = (1056x°— 1160.7 — 2240) /63000 
Sax) =° (125448-995244 4 1488)/693000 
ga (x) = %(323429+4 621280x°— 4788003? — 8041 60) /48510000 
ko 1681684! +11820952019- 6106620805- 713574400.7+- 2140723200 

5(%) = = < 


63063 -10° 
a x? (2666664x°+210254720x5— 13804448022— 356787200) 


an TE 21021-105 (11.5) 
p : {(2858856x!2-4.788972288x"-+ 3064641024344 34077164802! +-8635845120) 
i 441441-108 N 
go (æ) = (35315284!5+6633266640x2-+317451563200x9— 1835265465601" 
— 388613030400x*— 579033728000) /3972969- 107 
ý fr(%) =x? (17153136415 69192483360x!2-++-11 135701248640x9-++- 35219971257600x8 
+4026376730880023-+ 1 13475491584000) /67540473- 10° 
g(x) =%(1783926144015+- 1283288054880x2+ 439889247 1424029 — 232311543744002 
$ ; —55484432640000x? — 113475491584000) /67540473 - 103, 
The constant can easily be determined by putting x=0; then the expansion goes over into formula (10.1), and 
we have a possibility to check (10.1) and (11.5) against each other.t 
The final result is 
Fo=7" (08 (Ai (x) (1-bgi/ugo/att +++) +40! iut fol + ---)} 
Go= r!) (BiG) (1+ g/t eH- JHB (0) Sut folie + -))} 
Fo = — a? (2n) 6. {Ai (a) (gi Hafi) / u+ (go! Fa f2)/u?+ +++] ata 
HAT (1 (git fi) /u+ (got fo!) /u2+--- 7} 
| Golan) BIOS) (ed taf) et] 
+ Bi (1+ (grt fr')/ut (got fo) /p2-+- «+ J}. 
The functions Ai(«), Az’(*), Bi(x), ate ae i fit+n=0. 
i R o ee e a nee Slt 2get flit gr— fig — aft =0. 
values of x. j SS+ 2gs+ fo’git-2giget fi ga— figs! — 2xfifa— fag =0. 
The Wronskian relation can be written: 
(1g ut go/u H: -MEI etfi) E g a eee in region E with 
+ (got fo)/ +: 
— (f/u feH- Ele Fafa) /u Numerical Examples 
Hieta) = n= soe ; 
“oe s er of checking relations 4 . pets 
ae Ee E of them are | Ho OZO (Airy) F i) j 
E e O e E 
; refe: : E T= 0. : 
EER tae cance Ae ots, © (UE Rio, 
X 1946). ; s oe 
D cc-0. In Public Domain. Gurukul Kangri Collection, Haridwar i > ; 2 5 Ga 


Digitized by Arya Samaj Foundation Chennai and eGangotri 
410 CARL-ERIK FROBERG 


Il. p=80, n=50. 
a (Airy) 
Fo=0.001203655 (Riccati, (9.1)) 
IIT. p=10, n=4. 
i o=1.3992085 (Airy) 
Fo=1.39921 (Table 12) 
The points (120,50) and (80,50) are very hard to 
reach, indeed; nevertheless our semiconvergent expan- 
sions give quite accurate results. 
Nolte—The terms in (11.6) vary in a very regular 
way, and by using a logarithmic extrapolation, one can 


obtain at least two more terms approximately and gain 
considerably in accuracy. 


12. ASYMPTOTIC EXPANSION FOR LARGE 
VALUES OF 0 


In this section we again admit L>0. When p is large 
compared with y and J, it is possible to derive asymp- 
totic expressions for the Coulomb wave functions. In 
reference 3 Abramowitz has obtained such formulas 
using the integral representation. Here we will instead 
start from Eq. (2.1). First we perform the transforma- 
tion y= -e"2 where 

61=p—n log2p——rr-++o1 
2 
and 
L 
o1=argl (n+ L+1)=argl (in+1)+ > arctg (n/k). 
k=l 
Then we get the equation 
u” +2i(1—n/p)u' 
+{in(1+in)—L(L+1)}-u/@=0. (12.1) 


Now we put u=uotHtuit ust --- 
obtain: 


ün1/4n= (îin—L+n)(in+}L+n+1)/2i(n+1). (12.2) 


We split 2, into its real and imaginary part 


with #,=a@,/p” and 


Un=Sn tiln 
and further we introduce: 
An= (2n+1)-n/2(n-+1)p; 
«+ By={L(L+1)—n (ntl) +9}/2n4 1). 


Then we find the recursion formulas: 


Sny =A nSn—Brln - 
| (12.3) 


tnpi=A nbn BrSn 


ne ee ee 
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For computation of the derivatives we put 


ee ` 
to obtain: 


Ta= tn t+ Sn(1— n/p) 
| Sny =A „Sn— Bal n—Sni1/P 
Trp =AnTntHBnSn— iny/Pp. 
As is easily found from the known behavior at infinity 
we have the following initial conditions: 
So=1, lo=0, So=0, To=1 n/p. (12.5) 
Now we put 
S=ysn, D SS; F=LT, (12.6) 
where the summation of the divergent series is cut off 
after a suitable number of terms (i.e., when the wanted 
accuracy has been obtained or when the terms of the 


series start to increase again). Putting 0,=0 we get the 
final result: 


F,=t-cos0+s- sind 
G1i=s-cosi—t-sind 
F ,/=T-cos6-++S- sind 
G,/=S-cosé—T-siné, 


(12.7) 


with the Wronskian relation 
sT—St=1. (12.8) 


This method can be used when 7p and when L’<p. 
In the case L=0 a closer investigation gives the region 
H (Fig. 2) as result. 


Numerical Examples 
I. p=S0, 1=9. 
Fo=0.93570855 
| Go= —0.61180203 
| Fo=0.935709 


(Riccati, (9.8)) 


(Asymptotic formula, (12.7); 


Go= — 0.611802 only 6 decimals carried) 
IL. p=20, 1=5. 

Fo=—0.229352 p y 

a 1.165712 (Riccati, (9.8)) 

Fo= —0.229347 

oe ese formula A127) 
II. p=10, 7=3. 

Fo=0.660103 i 

HE AEN (As. formula, (12.7)) 

Fy=0.660099 

ee ~106011 (Table 12) 


(12.4) © 


ta 


eo 


as are dependent on other functions th 
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13. CONCLUSION 


It is obviaus that many more methods for computa- 
tion of Coulomb wave functio 
quite satisfactory methods have not been mentioned 
here. However, we have tried to 


ns exist, and even some 


avoid such methods 


: € an the elementary 
ones, for quite obvious reasons. For example, the ex- 


pansions in terms of spherical Bessel functions (due to 
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Taste I, 

Method Reference 
A Power series (7.7, 12) 
B ]- (3.6, 14) 
fs (9.1) 
D (5.2, 5) 
E Airy integrals (11.6) 
F Special case of E (x=0) (10.3, 4) 
G Riceati IT (9.8) 
H Asymptotic formula (12.7) 


P. M. Morse) and usual Bessel functions (Abramowitz) 
(see reference 3, p. XVIII) have not been discussed. 
There are a few exceptions from this rule: we have 
referred to the tables in references 3 and 12 which seems 
legitimate, and to the table of Airy integrals, since these 
functions can hardly be avoided. 


When yumezsical values of Coulomb wave functions 
are needed, then Figs. 1 and 2 which are self-explana- 
tory, first should be consulted. For convenience we give 
Table I. (When p,n <10, consult Fig. 1 in Sec. 4.) We 
also give the equations of the curves in Fig. 2. 

I. np=50 
II. n2 
Ill. 7=3p/5+3 
IV. 3n7=2+1 
f _ V. 2n=p 
VI. n=2p/5 
VII. 1=13p/30—5/2 


D CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar a 


i 


Tt is obvious that these curves must not be looked upon 
as limits which cannot be exceeded. In many cases two 
and even three methods overlap, and this gives a good 
possibility to check the computations. 


14. REMARKS ON FUTURE TABLES 


From the discussion above it can be concluded that 
when future tables are being prepared one should con- 
centrate on a region formed by a parallelogram with 
its corners in (p,n)= (10,0), (20,5), (20,15), and (10,10). 
The table should be constructed as a “skeleton table” 
with e.g., L=0,5,10 and Ap=1, An=1/2. At the same 
time the table (reference 12) should be enlarged so that 
An=1/2 instead of 1. The region mentioned here seems 
to be the most difficult (and most important!) one at 
the time present. As far as can be judged, numerical 
quadrature will be adequate in the whole region, and 
no special complications should be expected. 


Digitized by Arya Samaj Foundation Chennai and eGangotri 
VOLUME 27, 4 


REVIEWS OF MODERN PHYSICS 


NUMBER OCTOBER, 1955 


Virial Coefficients and Models of Molecules 
in Gases. B* 


TARO KIHARA 


Department of Physics, University of Tokyo, Tokyo, Japan 


TOTAL CONTENTS 


I. The Second and the Third Virial Coefficients for 
Spherical Molecules - 
1. Introduction 
2. Square-Well Potential 
3. Lennard-Jones Potential 
4. Inverse-Power Repulsive Potential 
II. Criticism of the Lennard-Jones Potential 
5. Transport Properties of Gases 
6. Stability of Crystal Structures 
III. The Second Virial Coefficient for Nonspherical 
Molecules 
7. Rigid Convex Body 
8. Parallel Bodies of a Convex Core 
9. Core Model 
[The above is in Part A, Revs. Modern Phys. 25, 
831 (1953).] 
IV. The Second Virial Coefficient of Helium 
10. The Depth Parameter, s, of an Intermolecular 
Potential 
11. Quantum-Mechanical Expression for the Sec- 
ond Virial Coefficient 
12. Low-Temperature Expansion of the Second 
Virial Coefficient 
13. The Second Virial Coefficient for the Square- 
Well Potential 
14. High-Temperature Expansion of the Second 
Virial Coefficient 
15. The Second Virial Coefficient for the Lennard- 
Jones Potential 
V. The Third Virial Coefficient of Helium 
16. High-Temperature Expansion of the Third 
Virial Coefficient 
17. The Third Virial Coefficient for the Lennard- 
Jones Potential 
18. Non-Additivity of the Intermolecular Potential 
VI. Virial Coefficients between Unlike Molecules 
19. The Second Virial Coefficient between Helium 
a Isotopes 
20. The Core-Model of Interaction between Un- 
“like Molecules s 
21. The Third Virial Coefficient for Binary Mix- 
I ‘tures . 


e 


* This is a continuation of a previous article with the same title 

evs. Modern Phys. 25, 831 (1953)], which will be referred to 
as A. In part A a discussion of pure gases with no quantum effects 
is given; in Part B gas mixtures and quantum effécts are con- 
sidered. « 
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IV. THE SECOND VIRIAL COEFFICIENT OF HELIUM 


I | AVING treated nonquantum gases, we investigate 
in this part the second virial coefficient of helium 
for which quantum effects are important. 


10. The Depth Parameter, s. of an 
Intermolecular Potential 


The notion of depth parameter introduced by Blatt 
and Jackson" for nuclear potentials can also be applied 
to the potential between helium atoms. Let U(r) be 
the intermolecular potential, z being the distance be- 
tween the centers of atoms. We can always choose a 
dimensionless positive number, s, in such a way that the 
reduced potential, sU(r), give a resonance at the zero 
energy and no discrete energy level. The number s is 
called the depth parameter. If one or more discrete 
levels exist in the two-body system, s is greater than 
unity; if no discrete level exists, s is less than unity. 

The Schroedinger equation 


a> = 2m* U(r) 
R R 


=0 - (10.1 
dr ie g g ) 


with the reduced potential U/s must be satisfied by a 
nodeless spherically symmetric wave function 7—R(r). 
Here m* is the reduced mass of the two-body system; 
Å is Planck’s constant divided by 2r. 

For the Lennard-Jones potential 


6 To)” n ro\ © 
U)=04 ( ) ( ) | n>6, (10.2) 
n—6\ r n—6\ 7 


with the potential minimum — WU at r=ro, (10.1) be- 
comes 


ae 6 Ey n € 6 
ro—k— 06 =|] P= —} |R=0, (10.3 
dr n—6\ r n—6 JI A 


where 


2m* Uo 
6c=— ro. 


10.4 
Ds (10.4) 


For n=10, the eigenvalue problem can be solved ex- 
actly with c=1 and ' 


r 


r(e] 


"J. M. Blatt and J. D. Jackson, Phys. Rev. 26, 21 (1949). 
i © 
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In general, the eigenvalue c of (10.3) is determined 
by a variational principle, i.e., by the extremum 


= dR\* sr 
f (=) i(—) 
dr To 
A 6 To\” n To\ 8 r 
Ja 
` n—6\r n—=6\r ro 


(10.6) 


For n not far from 10, we can obtain a sufficiently 
accurate value of c making use of the approximation 


R=exp[—a(r0/r)*], 


where a and b are positive constants to be adjusted’ 
By virtue of this approximation we can integrate ana- 
lytically the numerator and denominator of (10.6), and 
obtain 


c=0.905 for n=9, (10.7) 
c=1.174 for n=12. (10.8) 
For the square-well potential 
«© forr<o 
U(r) =4 — € for o <r <go (10.9) 
0 for go<r 
the nodeless solution of (10.1) is 
0 for r<oa 
R=4sin[a(r—c)/2(g—1)o] for o<r<go (10.10) 
iar for go <r, 
the eigenvalue condition being 
2m* eo? T 
ze l (10.11) 
#2 s 4(g—1)? 


These results will be used in the following sections. 


11. Quantum-Mechanical Expression for the 
Second Virial Coefficient 


In this section the classical second virial coefficient’ 


B=ln f -eje (11.1) 


is generalized to a quantum-mechanical expression. 
Here, as the first step, the wave nature of molecules is 
considered; the symmetry effect, which is important 
only at low temperatures, will be taken into account in 


the following section. Peek. 
Let us consider the density matrix which is defined by 


oar) -Z exp BEWE) (11.2) 
B=1/kT. 


agrees with the result of J. E. and M. F. 
19, 930 (1951). 


? 


18 This value for n=12 
Kilpatrick, J. Chem. Phys. 
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Here {y,} is a complete orthonormal set of eigenfunc- 
tions of the relative motion of the two molecules, so that 


[rotoi dr=dxdydz, - (11.3) 


for all v and »’, r= (x,y,z) being the position vector of 
one molecule relative to the other; E, are energies corre- 
sponding to y,. In the limit of infinite temperature, the 
density matrix becomes 
limor) =E) =r), (114) 

in which 6(r—r’) is the Dirac ô function. 

The density matrix satisfies the Bloch differential 
equation 


ð 
—plr,r’) = —Hp(z,r’) (11.5) 
0g 
with the Hamiltonian 
W Ge ina G 
H=-—A+U(r), A=—+—+—, (11.6) 
2m* ðe dy dx 


operating on the first argument r, for which Hy,= Esp,- 
In the particular case where U (r)=0, the solution of 
the Bloch equation with the “‘initial” condition (11.4) is 


; m* \3 m* 5 
eer) se) sa m } 


We can therefore assume the general solution to be of 
the form 


m 


m* “3 m* 
plr) = (=) exp] — le) w| (11.7) 


with 
W (r,r )}—0 for BU—0. (11.8) 


The diagonal element of the density matrix then be- 
comes 


p(r,r)= (m*/274 6)! expW (r,r). 
We therefore obtain 


(27h? /m*kT)?p(r,r)=expW (r,r). (11.9) 


Both sides of (11.9) are quantum-mechanical gener- _ 


alizations of the Boltzmann factor, exp(— 6BU), in (11.1), 
since they indicate the probability of finding two mole- 
cules a distance r apart from each other. They are 
called Slater sums. The expression on the left-hand side 
of (11.9) will be used for low-temperature expansion; 
that on the right-hand side will be used for high- 
temperature expansion. 2 

The argument in this section was chiefly taken from 
Husimi’s comprehensive paper on the density matrix. 


8 K. Husimi, Proc. Phys.-Math. Soc. Japan 22, 264 (1940). 
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12. Low-Temperature Expansion of the 
Second Virial Coefficient . 


Since the intermolecular potential U is a function of 
the distance r only, the eigenfunctions y, in (11.2) can 
be factorized into 


w= r Rnlr) Yin (6, o), 


with radial functions rR, normalized as 


v= (n,m), (12.1) 


f | Ra(r) |2dr=1 


and spherical harmonics Y; also normalized to one: 


ii f [Vim(8,¢)|2singdsde=1 (12.2) 
0 0 


(compare (11.3)). Then, according to (11.2) the diag- 
onal element of the density matrix assumes the form 


l op 2( Be |Ru(r)|? 
oltsr)= En 21+1) exp J nı(r 


by virtue of the addition theorem 
Mal Fim(0, p) |= (21F-1)/4r. 
With this expression the second virial coefficient is 
given by 


B (ZE) f be» Dear, (12.3) 
=—2r p(t,r)—p%(r,n) Jdr, (12.3 
m* kT] Jo 


where p° is the matrix written for U(r)=0, namely 


1 Enl 
E) ep( aaa 
Agr? T 


) | Rn? (r)|?, 


Eni and r Rn? (r) being, respectively, the energies and 
the radial functions for U(r)=0. Thus, performing the 
integration with respect to r, we obtain 


B= (21+1)B,, 


7 1 ( 2mh? )z [ ( Ent 
=—_— nl e = 
; 2\m*kT- S =) 


; -en(-=)]. (12.5) 


In general some of En may be negative; and the 
summation over 7 can be separated into two parts, one 
is for negative energy levels which are discrete, the other 
for positive energy levels which are continuous. The 
latter-can be transformed into the integral 


iy eo E \ dn 
-f ex(— ae. a 
> mo L kT/ dE 


(12.4) 
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Here 7; is the phase shift defined by the a symptotic 
form, 


R(r)~sin (kr—Bar-+m1) for ro (12.6) 


of the regular solution of 


2m* 1(-+1) 


d 
[+ K— U(r)— 
dr? A 7? 


KL=2m* Eh. (12.8) 


(When U(r)=0, the wave function is R°(r)~sin(xr—4r); 
hence 7;/z is the difference between the number of sta- 
tionary states below £ of the real two-body system and 
that of the ideal system for U(r)=0.) Eence it follows 
that 3 ¢ 


1/ 2h? \3 En 
Eee 
2\ m*kT diserete kT 


1 2 E dn 
+-f exp(-—) —d 
To © RT/ dE 


Up to this point we have been considering only the 
quantum effect due to wave nature. For ordinary 
helium, Het, we must take account of. the fact that 
wave functions y, are symmetric with respect to the 
interchange of atomic coordinates. In this case, only 
the spherical harmonics with even J should be taken and 
the left-hand side of (12.2) should be divided by the 
symmetry number 2. As a result the right-hand’side of 
(12.4) should be replaced by 25 (2/-+-1)B; in which the 
summation is to be taken over even Z. Furthermore, we 
see from (12.3) that the left-hand side of (12.4) should 
be replaced by the difference between the real second 
virial coefficient and that of the ideal Bose-Einstein gas 
with zero spin. Thus we obtain 


[o=o CO 


where 


z| (12.9) 


+2 © (21+1)B.. 


even l 


1 / 2TA 
( (12.10) 


16k 


The expression (12.10) with (12.9) was first obtained 
by Uhlenbeck and Beth” and by Gropper.” (The pres- 
ent author avoids discussing the virial coefficient of Ves. 
which is more complicated because of the fact that the 
spin 3 requires the inclusion of both odd and even 
terms; see, for instance, Hirschfelder, Curtiss, and Bird, 
Molecular Theory of Gases and Liquids.) 

In case no discrete energy level exists, B; can be 
transformed, by integration by parts, into 


e 1 Ne E 
ETE Fi d exp| —— 12.11) 
2 a J th x0( =) ( ) 


E=0 
for 7=0, 1, ---. This expression holds for s<1, not only 
for s<1. ' 


2 G. E. Uhlenbeck and E. Beth, Physica 4, 915 (1937). 
21 L. Gropper, Phys. Rev. 50, 963 (1936); 51, DAS (1937). 


aa 
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In the particular case, 
of the intermolecul 
use of Schwinger’s 
asymptotic form 


where the depth parameter s 
ar potential is unity, we can make 
variation principle? to write the 


T 


m=z f (1— R®)dr--0(x) (12.12) 
0 


at low enérgies. Here R is the eigenfunction of (12.7) 
with /=«=0 normalized as R—=1 for 7. At very low 
temperatures y, and therefore Br, can be neglected for 


1>0; and mo can be approximated by the first two terms 
of (42.12) so that 


> 
My A [ 2r | (2 } pe 
io) = = 1—R?)dr |. 
RE 2 2mh? ) J s (| 
For the Lennard-Jones potential (10.2) with n=10 we 
get 


{) (1 —R’)dr= GYT B)ro= 1.356 ro 
o 


by virtue of (10.5); for the square-well potential (10.9) 
we have 


f GR dr=He41)0 
0 


by virtue of (10.10). 
De Boer and Michels” pointed out the fact that the 
depth parameter s for helium is not far from unity. 


13. The Second Virial Coefficient for the 
Square-Well Potential 


For the square-well potential (10.9) the phase shifts 
can be expressed analytically by means of the spherical 
Bessel functions 


Jile) = (1/28) (2). 
pa The function R in (12.7) with the asymptotic (12.6) is 
ry given by 
R=xr[cosniji(kr)++ (—1)! sinj- (xr) | 
for r> go, while the solution which vanishes at r=o may 
be written as 
R=conste’r[ j_v-a(k’o) jul’) — julk’o)j_a1(k’7) 


for o<r<go, where 
o 8T, i K= etm fhe 


or, by use of (10.11), 


(K'o) = (ko)? +r°s/4 (g= 1). (13.1) 


hys. Rev. 77, 441 (1950). 
de Boer eal Michels, Physica 5, 945 (1938); 6, 409 
“ (1936). See also Kilpatrick, Keller, Hammel, and Metropolis, 


103 (1954). $ . 
j me oa Me eee ed with the assistance of S. Kaneko. 
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Fic. 12. The depth pa- 
rameter s vs negative energy 
level divided by the poten- © 02 
tial depth for the square- 
well potential. 

0.01 


The requirement that R and dR/dr be continuous leads 
to 
Aji-1( geo) — jul gxa) 
(-1)! tae R 
A jul geo) + j-i1(gKo) 


(13.2) 


pub j-r-alK'o) jul ge’) — jul’) j_vage'o) 


A= -~ - (13.3) 
K joalo)jialgo)t jilo) jilga) 
for 1=0, 1, 2, ---. Here relations 


d 
ge ee =x!" f a(s), 
ax 


d 
Tija] (2) 
x 


have been used. 

For the square-well potential discrete energy levels, 
Eny can be calculated easily. For s slightly greater than 
unity we need consider only one discrete level, Eso, 
which is for J=0. Since the function R in (12.7) with 
1=0 and = —2m*| Eoo| /f? is then given by 


const exp[— (2m*| Eos|)?r/4] for r>go, 
const sin[{27* (e— | Eoo! )}!(r—0)/A] for o<r<go, 


the requirement that R and dR/dr be continuous leads to 


E ea 
cot} {| s——— } - |= — ( ——_—_ 
G 2 e— | Eol 


where (10.11) has been used. This relation can be trans- 
formed into 


€ 2 [Eol yP 
rit tarf, ) | » (13.4) 
e— | Eoo] T e— | Eo! ` 


which is shown in Fig. 12. (Figures 1 to 11 are in 
Part A.) 

The second virial coefficient (12.10) for the square- 
well potential then assume, for each g and s, A 


a? 


ah \} wskT 
b= ( ) f(t) where r=———_, (13.5) 
me kT 4(g—1)*e 


the function f being given in Table XII. © 
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TABLE un The function f(r) in Eq. Us: 5). 


Jir) Tor f(r) for f(r) for f(r) for 
g=1 gas £=2.0 £=2.0 
T Sio s=1.1 s=1.0 s=4/3 
0 —0.5625 — o —0.5625 — o 
0.5 —0.380 —0.667 —0.384 —1.017 
0.75 vee —0.648 tee 500 
1 —0.422 —0.670 —0,.453 —1.017 
2 —0.522 —0.791 —0.535 —1.132 
4 —0.551 —0.872 —0.456 —1.149 
6 —0.423 —0.783 —0.202 —0.983 
8 —0.174 —0.568 0.153 —0.701 
10 0.152 —0.253 0.600 —0.333 
12 0.559 0.133 1.111 0.106 
14 1.059 0.584 1.65 0.612 
16 1.59 1.110 2.26 1.161 
18 2.16 1.68 2.93 1.74 
20 2.78 2.31 3.69 2.41 


In Fig. 13 calculated curves of (m*kT/2rh?)iB for 
ordinary helium, He‘, are compared with experimental 
results, which are given in Table XIV. The comparison 
shows that the depth parameter s is within the range 


1.0<s<1.1 for He‘— He! (13.6) 
(probably s~1.03), which corresponds to the range 


0.85<s<0.95 for He!— He’, 
0.75<s<0.83 for He’— He’. 


Although the relation (13.6) has been derived by 
means of the square-well potential, it is not very sensi- 
tive to the used potential. We, therefore, draw the 
conclusion: the two-body system of He‘—He' probably 
has a discrete energy level while both the Het— Het and 
He*—He? systems definitely have no discrete level. 

Figure 13 shows, furthermore, that the adequate 
value of the model parameter g for helium is 1.5, which 
is conspicuously smaller than the value found in Sec. 2, 
ie. g=2.0. A discussion on this discrepancy will be 
given in Sec. 18. 


Fice 13. Reduced sec- 
ond virial coefficient of 
helium vs T/T, Tz be- 
ing the Boyle tempera- 
ture 26.0°K. The curves 
are for the square-well 
potential with the depth 
parameter s. 
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14. High-Temperature Expansion of the 
Second Virial Coefficient , 


Above the Boyle temperature, where high-tempera- 
ture expansions will be used, the symmetry effect is 
negligible. We therefore consider only the quantum 


effect due to wave nature in the following. The second - 


virial coefficient is then given by 


B=% | [1—expW (r,r) ]r?dr. (14.1) 
0 


Here W (r,r) is the diagonal element of W(z,r’) which 


TABLE XIV. Second and third vir‘al coefficients of Het. 


T°K BA! C X1072 A6 Reference 
2.15 —293 a 
2.32 —262 
2.86 —205 
3.35 —172 
3.96 —139 

6 

14.16 —25.7 b 

17.30 —15.0 

20.58 —6.0 

23.35 =S 

37.4 7.6 

48.2 12.0 

20.35 —4.6 11.3 c 

65.15 15.6 7.0 

90.15 17.6 5.0 

123 18.9 5.2 

173 19.8 5.3 

223 19.8 5.0 ¢ 

273 19.5 cee 

273 19.7 2.1 d 
323 19.2 2.0 

373 18.8 2.5 

423 18.4 3.0 


a W. E. Keller, Phys. Rev. 97, 1 (1955). 

b Data are due to Nijihoff, Keesom, and Iliin; the values are taken from 
Keesom, Helium (Elsevier, Amsterdam, London and New York, 1942), 
p. 

° The values are taken from J. Otto, Handbuch d: petra 
Bd. 8, Teil 2 (Akademische Verl. Leipzig, 1929), p. 

4A. Michels and H. Wouters, Physica 8, 923 nen 


is to be determined from (11.5), (11.7), and (11.8). Let 
us in this section denote W (r,r) simply by W. 
Inserting (11.7) into the Bloch equation (11.5) we 
have 
/ 


ô T ° 
Se ret es g(vW)?+gAW, (14.2) 


where Y means gradient with respect to r, A=V-V, 
and 


> q=h?/2m*. (14:3) 
Let us expand W into a power series in q: 
W=WotWitgwet::- (14.4) ` 


We then obtain from (14.2), as regards the coefficients 


e 


> 


oy 
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VIRIAL 


of 7, 7, È, M hue 
a ð HDS 


rE 
U+—Wot+—-vW = 
ap sa 


Ero r=r 
a t “VIV=(VW)-FAWs, 
r—r’ i 
pa B “VW2=2VWo- Vit AW, 
rr o 
2 set VY Wa=(VWi)?-L20 Wo VW FAW, 
(14.5) 
Taking the condition (11.8) into account we have 
from the first equation of (14.5) 

4 Wo'x,r)=—pvu. (14.6) 
Multiplying the first equation of (14.5) by V and A we 
have 

ð 1 r—r 
VU+—VWot-vWot -VVWo=0, 
aß B 
ð 2 r—r’ 
AU+— AW o+-AW 0+ -vAW,=0, 
op B 
from which, taking the limit r'—>r, we have 
(Vv Wo) xtor= —i6V U, (AWo)r>= —3GAU. 
Inserting these into the second equation of (14.5) we 
obtain 
B? Bi x 
Wi(r,r) = -—AU+—(v U}. (14.7) 
6 12 
Similarly, 
8 B 
W2(r,1)=—-—vVVU:vUvU+—vU-vAU 
60 30 


4 3 


bf net 
90 60 


4s 


1767 À 
W:(r,r)=—vU:vvU-vyyU-vU 
5040 


E` 

+—vvvJU: NAN UvU 
840 

i 178° 

—_-yAU-yvU-vU 

2520 J 
} As regards the : and : notation, see, for instance, Chapman 
and Cowling, reference 7. 


b 


pe 
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——VVAU:vUVU 
280 


p’ 


—-—VVVU:VVUTU 
210 
486 


PE EA ANA 
2 


176° 
+——vAU-vAU 
5040 


+—vAAU-vU 
280 


5 


+- 


VVAU:vVU 
) 


g5 


+—yvyyvU:yyyU 
840 


Bs 
———AAAU. 
840 


These altogether give 
W (rr) => g W (r,1). 
Transforming expW (r,r) into 
exp(—BU){i+qWilrr)+g_Welr,r)+4Wi(r,2)7] 
HELW: (r,r) Wi (r,r) Wal, r) + Wi(r,r)3/6]+---} 


and inserting this expression into (14.1) we finally ob- 
tain the second virial coefficient in the form 


B= B+-gBY+ PB + tee, 


(14.8) 


(14.9) 
where 


BO=2e f (1—e*%)rdr, 
0 


r kal 
Bo = f e fU U"r?dr 
6 H 


102 pUe e 
BO== a Í “ay itp E ar, 
i, 5 Oe ; 
yur yin F: UM 
Boas f. (H k 
Haor 756 ; 
BU’ ui B y” BU” y” EU“ 
180r | 9457 720 680r S 
pa geu’ 
+ Ja 
~ 2160r 25920) 


q bane defined by (14.3), 6 being 1/}T. y 
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The quantum correction in the form of expansion 
(14.9) was first given by Wigner® and then by Uhlen- 
beck and Gropper. Kirkwood** supplemented their 
method by furnishing a more convenient means of ob- 
taining the expansion, which was used by Uhlenbeck 
and Beth?” and by Gropper in their recalculations of 
the quantum correction up to the term proportional 
to 4°. The term proportional to qì was recently calcu- 
lated by Midzuno, Shizume, and the present author.’ 
The most elegant method here used is essentially due to 
Husimi.” 


15. The Second Virial Coefficient for the 
Lennard-Jones Potential 


Integration of each term in the expansion (14.9) can 
be performed for the Lennard-Jones potential 


U (r) =)? — pr-6 


6 ro\” n ro\° 
-v{—(“) ee (=) | n>6, (15.1) 
u—6\r n—6\r 


the result being a power series in 


u RT  &in 
-A 
kTX\ X 


2r / A \ o /6I—3\ y! 
n \ kT t=0 n t! 


Bt (yes TENI y 
= SS bee r —J © (6,7), 
nm ETN EL a) ”) 


Ae) =( 1 )( A Ta (= yt 
anae a JP F 9 6, ? 
n \ kT =) =0 n J ee) 


AO 2r/ 1 )( À ) z 6t+3)\ y! 
>| (eect A (ee r = JO (637); 
n \ kT kT t= ( n J gon) 


where 
127, (6,n)= (n—6)6t— (n—1), 
4320J;° (6,2) =21 (n—6)?6?22 
+ (6n?+-30n—42) (n—6)6t 
: +6n—27n?-+-30n-+63, 

362880I, (6,11) = 93 (n~: 6)3638 

+ (82n?+ 657n—741) (n—6)?622 

+ (24n*+ 348n3+ 93x?—810n+-2241) (n—6)6t 

+72n>— 126n'— 5313+ 1755n2—405n—3645. 


u E. Wigner, Phys. Rev. 40, 749 (1932). 
2 G. E. Uhlenbeck and L. Gropper, Phys. Rev. 41, 79 (1932). 
26 JAG. Kirkwood, Phys. Rev. 44, 31 (1933). 
27 G. E. Uhlenbeck and E. Beth, Physica 3, 729 (1936). 
28 Kihara, Midzuno, and Shizume, J. Phys. Soc. Japan 10, 249 
1955). c 
‘ ee sardines: see reference 3; de Boer and Michels, see 
reference 23; Kihara, Midzuno, and Shizume, see reference 28. 
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Let us express the second virial coefficient in the final 


form 
FY(s) F(z) 
B= Grd FO (+ +——+-- + : 


cs (cs)? 
Uo 
=i——, SA) 
kT 
where (see (10.4)) 
1 wil 6q 
6 ; (15.3} 
cs 2m* Ur? Uor? 
FO (2) = 56O z BHi] In, 4=0, 1, 25 cd Oe, (15.4) 
t=0 a 


The coefficients 8 are given in Table XV for n=9 and 
12. (The functions 7 for n=9 are included in ‘Table 
XVII.) 

Making use of these tables and depending on the 
observed values given in Table XIV, we can determine 


TABLE XV. Coefficients in tht expansion (15.4) for 
L.-J. (6,n)-potential. 


For n=9 
t B®) BO) By B®) 
0 1.7061 0.3880 —0.0426 0.0287 
1 —2.1263 0.1513 —0.0990 0.0956 
2 —0.7500 0.2282 —0.1740 0.2055 
3 —0.4259 0.2475 —0.2400 0.3376 
4 —0.2654 0.2350 —0.2822 0.4633 
5 —0.1685 0.2039 —0.2949 0.5561 
6 —0.1065 0.1652 —0.2815 0.6008 
7 —0.0664 0.1265 —0.2490 0.5958 
8 —0.0406 0.0924 —0.2071 0.5498 
9 —0.0244 0.0648 —0.1632 0.4771 
10 —0.0144 0.0438 —0.1228 0.3923 
11 —0.00831 0.0286 —0.0886 0.3076 
12 —0.00471 0.0182 —0.0616 0.2312 
13 —0.00263 0.0112 —0.0414 0.1672 
14 —0.00144 0.0067 —0.0270 0.1168 
15 —0.00077 0.0040 —0.0171 0.0790 
16 —0.00041 0.0023 —0.0106 0.0519 
17 —0.00021 0.0013 —0.0064 0.0331 
18 —0.00011 0.0007 —0.0037 0.0206 
19 —0.00006 0.0004 —0.0022 0.0126 
20 —0.00003 0.0002 —0.0012 0.0075 
For n =12 
t Br (0) Br) B) B.@) 
0 1.2254 0.4838 —0.1276 0.1692 
1 —1.8128 0.3694 —0.3768 a 0.6512 
2 —0.6127 0.4471 —0.6189 1.3133 
3 —0.3021 0.3981 —0.7254 1.8454 
4 —0.1532 0.2979 —0.6815 2.0343 
5 —0.0755 0.1965 —0.5447 1.8740 
6 —0.0357 0.1173 —0.3838 1.4994 
7 —0.0162 0.0645 —0.2440 1.0694 
8 — 0.00702 0.6330 —0.1422 0.6923 
9 —0.00292 ° 0.0159 —0:0769 0.4124. 
10 —0.00117 0.0072 —0.0390 0.2284 
11 —0.00045 0.0031 —0.0186 0.1186 
12 — 0.00017 0.0013 — 0.0084 0.0581 
13 — 0.00006 0.0005 7 0.0036 


— tse ~ 


w 
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the model parameters for Het: 


ro=3.11A, Us/h= 782K, s=1.15 for n=9; 
ro=2.88 A, Uo/k= 10.80°K, s=1.05 for n=12 


of which the latter agrees with (13.6). (The choice of 
n= 12, however, is not compatible with the statement 
in Sec. 3: “The bowl of the Lennard-Jones potential 
with #=ł2 is too narrow for molecules of the rare 
gases.” This discrepancy will be discussed in Sec. 18.) 


`V. THE THIRD VIRIAL COEFFICIENT OF HELIUM 


Ta this part the third virial coefficient of helium at 
higher temperatures is treated under the assumption of 
additivitys of the intermolecular potential. This assump- 
tion is shortly discussed in the last section of this part. 


16. High-Temperature Expansion of the 
Third Virial Coefficient§ 


The expansion (14.8) of W (r,r) for a two-molecule 
system can be generalized to W (rı, ro, +++, rN; Ti, Io, 
+++, ry) for an N-molecule (single component) system, 
where r; is the position of the ith molecule. Namely, 


W (ru +++, IN; En -°+,tv)=Woln, «++, tw, ---, ty) 
+HqWilt, +++, TN; Tn +++, tw)+-+-, (16.1) 
where q is 7? divided by the mass of a molecule. (Com- 


pare (14.3) where m* is the reduced mass.) The first 
term on the right-hand side is, like (14.6), 


"ys ty)= —B8(r, Oy ty); 
B=1/kT, (16.2) 


Walt, --- Iy; Ip 


® being the potential energy of the system. For the 
second term we have, like (14.7), 


Walt, ++, IN} 41, i-e Ey) 
ie eG) @ B3 N ð 2 
=- 5 a (e . (16.3) 
12i=1 ðr; ðr; 24 i=1 \ ðr; 


The classical third virial coefficient can be expressed 
in the form 


aB f f f (oteier 


-expl -6t (rsrs) + exp — 68 (r2,r5)_] 
= exp[ —6® (r1,¥2,13) ]}d71d72d73, 


V being the volume of the system. Replacing in this 
expression the Baltzmann factors, exp[—6®(n1,r2)],- 7 
exp[—®(r1,%,rs) |, by the corresponding Slater sums, 


De Boer and Michels’ result, reference 23, is ro=2.87A, 


. Uo/k=10.22°K for n=12 which corresponds to 3=0.99. 


§ Sections 15 and 16 were prepared with the assistance of Y. 


Midzuno and T. Shizume. , 


D CC-0. In Public Domain. Gurukul Kangri Collection, Haridwar $ 


> 


expLW (ri,r2; rira) ],---, expC (1,023; t1,r2,r) |, we 
obtain the quantum-mechanical expression.” 

Our object in this section is to calculate the first 
quantum correction gC™ to the third virial coefficient 


C=CO COH. (164) 


which is similar to the expansion (14.9) of B. By use 
of (14.9) we have 


CY =8BOBO+LC, (16.5) 


1B? Í : ‘ ô ? 
G= —-—f f f a - Mrr) 
3V 24 | Lar: 


2 


a] 
-expE= trr) | arr) 
iLor; 1 


4 2 


a a2 
—expl— 8t (rar) | [= rara) | 
i LOr; 


3 


ð 2 
-Fexpl—S(1,r,rz) | tenra] |drdradrs 
i Lð 


r; 
(16.6) 


Here we have used integration by parts such as 


1 ee i 
fff exp(—8b) >> —-—bdridralrta 
y Ər; or; 


1 ð s: 
= -STS exp(—8t)2 (=) dridralra. 
V or; 


We are assuming from the outset that the potential 
energy of the system is equal to the sum of the potential 
energies of pairs. Hence, in terms of r;;={r;—r,|, the 
potential energy of a three atom cluster is 


P (r1, r213) =U (rit U (riz) +U (ra). (16.7) 


Taking into account such relations as ; 


ak if ii ele (— 


= f í Í expl—BU (LU a) Parades 


LJ 
and using the transformation . 


VA drdredr3=3rerserisrasdriedrizdrez > 


of integration variables, we finally obtain 


2 
) dridralra 


> 


Qn? . E; 
ETTE f Í f (A etA wat A)r fact 2dr iad as, 


(16.8) 
% Uhlenbeck and Beth, see reference 27; see also B. Kahn and 
G. E. Uhlenbeck, Physica 5, 399 (1938). | 3 
hd 
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where, 


An= { Lu’ (ria) P+ U' (ris) U' (rea) (rab res? — nz) /2ris¥23} 
Xexpl—pU (712) —BU (nia) —BU (res) ] 
< LU’ (rie) F exp[— 8U (ria) J 


with similar expressions for Ay; and A». The integral 
is to be taken over all values of 149,713,723, which form 
three sides of a triangle. 


17. The Third Virial Coefficient for the 
Lennard-Jones Potential 


Here again let us adopt the Lennard-Jones potential 
(15.1) for which the classical C was treated in Sec. 3.3 
Using 


R=rj, £=r;/R, N=7o3/R 


as integration variables, we can transform (16.8) into 


27° 1 m 3 
c= f f [J (A wtAis+A s) R"AR end 
0 “1-£ 0 


The integration with respect to R can be performed by 
the usual series-expansion technique. The result is as 


follows: 
2m 17d \4 
C=— =>) ¥y'T.(6,n), 


(17.1) 
3n RT\RT/ =o 


in which y is the same as in Sec. 15, and 


6i—4 1 pı 
i ii F .Endndé, 
n O Cress 


F= (6t+n—4) (6t—4) Kn, nyn) 6-8) In—2(6) — (6] 
—121(61—4)[(n,6)(n) 4-69 In1(6) 1 (6) 
+364 (t—1)[(6,6)(72) 4-64) /n(6) +2 (6) ], 


1 
T,(6,1) E ( 


TABLE XVI. Coefficients in the expansion (17.1) 
for L.-J. (6,9)-potential. 
ee 
t 1;(6,9) 
es eee 
—118.7 
+56.0 
12.61 
10.09 
7.32 
4.60 
2.59 
1.338 
0.651 
0.300 
0.133 
0.057 
ž 0.024 


_ ped pand 
NROOCONADNAUON=EO 


2 Numerical values of y: (9) and y: (12) in Table III were recal- 
culated with great accuracy by Epstein, Hibbert, Powers, and 
Roe, J. Chem. Phys. 22, 464 (1954), and by Rowlinson, Sumner, 
and Sutton, Trans. Faraday Soc. 50, 1 (1954), respectively. 


TABLE XVII. Functions in the expansions (15.2) and (17.2) 
for L.-J. (6,9)-potential. 


logis F(z) FQ)(z) F()(z) F@)(z) - G(z) , Garg 
0.1 1.583 1.194 —1.007 1.684 0.369 1.0 
0.2 —1.071 0.741 —0.444 0.564 0.409 0.37 
0.3 —0.6995 0.4793 —0.2056 0.2024 0.357 0.187- 
0.4 —0.4255 0.3200 —0.0992 0.0757 0.303 0.127 
0.5 —0.2218 0.2195  —0.0495 0.0295 0.264 0.100 
0.6 —0.0700 0.1538 —0.0255 0.0119 0.237 0.078 
0.7 0.0430 0.1095  —0.0136 0.0050 0.219 0.062 
0.8 0.1265 0.0791  —0.0073 0.0021 0.206 0.050 
0.9 0.1874 0.0578 —0.0040 0.0009 0.194 0.039 
1.0 0.2309 0.0426 -—G.0022 0.0004 0.183 0.030 
1.1 0.2610 0.0316 —0.0013 ‘ 0.172 0.020 
1.2 0.2806 0.0236 —0.0007, 0.160 0,015 
1.3 0.2922 0.0177 —0.0004 0.149 0.012 
1.4 0.2974 0.0134 0.137 0.010 
1.5 0.2979 0.0101 0.125- 0.008 
1.6 0.2946 0.0077 0.114 0.006 
1.7 0.2885 0.0058 0.103 0.004 
1.8 0.2805 0.0044 0.093 
1.9 0.2709 0.0034 0.083 
2.0 0.2603 0.0026 0.074 


where the bracket notation has been used in the sense 
(=I EHe, 


(Q= HET, 
(a,b)=(a+b+ 2) 
IETEN 
Ho eHre) — 2)/4 
HE er eHe e) (EH 1)/4, 


The values of 7:(6,9) obtained by numerical integra- 
tions are given in Table XVI. 

By use of C, the third virial coefficient, C, is given 
by (16.4) with (16.5) up to the term proportional to the 
first power of q. For n=9, Table XVII gives the func- 
tions G® and G® in the expansion 


5 1 
c= rri] G0 ()+=G0 (2)+-- | (17.2) 
18 cs 


z=Uo/kT, 1/cs=6g/Usre?, 
together with F® in the expansion (15.2) of the second 
virial coefficient. ; 

Figure 14 shows a comparison with experimental re- 
sults for Het. Here curves indicate calculated values 
based on the model parameters determined from the 
second virial coefficient. Although thè experimental re- 
sults do not seem to be very accurate, the comparison 
shows that the bpwl of' the Lennard-Jones potential 
with n=12 is not sufficiently wide (compare Sec. 3): 

It is to be noted that the potential energy of the 
three-body system has been assumed to equal the sum 
of the potential energies of pairs. This assumption of 
potential additivity is now capablé of discussion. 

e 
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18. Nonadditivity of the Intermolecular Potential 


The intermolecular potential of helium determined 


from the second virial coefficient has the bowl-width 
corresponding to 


g=1.5 for the square-well potential, (18.1a) 


n=12 for the Lennard-Jones potential. (18.1b) 


These values of parameters indicating relative bowl- 
width may belong also to other rare-gas molecules, Ne, 
A, Kr, and Xe. (This similarity of the intermolecular 
potential does not strictly hold of course: probably, the 
larger the molecule, the smaller the relative bowl-width. 
But such nonsimilarity does not influence the following 
conclusion.) 

Im Secs. 2, 3, and 17 the third virial coefficient was 
represented, under the assumption of additivity of the 
intermolecular potential, by models having a wider bowl 
with 


g~2.0 for the square-well potential, (18.2a) 


(18.2b) 


Furthermore, we saw in Sec. 6 that a potential with 
much wider bowl has to be adopted if we want to ex- 
plain the stability of cubic crystal structure for Ne, A, 
Kr, and Xe under the assumption of the potential 
additivity. 

These circumstances show that the intermolecular 
potential may not be considered to be strictly additive.” 
The potential with parameters (18.2) should be accepted 
as “the mean (or effective) additive intermolecular 
potential” in a three-body system. Therefore, the bowl 
of the mean additive intermolecular potential in a many- 


n~9 for the Lennard-Jones potential. 


Cx 1972 ae 


K o Oito 
© Michels - Wouters 


4 
e 
Z ° e 
TK 
ot ey a 
O o 100 150, «200-50 «300-250-400 


17 


fis i i i yes for the 
ird virial coefficient of helium with curves 
nah AA determined from the second gia gui: 
ient. Curve a is for (6,12)-počential, „classical; curve b. = ie 
*(6,9)-potential, classical; curve ¢ is for (6,9)-potential with the 
first quantum correction. 
z Tn fact, strictly additive is only the van der Waals-London 


i the second 
i d from the perturbation theory to 1 
saree a eerturbation aerie H ee pee 
ya = a La a ; . M. 
order, interactions between trip: Phy E £1509 (1943). 
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ee 


Fic. 15. The full line 
indicates qualitatively the 
intermolecular potential 
in two-body systems; the 
dotted line shows the mean 
additive intermolecular po- 
tential in three-body sys- 
tems. 


ji 
ee r 
JO 


body system is wider than that of the potential in the 
wo-body system, as shown in Fig. 15. 


VI. VIRIAL COEFFICIENTS BETWEEN UNLIKE 
MOLECULES 


Up to this point we have been investigating pure 
gases; the object of this part is to give a brief treatment 
of typical gaseous mixtures. The equation of state for 
binary mixtures of species A and B is of the form 


pv=kT[1+ (Basért+2Ba pbsEot Banks*)o 
+ (Casata tH 3C aarts En tH3C annta 
-HC eeri) t-e]. 


Here £4 and £g=1—€, are mole fractions of the species 
A and B, respectively. Baa and C444 are second and 
third virial coefficients for the molecule A; Bes and 
Ceen for the molecule B. Ban, Caan, and Cang are 
virial coefficients between unlike molecules, in which 
we are now interested. 


19. The Second Virial Coefficient between 
Helium Isotopes 


For mixtures of Het and He, which will be denoted 
by A and B, respectively, the intermolecular potentials 
of A to A, B to B, and A to B are all the same. Hence 
the second virial coefficients B44, Bgg, and Bap are 
different only as regards the reduced masses and the 
statistics. Let us confine our treatment above the Boyle 
temperature where the statistical effect is negligible.* 
The high-temperature expansion then becomes (com- 
pare (14.9)) 


Bas=BO+qs,BV+ qn 2BO+:-+, 
Bss=B®+qrsBV® +q} BOH- --, 
Bas=B®4qg4sBV+qa P BOH- --. 


Here gaa, dee, and qas are 44? divided by the reduced 
mass for pairs A—A, B— B, and A —B, respectively, so 
that 2¢48= ast qee. > a 

Figure 16 shows these three coefficients calculated by 
use of the Lennard-Jones potential with parameters 
determined in Sec. 15. ‘ i 

4 As regards the second virial coefficient of helium mixtures at 
low temperatures, see Cohen, Offerhaus, and de Boer, Physica 20, 


501 (1954); Kilpatrick, Keller, Hammel, and Metropolis, Phys. 
q ri P nys. 


Rev. 94, 1103 (1954); Kilpatrick, Keller, and 
Rev. 97, 9 (1955). : x : 


` 
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Fic. 16. Second virial coefficient between helium isotopes cal- 
culated for the Lennard-Jones potential. The circles are observed 
values for the pure He‘ gas. 


20. The Core-Model of Interaction between 
Unlike Molecules 


The core-model introduced in Sec. 9 can also be 
applied to the mixed virial coefficient Bag. The model 
is as follows: We assume an appropriate convex body 
called core inside each molecule and define the inter- 
molecular distance p as the shortest distance between 
two cores. The intermolecular potential Uag for the 
pair A—B is assumed to be a function of p only, i.e., 
Uaz= Uaslp) for which Uaz(0)= oe, 

By means of the core-model of molecules, the mixed 
second virial coefficient is given by 


oe —U.n(p) 
Bunz f tes tba) -++b42(0), 
p=0 kT 
where 
VatVs MBsSa+MaSg 
bas(p)= oF s 
: 2 8m 


Ma = 2rp+Moa, 
Sa=Tp’?+MoaptSoa, 


T 1 1 
Va=—p°+—Moap?+-Soapt+Voa 5 
OREA. 2 


and similar relations as regards the species B. Here 
Voa, Soa, and Moa are, respectively, the volume, the 
surface area, and the mean curvature integrated over 
the whole surface of the convex core of the molecule A. 
The function 44 (e) may also be expressed in the form 
[compare (9.2) ] 


2m Most+Mos 
ban(p) O al 


2 
4 Soat Sos MosMor 
o + ———— }p 
2 4r 
Voat+Vog MosSoa+Mo4Sor 
. aoo : 
kz 2 8T 


In particular, when the potential U4g(p) is the 
Lennard-Jones function, 


poaB\ poss ô 
Uaso) =U ( ) -2( ) I 
P P 


the second virial coefficient Baz is expressed by a power 
series with respect to s 


(zan)t= (Uoas/RT)', 


that is 
Moa +Mog 


POA pls (gar) 


e 


e 


2r 
Bag= pua gF (za B) AF 


SoatSon MoaMog 
pe ai 
2 4r 
VoatVos MosSoa+Mo4Sog 
AF T 
2 8r 
where (s=1,2,3) e 


Sol /6i—s 
F,(2)=—— >> —2(——) 2, 
12 = t! 12 


Joos Bly (Za B) 


? 


the table of which is given in Sec. 9. 
As regards model constants, the relations 


2p0aB=Ppoaatposs, 
UOoaw’=UoaaU oss a E 


have been verified to hold approximately. An example 
is shown in Fig. 17, in which curves were calculated by 


20 


: argon 


B hydrogen 


T IR 


-40| N 
o 100 200 300 400 500 600 8 
Fic. 17. Second virial coefficient between argon and hydrogen 
with curves for the core-model. 9 


* T. Kihara and S. Koba, J. Phys. Soc. Japan 9, 688 (1954); 


3 Uo D Be Beattie and W. H. Soane J. Chem. Phys. 10, 
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3e Jati s 
use of these relations and by use of model constants 


determined’in Sec. 9 from the virial coefficients of pure 
gases. 


21. The Third Virial Coefficient for 
Binary Mixtures 


The third virial coefficient for unlike molecules can 


easily be evaluated only in the case of the square-well 
potential*® 


«© for r<a;; 
U;(r)= E for Gij LY <pij 
' 0 for pj <r, 


where, in the case óf a binary mixture, 7=A or B, 
jJ=A or B, and k=4 or B. 
Thè result is a polynomial on the third order in 


Xij= exp (e,;/RkT)—1 
that is 
3C n= IO — [ep @ ra DeD] 
+ Lael OY age let xy e] 
— Xijkirtjl O. 
Here the coefficients J are given by 
IO=W (Tij, Titit), 
ID= W (pinsin) -I, 
ra» =W (cj pinon) -I ®, 
10D = W (oij irnp) IO, 
[e= W (053,014,034) IOI — 102, 
ICD=W (pio p) IO 109—103, 
[e3 = W (633,piz,03.) —L© — TEDT 70.8), 
TO= W (pinpin pa 1 — de a 


36 T, Kihara, see reference 2. 


» 
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Fic. 18. Third virial coefficient for argon-neon mixtures 
calculated for the square-well potential. 


by means of the same function W (a,b,c) defined in 
Seca: 

As regards model parameters, it may be reasonable 
to assume paa=2044, pAn=2742, PBEE= 2O RRB, 


7 
2oaB=04A FOBB, EAB = EAAEBB 


(compare Sec. 2 and Sec. 20). Figure 18 shows C444, 
Caan, Cann, and C gge calculated on these assumptions 
in case A and B are argon and neon, respectively. The 
model constants here used are 

gaa=3.11 A, €414/k=54.7°K, 

opp=2.51 A, esn/k=16.4°K, 
which were determined from the second virial coeffi- 
cients. 
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INTRODUCTION 


DVANCES in the technique of proportional and 

scintillation spectrometry have extended our 
knowledge of orbital electron-capture. In a study of 
comparative half-lives (ft values), Major and Bieden- 
harn? have summarized the data existing up to the 
middle of 1954. 

The theoretical work of Marshak? and Rose and 
Jackson? has been extended by Brysk and Rose‘ in the 
light of present knowledge of beta-decay theory of for- 
bidden transitions, with particular reference to capture 
of Z-shell electrons. 

In the interpretation of radiative electron-capture 
(inner bremsstrahlung) spectra, the capture of p-elec- 
trons appears to be significant.®® 

This review comprises a summary and analysis of the 
existing data on electron-capturing nuclides (up to 
May, 1955) whose decay schemes are relatively simple 
and well established. In particular, the primary concern 
is with the ratio of Z-capture to K-capture both as a 
test of the theory of Marshak and of Brysk and Rose 
and as an application of the theory to the determination 
of transition energies in electron-capture and of L- 
fluorescence yields. 


EXPERIMENTAL TECHNIQUES 


Essentially three techniques have been applied to the 
determination of x-ray intensity and capture ratios. 
These are described below. 


(A) Internal Source Spectrometry 


The radioactive material is contained within the sen- 
sitive volume of the detector. Gas proportional counters 
have been used for the study of A%7,7 Kr”, and Ge?! 9 
(in the form of germane, GeH,). In this method the 


* Supported in part by the U. S. Atomic Energy Commission. 
t Department of Physics. 
} Department of Chemistry. 
1J. K. Major and L. C. Biedenharn, Revs. Modern Phys. 26, 
321 (1954). 
2 R. £. Marshak, Phys. Rev. 61, 431 (1942). 
3M. E. Rose and J. L. Jackson, Phys. Rev. 76, 1540 (1949). 
4H. Brysk and M. E. Rose, Oak Ridge National Laboratory 
REDO ORNL-1830 (January 13, 1955), and errata (unpub- 
6 R. J. Glauber and P. C. Martin, Phys. Rev. 95, 572 (1954). 
¢ R. W. Fink and B. L. Robinson, Phys. Rev. 98, 1293 (1955). 
7 Pontecorvo, Kirkwood, and Hanna, Phys. Rev. 74, 982 (1949). 
8P. Radvanyi, Compt. rend. 235, 428 (1952); M. Langevin 
and P. Radvanyi, Compt. rend. 238, 77 (1954). 
9M. Langevin, Compt. rend. 239, 1625 (1954). 


prompt cascade of x-rays and Auger electrons, which 
follows each K-capture event, is integrated by the de- 
tector to give a single “K-line” in the pulse-he:ght 
spectrum. An “Z-line” arises from Z-captures and from 
K-captures which are followed by escape uf the K x-rays 
from the sensitive volume. The correction for K x-ray 
escape may be made small by suitable choice of detector 
material and size, and it can be calculated quite accu- 
rately from x-ray absorption data and K-fluorescence 
yields." Since Z x-rays and L-Auger electrons are totally 
absorbed, no correction need be made for Z-fluorescence 
yield. 

Scintillation crystals have been grown" containing 
radioactive I? and Cd’. In this type of experiment 
the amount of Z-capture is obtained from the difference 
between the number of K x-rays and the total number 
of gamma-ray transitions on the assumption that one 
gamma ray accompanies each decay. 


(B) External Source Spectrometry 


The radioactive substance is placed cutside of the 
sensitive volume, and corrections must be applied for 
source self-absorption and self-scattering, for differen- 
tial air and window absorption, and for K and Z fluores- 
cence yields. One must also consider that a K-shell 
hole may be filled by an Z-electron either by radiative 
transition (Ka=K— Lry, Lu) or by Auger transition 
(K—LL, K—LX). The number of L-shell vacancies 
produced in this manner, xz, ranges from 1.36 at Z=29 
to 0.75 at Z=90, as will be discussed below. 

In case electron capture is followed by gamma emis- 
sion, the conversion of the gamma rays must be taken 
into account. 

Many electron-capturing nuclides of the heaviest 
elements have been investigated.!23 The intensities of 
the Z x-rays have been measured carefully by use of 
proportional counters and a bent-crystal x-ray spec- 
trometer.’? However, most of these nuclides have low- 
energy gamma transitions which are highly converted 
in the L-shell so that the interpretation of x-ray inten- 
sities in terms of electron-capture ratios is tenuous. No 
attempt has been made to correlate these data with 
theory in this pape. i 

2 Broyles, Thomas, and Haynes, Phys. Rev. 89, 715 (1953). 

1 E. der Mateosian, Phys. Rev. 92, 938 (1953). 

2R. W. Hoff, University of California Radiation Laboratory 
Report UCRL-2325 (1953) (unpublished thesis). 


*H. Jaffe, University of California Radiation Laboratory Re- 
port, UCRL-2537 (1954) (unpublished thesis). 
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(C) Absorption Experiments 


Some absorption measurements of relative L and K 
x-Tay Intensities have been reported by Wilkinson and 
Hicks!‘ and Chu.!® These measurements have not been 
considered in this work. Most of the nuclides involved 
have hard gamma-rays and conversion electrons; their 
decay schemes and conversion coefficients are unknown 
and there is always the possibility of low-energy (and 
highly converted) gamma transitions unresolved from 
the x-rays. 


CALCULATED P,/Px RATIOS 


Marshak? has’ shown that L-capture always accom- 
panies K-capture and occurs with increasing probability 
as the transition energy becomes small and Z large. 
For allowed transitions the ratio of capture proba- 
bilities is given by 


JÈN (gue 
P1/Pr= (=) (=) Etaten (1) 
qe} A grè 


where qy is the neutrino energy for J-capture; gy and 
fz are the “large” and ‘‘small” components of the Dirac 
radial wave function in a Coulomb field, respectively. 
Marshak evaluated the appropriate ratios using rela- 
tivistic wave-functions with Slater screening. He also 
indicated that for forbidden transitions the capture of 
electrons of 7>4 might compete favorably under certain 
conditions. 

Rose and Jackson? have computed wave-function 
ratios using self-consistent (Hartree) wave functions 
and relativistic wave functions for a Thomas-Fermi 
atom with exchange. ‘The ratio of (g?z;/gx) is given 
graphically as a function of atomic number. 

Brysk and Rose! have extended the previous work in 
the light of present knowledge of beta-decay theory of 
forbidden transitions. For the first-forbidden unique 
transition (| AJ | =2, yes), the expression for the capture 
ratio is particularly simple. They have also computed 
the values of the Dirac radial wave functions at the 
nuclear radius, and have given wave function ratios 
using relativistic wave functions corrected for the finite 
size of the nucleus, variations in electron wave functions 
over the nuclear volume, and screening. The results 
obtained for Lı/K capture are similar to those of 
Rose and Jackson. Using the wave functions and 
wave-function ratios given graphically by Brysk and 
Rose, one can compute L and Li capture prob- 
abilities. - 

Capture ratios have been computed from the above 
theoretical results for those. cases where the decay 
scheme is relatively simple and the transition energy 
known. Table I contains these computed ratios as well 

G. Hicks, Phys. Rev. 75, 696, 1370 

2 77 314 (1950) (Re); 79, 815 (1950) (Tb, Ho); 
gi) Pal iat OS) a Ati Pe 
Rev. 75, 1019 (1949) (Bt, Au); 80, 493 Coe) Chas Ni 
“iT, C, Chu, Phys. Rev. 72, 582 (1950) Us) 
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as experimental values for half-life, energy of the elec- 
tron-capture transition, and branching ratio. Also tabu- 
lated is the nature of the transition according to the 
shell model, as given by King,'* and the logarithm of 
the comparative half-life (log ft). The latter is either 
taken from Major and Biedenharn' or calculated by 
their method or from the nomogram of Moszkowski.!7 

The relative probability of Zi capture has been cal- 
culated for the four cases where the spin change is 
believed to be greater than unity. The electron-capture 
transitions of K®, Ca, and. Tl involve changes of 
parity as well as changes of angular momentum of two 
units (|AJ| =2, yes), corresponding to first-forbidden 
unique type of beta decay. The shell model indicates 
that Ni® undergoes a second-forbidden electron-capture 
transition (| AJ| =2, no); capture ratios were computed 
for mixtures of the Ay and Ti; matrix elements (for the 
tensor interaction) and found to be appropriate for 
second-forbidden beta decay having |A/| =2, no. It 
appears that a sensitive experimental determination of 
the capture ratio of Ni” might allow an estimate to be 
made of the ratio A,;/T;; for this transition. 


COMPARISON OF THEORY WITH EXPERIMENT 


In a few cases direct measurements have been made 
of the P:/Px capture ratio where the transition energy 
is known. These are given in Table IL (a) along with the 
computed values transposed from Table I. In several 
other cases the energy is deduced from the measured 
capture ratio as shown in Table II (b). The experi- 
mental results have been recalculated by the present 
authors for Ge7 and Kr” using K-fluorescence yields 
given by Broyles, Thomas, and Haynes! instead of 
those of Burhop" used in the original reports. 

The theory is in relatively good agreement with the 
experiments on A” and Kr”, and in the case of K” the 
agreement is not within the assigned error, because the 
experimental errors in the values used in the closed 
cycle calculation (see below) might lie mostly in one 
direction. > 

The energies deduced in Table II (b) lead to com- 
parative half-lives (ff values) consistent with allowed 
transitions for Cd, I, and Ba™, and with first- 
forbidden unique transitions for K“ (compare Ca” and 
TP“ in Table I). In the case of 9.5-year Ba! the decay 
scheme is complex and the contribution of internal con- 
version uncertain. Langevin™ syggests that the decay 
might proceed entirely by L-capture which would oper- 
ate to reduce the comparative half-life bringing it into 
better agreement. a $ 

The large discrepancies indicated in Table II (a) for 
Pd and Ge” may not be real; in the case’ of Pd, 

16 R, W. King, Revs. Modern Phys. 26, 327 (1954). = 

17S, A. Moszkowski, Phys. Rev. 82, 35 (1951). $ 

18 E, J. Konopinski and L. M. Langer, Ann. Revs. Nuclear Sci. 
2 (Annual Reviews, Inc., Stanford, California, 1953), p. 300. 

OE. H. Š. Burhop, The Auger Effect (Cambridge, University 
Press, Cambridge, England, 1954). < 

» M. Langevin, Compt. rend. 240, 289 (1955). 
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because of the contribution made 
vee the 40-key isomeric state in the daughter; 
ape ae case of Gel, because of reported isomerism 

e electron-capture transition energy used in com- 


by the internal con- 


puting the Pz/Px ratio in Table I for K” was deduced 


* M. Langevin, Compt. rend. 238, 2518 (1954). 
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from a closed cycle” consisting of the beta-decay energ 

of K”, 1.325-£0.015 Mev,” the Ca®— A” mass differ- 

ence, 0.199-+-0.015 mmu=0.185-0.014 Mev,** and the 

energy of the first excited state of A”, 1.459-£0.007 
2 P, Morrison, Phys. Rev. 82, 209 (1951). 


2, Feldman and C. S. Wu, Phys. Rev. 87, 1091 (1952). 
“W, H. Johnson, Jr., Phys. Rev. 88, 1213 (1952). 


a Column ”2 is an estimate of the relative reliabili f 
experimental information; ENE, A ied ees 
sea blister, while those in the C category remain tebe conecnied) 
elo on follows that of Major and Biedenharn (see 

Column 4 gives the energy of the transition indicated i 

Rio sy OF cated in column 
5 aml the method by which it was determined. The following 
peer 2 ased: (p,n), reaction threshold measurement; 

, positron decay energy; IB, inner bremsstrahlung endpoint 
plus K-electron binding energy; cc, closed cycle calculation, kieh 
e aande hoe o japas particles, beta decay, or gamma 

S, , In the case of K*, mass spectrometer data. 

Column 5 lists the type of ‘transition involved and the branching 
raio phe ence ans: g and e refer to ground-state and first- 
excited state transitions, respectively (ei s ith excited state). 

Column 7 gives the thes etal y emai for L Sa 
and the total capture ratio. P,,,/PK is given only for cases 
where |AJ| >2 (see text). 

b References for Tables I through IV. 

Al. Anderson, Wheeler, and Watson, Phys. Rev. 90, 606 (1953). 

a K eD; J. phys. radium 14, 637 (1953). 

A3. Axel, Fox, and Parker, Phys. Rev. 97, 975 (1955). 

Aa. RP. Avignon, J. phys. radium 14, 636 (1953). 

B1.. fe Backofen and R. H. Herber, Phys. Rev. 97, 743 
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B2. I. Bergstrom, Phys. Rev. 82, 112 (1951). 
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TABLE IT (a). Experimental determinations of orbital electron capture ratio for cases where transition energy is known. 
¥ Wile 
Zz A cil ta Ref. (Table 1) Remarks 
18 A 37 0.08 to 0.09 PS 0.082 | 
19K 40 1.350.23 H4 +0.33 See Table II (b). 
0.21 0.06 i 
32 Ge 71 0.19=0.03 L3 0.106 Experimental results recalc. using wx =0.50.> on 
Decay scheme in question (L2). Í 
36 Kr 79 0.096-0.030 R2, L1 0.101 Experimental results recalc. using wg =0.63.» 
46 Pd 103 0.793-0.15 A2 0.120 Conversion of the 40 kev IT daughter in 


question. i 


a See list of references in footnote b of Table I. 


TABLE II (b). Experimental determinations of orbital electron capture ratios from which transition energy is ċalculated. 


b See reference 10 in text. 


) 


Y 
| 
i Calculated | 
Experimental Energy Log | 
Z A Half-life Ref. PL/Px Ref, Transition (Mev) ft Remarks 
19K 40 13X10°yr Hi 1.350.023 H4 e(4— to 2—) 0.019=+0.002 10.3 ao with (|AJ|=2 L 
yes 
48 Cd 109 470 days G4 0.28-+0.03 M5 e(5/2,1/2— gın) 0 0620-009 4.1 See Table ITI 
“~< —0.005 
53 I 125 60.0 days F4 0.23+0.03 M5 e(ds— dsn) 01001 020 4.93 
56 Ba 133 ~9.5 yr H1 >9 L4 els;—s;) <0.040 5.6 Decay scheme complex and 
controversial (L4, H6). 
a See list of references in footnote b of Table I. da 
b 
Mey.” For this energy, 51422 Kev, the capture ratio cific activity for electron-capture (gamma rays). The 


for second-forbidden unique (|AJ|=2, yes) transition 


0.3 
lies between 0.54 and 0.15, (pa ee Heintze?’ 


has measured the specific activity of potassium for the 
emission of argon K x-rays; after correcting for the 
K-fluorescence yield of argon, he finds that the specific 
activity for K-capture (K x-rays) is less than the spe- 


ATOMIC NUMBER (Z) 


Fic. 1. wxn=xi+axx. The number of L-shell vacancies pro- 
g duced in the filling of a K-shell vacancy. 


25 M. L. Good, Phys. Rev. 81, 891 (1951). 
20 J. Heintze, Z. Naturforsch. 9a, 469 (1954). 
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excess of gamma rays is attributed to'Z-capturẹ giving 
an experimental ratio of Pz/Px=1.35+0.23. From 
this one may deduce that the transition energy is 
19-2 kev. As mentioned previously, the comparative 
half-life for this energy is very similar to those for 
similar transitions in other nuclides. 


X-RAY INTENSITY RATIOS AND MEAN 
LI-FLUORESCENCE YIELDS 


The relative intensities of K and L x-rays have been 
measured for several electron-capturers for which the 
capture ratio is known, either from experiment or by 
computation from the transition energy. The x-ray 
intensity ratio Iz/Ix is related to the capture ratio 
P1/Px by the expression 


IP OL 
Tt/Ix= (Girne) (2) 


K Wk 


where wx = K-fluorescence yield; ©;=mean L-fluores- 
cence yield,“ and #gz=number of L-shell vacancies 


* For completeness, because of common usage, the following 
quantities are also defined: wun wzm wrm = fluorescence yields 
for the Zr, Lr, Linx subshells, respectively. (The mean L-fluores- 
cence yield defined above is an average of these partial yields 
weighted in an essentially unknown manner. In general, these 
subshell yields differ from one another.) wzg = fluorescence yield | 
from secondary T-vacancies (i.e., arising from K-vacancies). I 


eru =fluorescence yield from primary Z-vacancies (i.e., arising is 
a 


Paad processes, L-capture or intërnal conversion in the 
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/K-CAPTURE RATIOS IN ELECTRON-CAPTURE 


TABLE III. Experimental determinations of x- 
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ray intensities in electron capture from which mear L-fluorescence yields are calculated. 


Z -A ’ 
I1/1k Ref.» P1/Pr Ref, "gL vg oya Remarks 
48 €d 109 0.0505- - ; = —-— 
50.0003 B3 0.28-0.03 M5 0.93 0.83 0.029-+0.003 Corrected for internal 
e eee conversion of 87 kev 
$1 TI 204 ae F3 0.160 TableI 0.89 0.87 —(0.10-£0.01 Te 
ane » 0-330.05 J2 0.524 Table I 0.78 0.95 0.24-0.05 
0.30-0.05 See reference 29 
0.38-0.49 See reference 29a 


a See list of references in footnote b of Table I. 


“ 


TABLE IV. &xperi inati i i iosi 
A cage ee inana of x-ray intensity ratios in electron capture with capture ratios calculated using 
estimated a, (Fig. 2) and transition energies deduced from capture ratios. 


Half- 
life ; 
Z A diy) Ref) | Transition In/Ix Ref, ap a nee P1/Pr (ea) hi aes 
uW 140 W2 g(4,3/2—gra) 03940.01 B5 093 0.224005 081 0.834035 ojt? 64 
DORS 6 
r 0.15 1.54 0.092 6.4 See B5 
7605s!85 97 Ki  e(4~ato dsn) 0.40+0.05 M10 0.94 0.23+0.05 0.80 0.8-0.4 <0.07 5.8 b 
pA 180 S8  e(3/2— fsz) 0.408+0.016 B4 0.95 0.73 0.5 
2 2 ; . y ; .58-£+0.14 +28 ¢ 
0.14855 69 
aL 12? 12.5 M3 e(2—to2+) 0.39 HS 0.95 0.25+0.05 0.78 0.704010 6.15%0.01 5.33 £ 


2 See list of references in footnote b of Table I. » Transition to upper state is pure L-capture. ¢ Calculated for allowed transition. 


produced in the filling of a K-shell vacancy, 
é (=) 2(K—LL)+(K—LX) 
NEL= OK tax 


> Augers 


puted and also entered in Table ILI. The value of a, 
found by Damon and Edwards” for the conversion of 
the 47.5-Kev gamma ray of RaD(Pb”") is also entered.” 


K 


(this sum might be broken down further into subshell 
components); [x./Jx=intensity ratio of Ka x-rays to 
total K x-rays; ax=K Auger yield (1—wx); and 
(K—LX)=partial Auger yields; X denotes M, N, etc. 
shell electrons. (K—ZX) is the probability that a K- 
shell vacancy is filled by an Z-shell electron with the 
excess energy carried off by an X-shell electron. 

The ratio (IKa/IK) has been computed from data 
given in Compton and Allison,” K-fluorescence yields 
are given by Broyles, Thomas, and Haynes,” and par- ok 
tial Auger yields have been computed from the litera- f 
ture28 The values of 7x1 computed for Z=29, 32, 47, o 
49, 56, 78, 80, 83, and 84 are plotted in Fig. 1 and a 


04 


60 Q 
ATOMIG NUMBER (Z} 


smooth curve has been drawn. Appropriate values of 
ngu are entered in Table MI together with observed 
intensity ratios, capture ratios, and K-fluorescenc- 
yields. Mean L-fluorescence yields z have been come 


27 A. H. Compton and S. K. Allison, X-Rays in Theory and 
Experiment (D. Van Nostrand Company, Inc., New York, 1935), 
edition. $ 
E 1%, Perkins and S. K. Hayttes, Phys. Rev. 92, 687 (1953) 
(Gu). M. Ference, ‘Phys. Rev. 51, 727 (1937) (Ge). Huber, 
Humbel, Schneider, and de Shalit, Helv. Phys. Acta. 25, 3 (1952) 
ae Cd). Broyles, Thomas, and Haynes, Phys. Rev. 89, 715 


z. Ph 
1953) (In; Ba, Hg). Steffen, Huber, and Humbel, Helv. Phys. 
het 2, 167 (1949) (Pt). C. D. Ellis, Proc. Roy. Soc. (London) 
' A139, 336 (1933) Gi R. W. Hoff, University of Gok tomia 
Radiation Laboratory Report, UCRL-2325 (1953), unpu 


thesis, (Po). 
É 
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Fic. 2. Mean L-fluorescence yields. The full circles are values 
obtained by H. Lay, Z. Phys. 91, 533 (1934), using a photographic 
technique and x-ray excitation. Points a-d have been deduced 
from nuclear data as follows: e Cd’EC—Bertolini, Bisi, Lazza- 
rini, and Zappa, Nuovo cimento 11(9), 539 (1954). b Cs! EC— 
R. W. Fink and B. L. Robinson, Phys. Rev. 98, 1293 (1955). 
c T” EC—H. Jaffe, UCRL-2537 (1954) (unpublished)..d Pb% 
(RaD)—Internal Conversion. P. R. Damon and R. R. Edwards, 
Phys. Rev. 95, 1698 (1954). (See reference 29a.) 


2 P. E. Damon and R. R. Edwards, Phys. Rev. 95, 1698 (1954). 

2a Note added in proof.—Ross, Cochran, Hughes, and Kgather 
[Proc. Phys. Soc. (London) A68, 612 (1955)}, have i 
critically all existing experimental data on the fluorescence yields 
of the L-levels of bismuth excited by internal conversion (RaD) 
and by soft x-rays. They conclude that for bismuth the value of 
&, “probably lies between 0.38 and 0.49.” They alse have shown 
that most of the transitions from the Lr supshell (for which the 


- 
. 
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In Fig. 2 the mean Z-fluorescence yields are plotted 
against Z. The results of Lay” obtained by a photo- 
graphic method for fluorescent excitation are also shown. 
(A break is indicated at Z~73; beyond this point 
Coster-Krénig transitions contribute. A Coster-Krénig 
transition is an Auger transition of the type (Zi— 
IynMiy,v) which shifts Lı vacancies to the Lyr shell 
for which the partial fluorescence yield is greater.) 

If a systematic error in the work of Lay is excluded, 
Fig. 2 implies that the weighting of the subshell contri- 
butions to the mean L-flvorescence yield is fundamen- 
tally different for different methods of excitation, and 
that æz lies lower for nuclear excitation than for x-ray 
excitation (except see reference 29a). 


SOME TRANSITION ENERGIES 


The intensities of the L and K x-rays have been 
reported for several electron-capturing nuclides. These 
are given in Table IV together with K-fluorescence 
yields, #xzz, and mean Z-fluorescence yields estimated 
from Fig. 2. Capture ratios, transition energies, and 
comparative half-lives have been computed for these 
cases and entered in the table. The four cases are dis- 
cussed individually below. 


(a) The nuclide W18! has been reported* to exhibit 
pure electron-capture with no gamma rays; from x-ray 
intensity measurements the capture energy is inter- 
preted to be 92 kev (corresponding to &z=0.15). From 
Fig. 2 one estimates ©,=0.22+-0.05 leading to a cap- 
ture ratio of 0.83, and an energy of 110 kev. The pre- 
viously reported weak gamma rays? at 136.5 and 
152.5 kev are believed to be the result of impurities. 

(b) The K and L x-rays of Os!*5 have been observed 
by Miller and Wilkinson. Two gamma rays at 648 and 
878 kev are associated with the decay and were found 
not to be in coincidence. Weak conversion electrons 
corresponding to the “stopover” transition have been 
reported,“ but the gamma ray has not been observed. 
On the assumption of little or no “‘stopover” transition, 
it is impossible to satisfy both the observed gamma-ray 
intensity ratio and the x-ray intensity ratio, Iz/Ix, 
except by assuming that the transition to the second 
excited state is pure L-capture, thus se ing an upper 
limit of about 70 kev for the energy of this transition. 

(c) By means of a subtle interpretation of the pulse- 
height spectrum of Au” in a proportional counter, 


fluorescence yield is small) are of the Coster-Kronig type, and 
that most of the Z-fluorescence x-rays arise from the Lun sub- 
shell. Consequently, in this region @;~ary41, regardless of the 
primary excitation. The value for Tl (point c, Fig. 2) was 
obtained by neglecting Coster-Kronig transitions and it is prob- 
ably much too low. On the other hand, Coster-Kronig transitions 
do not occur for Z<73 and the values for points a and b are 
undombtedly correct. 
2 H. Lay, Z. Physik 91, 533 (1934). 
3 Bisi, Terrani, and Zappa, Nuovo cimento 1(10), 651 (1955). 
= Cork, Nester, Le Blanc, and Brice, Phys. Rev. 92, 119 (1953). 
3M. M. Miller and R. G. Wilkinson, Phys. Rev. 83, 1050 
1951). 3 
4 J. B. Swan and R. D. Hill, Phys. Rev. 88, 831 (1952). 


AND R. W. FINK 

Bisi and Zappa® concluded that the capture ratio for 
transitions to the second excited state is 0.58, from 
which a transition energy of 148 Kev is deduced. 

(d) In connection with a study of Pb’, Huizenga 
and Stevens’? have reported for Tl” an x-ray intensity 
ratio Ix/I,=2.6. Neglecting the weak conversion of 
the 435-kev gamma ray (about 2.5%%7), and the possi- 
bility of ground-state transitions, one deduces a capture 
energy of about 150 Kev for this nuclide. 


SUMMARY AND CONCLUSIONS 


The theory of orbital capture appears to be in rela- 
tively good agreement with experiment in those ‘cases 
where there is no question about the decay scheme 
(2 cases). The theory has been applied to the determi- 
nation of: (1) transition energies where P;/Px is known 
(4 cases); (2) mean L-fluorescence yields where both 
the transition energy and x-ray intensities are known 
(3 cases); and (3) transition energies where x-ray inten- 
sities are known and mean L-fluorescence yields esti- 
mated (4 cases). Theoretical capture ratios have been 
computed and tabulated for nuclides whose transition 
energies are known and decay schemes simple (28 cases). 

It is clear from the lack of good experimental cases 
that more refined orbital capture data are required. 
In this respect the internal source technique should be 
more widely applied because the interpretation does not 
require a knowledge of L-fluorescence yields. 

It is indicated that mean L-fluorescence yields de- 
pend upon the method ôf €xcitation; various modes of 
excitation result in different relative populations of 
L-shell vacancies. The relative contribution of the L- 
subshells to the mean Z-fluorescence yields can be de- 
termined only by studying the Z x-ray fine structure 
with instruments of sufficiently high resolution (e.g., 
bent crystal spectrometer). 

Note added in proof-—Recently there has been much 
discussion’? concerning the charge distribution on the 
CI’ recoil ions from electron capture in A”, and the 
relation to the L/K capture ratio via theoretical Auger 
transition probabilities. The charge distributions ob- 
served by Kofoed-Hansen and Snell and Pleasonton 
do not appear to be consistent with the 8 to 9% L/K 
capture ratio in A%? (reference 7). 

We feel that the experimental results are probably 
valid and that the difficulty lies in the use of an in- 
adequate theory of Auger transition probabilities. It 
has been suggested by Daudel [J. phys. radium 16, 515 
(1955) ] that the neglect in the present Auger theory of 
the correlations among the positions of the electrons 
may lead to large errors. 

5 A. Bisi and L. Zappa, Nuvvo cimento 12, 539 (1954). 

J. R. Huizenga anà C. M. Stevens, Phys. Rev. 96, 548 (1954). 


O Goertzel, Spinrad, Harr, and Strong, Phys. Rev. 83, 79 


2O. Kofoed-Hansen, Phys. Rev. 96, 1045 (1954); A. H. Snell 


and F. Pleasonton, Phys. Rev. 97, 246 (1955); R. A. Rubenstein - 


and J. N. Snyder, Phys. Rev. 99, 189 (1955); P. Radvanyi, J. 
Oo deena 16, 509 (1955); and A. Winther, J. phys. radium 16, 
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Brownian motion (continued) 


Brownian motion, dynamical friction, and stellar dynamics, 
S. Chandrasekhar—21, 383 (1949) 

Theory of irreversible processes, R. T. Cox—24, 312 
(1952) 


Catalysis 


Challenge to the physicist, Otto Beeck—17, 61 (1945) 

Surface catalysis, Otto Beeck—20, 127 (1948) 
Centrifuge 

Ultracentrifuge, J. W. Beams—10, 245 (1938) 

Collision phenomena, theory and experiment 

Quantum mechanics of collision processes, E. U. Condon 
and P. M. Morse—3, 43 (1931); Part II, Philip M. 
Morse—4, 577 (1932) 

Products and processes by low speed electrons, H. D. 
Smyth—3, 347 (1931) 

Quantitative study, collisions of electrons with atoms, 
Robert B. Brode—S, 257 (1933) 

Slowing down of neutrons by elastic collision, Robert E. 
Marshak—19, 185 (1947) 

Neutron cross sections of the elements, H. H. Goldsmith, 
H. W. Ibser, and B. T. Feld—19, 259 (1947) ; Robert 
K. Adair—22, 249 (1950) 

Phase shifts and scattering potential, V. Bargmann—21, 
488 (1949) 

Low-energy proton-proton scattering, David J. Jackson 
and John M. Blatt—22, 77 (1950) 

Topics in scattering theory, G. Breit—23, 238 (1951) 

Measuring fast neutron cross sections, H. H. Barschall— 
24, 120 (1952) 

Multiple meson production in nucleon-nucleon collisions, 
H. W. Lewis—24, 241 (1952) 

Angular distribution of scattering and reaction cross sec- 
tions, John M. Blatt and L. C. Biedenharn—24, 258 
(1952) 

Convergence of Born expansions, Walter Kohn—26, 292 
(1954) ; Erratum—26, 472 (1954) 

Statistical theory of multiple production, Richard H. Mil- 
burn—27, 1 (1955) 

Compton effect 

Corpuscular properties of light, Arthur H. Compton—1 
74 (1929) 

Conservation theorems 

Hamilton’s principle and conservation theorems, E. L. 
Hill—23, 253 (1951) 

Constants, standards, units 

Probable values of general physical constants, Raymond T. 
Birge—1, 1 (1929) ; Errata—1, 241 (1929); New table 
of general physical constants August, 1941, Raymond T. 
Birge—13, 233 (1941) 

Radioactive constants as of 1930, M. Curie, A. Debierne, 
A. S. Eve, H. Geiger, O. Hahn, S. C. Lind, St. Meyer, 
E. Rutherford, and E. Schweidler—3, 427 (1931) 

Atomic constants, a revaluation, Frank G. Dunnington—11, 
65 (1939) e 

Constants F; N, m, and, h in 1947, Jesse W. M. DuMond 
and E. Richard Cohen—20, 82 (1948) 

Least-squares adjustment of atomic constants, 1952, Jesse 
W. M. DuMond and E. Richard Cohen—25, 691 (1953) 

Criterion of least squares, E. Richard Cohen—25, 709 

k (1953) 

Analysis of “variance of 1952 data on atomic constants, 
E, Richard Cohen, Jesse W. M. DuMond, Thomas W. 
Layton, and John S. Rollett—27, 363 (1955) 

Cosmic rays, apparatus 

Counter gases, Robert B. Brode—11, 222 (1939) 

Tracks of nuclear- particles in photographic emulsions, 
Maurice M. Shapiro—13, 58 (1941) 

Proportional counters, S. A. Korfi—14, 1 (1942) 
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Cloud chamber and its application in physics, N. N. Das 
Gupta and S. K. Ghosh—18, 225 (1946) 

Telescope for high altitudes, A. T. Biehl, R. A. Mont- 
gomery, H. V. Neher, W. H. Pickering, and W. C. 
Roesch—20, 353 (1948) 


Cosmic rays, directional effects 


Intensity and geomagnetic effects, Thomas H. Johnson—_ 
10, 193 (1938) 

Fine structure pattern, E. J. Schremp and H. S. Ribner— 
11, 149 (1939) c 

Latitude effect at high altitudes, A. T. Biehl, R. A. Mont- 
gomery, H. V. Neher, W. H. Pickering, and W. C. 
Roesch—20, 360 (1948) 


Cosmic rays, general (see also Mesons) 


Foreword, Symposium on cosmic rays, Arthur H. Comp- 
ton—11, 122 (1939) 

Nature of particles, Seth H. Neddermeyer and Carl D. 
Anderson—11, 191 (1939) © r 

Protons as primary particles of hard component, Thomas 
H. Johnson—11, 208 (1939) : 

Studies at high elevations, S. A. Korff—11, 211 (1939) ; 
Discussion, J. C. Stearns and D. K. Froman—11, 220 
(1939) 

Specific ionization, Robert B. Brode—11, 222 (1939) 

Nature of rays below ground, Volney C. Wilson—11, 230 
(1939) ; Discussion, D. K. Feoman and J. C. Stearns— 

11, 231 (1939) 

Analysis of rays underground, Pierre Auger and Thérése 
Grivet—11, 232 (1939) 

Energy distribution and positive excess of mesotrons, 
Haydn Jones—11, 235 (1939) 3 

High-energy electrons, C. B. Montgomery and D. D. 
Montgomery—11, 255 (1939) ; Discussion, J. R. Oppen- 
heimer, C. G. Montgomery, and D. D. Montgomery—11, 
264 (1939) 

Burst frequency as a function of energy, Marcel Schein 
and Piara S. Gill—11, 267 (1939) ¢ 

Absorption experiments, P. M. S. Blackett and B. Rossi— 
11, 277 (1939) ; Discussion, J. Clay—11, 281 (1939) 

Mesotron production in the atmosphere, Marcel Schein 
and Volney C. Wilson—11, 292 (1939) 

Disintegration of mesotrons, Bruno Rossi—11, 296 (1939) 

Theory, fundamental processes, showers, Bruno Rossi and 
Kenneth Greisen—13, 240 (1941) 

Meson, theory of, W. Heitler and P. Walsh—17, 252 
(1945) ; W. Heitler—21, 113 (1949) 6 

Difficulties of meson theory, Lamek Hulthén—17, 263 
(1945) 

Nuclear disruptions by the cosmic radiation, Serge A. 
Korff—20, 327 (1948) 

Rays at 30000 feet, Raymond V. Adams, Carl D. Ander- 
son, Paul E. Lloyd, R. Ronald Rau, and Ram C. 
Saxena—20, 334 (1948) 

Interpretation of phenomena, Bruno Rossi—20, 537 (1948) 

Disintegration and nuclear absorption of mesons, B. Rossi, 
C. D. Anderson, J. R. Oppenheimer, G. E. Valley, and 
R. D. Sard—21, 31 (1949) , 

Mass of the mesotron, Robert B. Brode—21, 37 (1949) 

Evidence for a very heavy meson, Louis Leprince-Ringuet 

—21, 42 (1949) 

Observations at high altitudes, Raymond V. Adams, Carl 
D. Anderson, and Eugene W. Cowan—21, 72 (1949) 

Critique of ionization mezsurements, Serge A. Korff—21, 
75 (1949) a ¢ fh 

High-energy particles, bursts, and showers, J. Clay—21, 
94 (1949) 

Electrons and photons, origin, Bruno Rossi—21, i04 (1949) - 

Effect of lead above apparatus, G. T. Reynolds—21, 122 
(1949) e 

Electron emitting power of stopped mesons, Thorbjois. 


Sigurgeirsson and K. Alan Yamakawa—21, 124 (19497 
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Electromagnetic inter 
John A.,Whecler—2} 133 (1946 

Spectrum of electrons froth Nee decay, J 

aoe A. Wheeler—21, 144 (1949) seg 

vharge-exchange reactio cece A 
J. Tiomno and John pea: Bi ish beh ae 

Meson absorption by thin Pb, Fe, and Al foils. W. Y 
Chang 21 1667((1040) el ea aa 

aera (ign symposium, J. R. Oppenheimer— 

Measurements far under 
ine ~ ad ground, Paul H, Barret, Lowell 
ab Bollinger, Giuseppe Cocconi, Yehuda Eisenberg, and 
Kenneth Greisen—24, 133 (1952) 

Cosmic rays, intensity 
Eoo value of ionization at sea level, J. Clay—11, 123 
oe ee layers, Js Clay—11, 128 (1939) 
n the Pacific Ocean, time variation, A. H 
P, S. Gill—11, 136 (1939) ion, A. H. Compton and 

Sudden increases, solar flares, S. E. Forbush, P. S. Gill, 
and M. S. Vallarta—21, 44 (1949) ; Discussion, W. F. G. 
Swann, Leverett Davis, Jr, Oliver R. Wulf, Scott E. 
Forbush, and H. V, Neher—21, 49 (1949) 

eee of ae heayanly bodies, Evan O. Kane, T. J. B. 

naniey, and John A. Wheeler—21, 51 (1949) 

Eo aos tie of particles and photons, J. Clay—21, 82 

49 
Cosmic rays, primary rays 

Stable periodic orbits, Alfredo Bafios, Jr., Héctor Uribe 

_ and Jaime Lifschitz—11, 137 (1939) 

Protons as primary particles of hard component, Thomas 
H. Johnson—11, 208 (1939) 

Energy spectrum of primary rays, M. S. Vallarta—11, 239 
(1939) 

Atom-annihilation hypothesis, Robert A. Millikan—21, 1 
(1949) 

Heavy nucle? as a component, evidence for, H. L. Bradt, 
Phyllis Freier, E. J. Lofgren, E. P. Ney, F. Oppen- 
heimer, and B. Peters—21, 101 (1949) 

Galactic rotation effect, origin of cosmic radiation, M. S. 
Vallarta—21, 356 (1949) 

Cosmic rays, showers 

Showers and bursts, Darol K. Froman and J. C. Stearns— 
10, 133 (1938) s 

Theory of explosion showers, W. Heisenberg—11, 241 
(1939) 

Produced by penetrating rays, W. F. G. Swann—11, 242 
(1939) ; 

Burst frequency as a P energy, Marcel Schein and 
Piara S. Gill—11, 267 (19. 3 A 

Narrow showers, W. Bothe—11, 282 (1939) ; Discussion, 
W. M. Nielsen, J. E. Morgan, K. Z. Morgan, and J. 
Clay—11, 287 (1939) 

Extensive showers, Pierre Auger, P. Ehrenfest, R. Maze, 
J. Daudin, Robley, and A. Fréon—11, 288 (1939) 

Sista teat ing atmospheric showers, R. Maze, 
Extensite and penetrating 

A. Fréon, J. Daudin 2nd P. Auger—21, 14 (1949) 

Beh trees ET G. D. Rochester—21, 20 (1949) 
Penetrating particles, G. D. i—21, 26 (1949) 
Extensive air showers, G. Cocconi—21, 2 

AE ii a d showers, J. Clay—21, 

High-energy particles, bursts, an D 


94 (1949) fe 
i time varia e A 
E Pacific Qcean, A. H. Compt8n and P. S. Gill—11, 


ae i Hess—11, 153 (1939) 

igni variations, Victor F. Hess—11, 
Signer orai pressure and emnin at 

seal 1, J. Clay and E. M. Bruins—11, 158 ( e 
saa E. O. Wollan—11, 160 (1939) 


action between p mesons and nuclei, 


. Tiomno and 


’ 


ae olar and sidereal aan R. A..Millikan, H. V. Neher, 


Seasonal effects at Sebi (7939) 


Seasonal studies at high altitudes, William P. Jesse—11, 
167 (1939) 

World-wide changes, S. E. Forbush—11, 168 (1939) 

Recurrence phenomena, A. T. Monk and A. H. Compton— 

11, 173 (1939) 

World-wide periodic variations theory, M. S. Vallarta and 
O. Godart—11, 180 (1939); Discussion, Piara S. Gill 
and S, E. Forbush—11, 189 (1939) 

Solar flares and magnetic storms, effects, H. V. Neher and 
W. C. Roesch—20, 350 (1948) 

Sudden increases, solar flares, S. E. Forbush, P. S. Gill, 
and M. S. Vallarta—21, 44 (1949); Discussion, W. F. 
G. Swann, Leverett Davis ,Jr., Oliver R. Wulf, Scott E. 
Forbush, and H. V. Neher—21, 49 (1949) 

Cosmology 

Relativistic cosmology, H. P, Robertson—5, 62 (1933) ; G. 
Gamow—21, 367 (1949) 

Application of relativity, Georges Lemaitre—21, 357 (1949) 

Cosmological problems and stellar energy, D. ter Haar— 

22, 119 (1950) 

Cross section, experiment and theory (sce also Collision 
phenomena, theory and experiment) 

Neutron cross sections of the elements, H. H. Goldsmith, 
H. W. Ibser, and B. T. Feld—19, 259 (1947); Robert 
K. Adair—22, 249 (1950) 

Crystal structure (see also Solid state) 

Factors affecting the reflection intensities, F. C. Blake—5, 
169 (1933) ; Correction 5, 288 (1933) 

Physics of crystals, F. Zwicky—6, 193 (1934) 

Order-disorder transformations in alloys, Foster C. Nix 
and William Shockley—10, 1 (1938) 

Statistical problem in cooperative phenomena, G. H. Wan- 
nier—17, 50 (1945) 

Of chromium alums, Julian Eisenstein—24, 74 (1952) 

Low-temperature transformation in ferrites, L. R. Bick- 
ford, Jr.—25, 75 (1953) 

Low-temperature transformation in magnetite, H. J. Wil- 
liams and R. M. Bozorth—25, 79 (1953) 

Crystalline State (sce Solid state) 


Dielectrics and dielectric properties 

Electrical conduction and related phenomena in solid di- 
electrics, Millard F. Manning and Maurice E. Bell—12, 
215 (1940) 

Dielectric relaxation as a chemical rate process, Walter 
Kauzmann—14, 12 (1942) 

Relaxation time and the dipole moment, application to 
fatty acids, G. Potapenko and D. Wheeler, Jr.—20, 143 
(1948) 

Discharge of electricity in gases 

Fundamental processes. Part I., Karl T. Compton and 
Irving Langmuir—2, 123 (1930) ; Part IL, Irving Lang- 
muir and Karl T. Compton—3, 191 (1931) 

Mechanism of,static spark discharge, Leonard B. Loeb— 
8, 267 (1936) 7 


Mechanism in gases at low pressure, M. J. Druyvestyn and — 


F. M. Penning—12, 87 (1940) ; Errata—13, 72 (1941) 
Statistical factors in spark discharge, Leonard B. Loeb— 

20, 151 (1948) a 7 . 
Mathematical theory, Taro Kihara—24, 45 (1952) 
Velocity-distribution of positive ions, Taro Kihara—26, 


844 (1953) . 
Disintegration and excitation of nuclei (see Nuclear re- 

actions) 2 
Dispersion i" . 


Optical; S. A. Korff and G. Breit—4, 471 (1932) A 

Quantum theory, G. Breit—4, 504 (1932) ;—5, 91 (1933) 

In electrically excited? gases, Rudolf+ Ladenburg—5, 243 
(1933) $ ad 
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i Einstein, Albert 
Seventieth birthday, Robert A. Millikan—2t, 343 (1949) 
L’Oecuvre d’Einstein ct Ja dualité des ondes et des 
corpuscles, L. de Broglie—21, 345 (1949) 
Zu Albert Einstein’s 70-tem Geburtstag, M. von Laue—21, 


348 (1949) 
Einstein’s philosophy of science, Philipp Frank—21, 349 
(1949) 
Elasticity 
Plastic properties of metals, P. W. Bridgman—17, 3 
(1945) 
Fracture stress of steel, Clarence Zener—17, 20 (1945) 
Velocity of creep of metals, Saul Dushman—17, 48 (1945) 
Constants of anisotropic materials, R. F. S. Hearmon— 
18, 409 (1946) 
fj Electrets 
it General discussion, F. Gutmann—20, 457 (1948) a 
jh Electrical conductivity (see also Discharge of electricity 
i in gases) 


It Statistical theories of matter, radiation and electricity, 
: Karl K. Darrow—1, 90 (1929) 

Of flames, H. A. Wilson—3, 156 (1931) 
Copper-cuprous-oxide rectifier, L. O. Grondahl—5, 141 


i (1933) 

y High-pressure phenomena, P. W. Bridgman—7, 1 (1935) 
i! Superconductivity, H. Grayson Smith and J. O. Wilhelm— 
i 7, 237 (1935) 


In solid dielectrics, related phenomena, Millard F. Man- 
ning and Maurice E. Bell—12, 215 (1940) 
Quantum theory of superconductive state, O. Klein and J. 
Lindhard—17, 305 (1945) 
Planck’s constant and low-temperature transfer, F. London 
—17, 310 (1945) 
Electron-vibration interactions and superconductivity, J. 
Bardeen—23, 261 (1951) 
Superconducting elements, characteristics, Julian Eisen- 
stein—26, 277 (1954) 
Magnetic properties of an electron gas, number of states, 
A. W. Saenz and R. C. O’Rourke—27, 381 (1955) 
Electrical discharges (see Discharge of electricity in 
gases) 
Electromagnetic theory and electrodynamics (sce also 
Optical theory, properties) 
Corpuscular properties of light, Arthur H. Compton—1, 
74 (1929) 
Relativity and electrodynamics, W. F. G. Swann—2, 243 
(1930) 
Faraday’s diary, W. H. Bragg—3, 447 (1931) 
Life and work of Joseph Henry, W. F. Magie—3, 465 
(1931) 
Electric and magnetic double refraction, J. W. Beams—_4, 
133 (1932) 
Optical dispersion, S. A. Korff and G. Breit—4, 471 
(1932) 
Quantum theory of dispersion, G. Breit—4, 504 (1932) ; 
5, 91 (1933) 
Microwave radio, E. U? Condon—14, 341 (1942) 
Interaction with absorber, mechanism of radiation, John 
Archibald Wheeler and eter Phillips Feynman—17, 


I Re e be = 


157 (1945) `“ 
Field concepts in quantum eae Guido Beck—17, 187 
‘i - (1945) - 
$ Microwave electronics, J. C. Slater—18, 441 (1946) 


Interaction of electrons and an electromagnetic field, C. 
Jayaratnam Eliezer—19, 147 (1947) 
` Review of a generalized electrodynamics, Boris Podolsky 
and Philip Schwed—20, 40 (1948) 
Microwaves in narrow gaps, W. R. Smythe—20, 175 
(1948) ;-Errate—20, 472 (1948) 
Classical theory in terms of direct interparticle action, John 


Archibald Wheeler and Richard Phillips Feynman—21, 
425 (1949) ; 
Non-Riemannian electrodynamics, Banesh Hoffmann—21, 
535 (1949) t . 
Electrolytes 
Interionic attraction theory, H. Falkenhagen—3, 412 (1931) 
Electrons and positrons $ 
Linear momenta of electrons in atoms, Jesse W. M. Du- 
Mond—5, 1 (1933) 
Electron diffraction by gas molecules, L. O. Brockway—8, 


231 (1936) 

Dirac electron theory, E. L. Hill and R. Landshoff—10, 87 
(1938) 

Elastic scattering in gases, J. Howard McMillen—11, 84 
(1939) 


Interaction of electrons and an electromagnetic field, CG; 
Jayaratnam Eliezer—19, 147 (1947) 

Recent developments in the theory of elettron, Victor F. 
Weisskopf—21, 305 (1949) 

Method for solving the Dirac equations, A. H. Taub—21, 
388 (1949) 

Properties of the Dirac matrices, R. H. Good, Jr.—27, 
187 (1955) 

Electron diffraction 

Quantum mechanics of collision processes, E. U. Condon 
and P. M. Morse—3, 43 (193% 

By gas molecules, L. O. Brockway—8, 231 (1936) 

Fourier inversion of diffraction data, Jurg Waser and 
Verner Schomaker—25, 671 (1953) 

Electro-optical effects 

Electric and magnetic double refraction, J. W. Beams— 
4, 133 (1932) 

Electric double refraction and electric dichroism in dilute 
dispersed systems, Wilfried Heller—14, 390 (1942) 

Elements, abundance of 

Relative abundances of nuclear species, Harrison Brown— 
21, 625 (1949) c 

Abundance distribution, theory of the origin, Ralph A. 
Alpher and Robert C. Herman—22, 153 (1950) 

Elliptical integrals 

Numerical calculation of a generalized complete elliptic 

integral, Walter Bartky—10, 264 (1938) 
Errata 

Probable values of the general physical constants, Raymond 
T. Birge—1, 241 (1929) 

General principles of quantum mechanics, Edwin C. Kemble 

—2, 59 (1930) 

Interpretation of band spectra, Robert S. Mulliken—2, 506 
(1930) 

Thermionic emission, Saul Dushman—3, 190 (1931) 

Magnetism in discontinuous media, L. W. McKeehan—3, 
190 (1931) 

Factors affecting the reflection intensities, F. C. Blake—5, 
288 (1933) 

Elementary notions of quantum mechanics, Karl K. Dar- 
row—6, 281 (1934) 

On the statistical theory of erzors, W. Biver Deming 
and Raymond T. Birge—6, 281 (1934) < 

Pressure effects on spectral lines, H. Margenau and W. 
W. Watson—8, 398 (1936) 

Mechanism of electrical discharges in gases of low pres- 
sure, M. J. Druyvesteyn and F. M. Penning—13, 72 
(1941) 2 

Effects of environment and aggregation on the absorption 
spectra of dyes, S. E. Sheppard—14, 410 (1942) 

Surface roughness and sliding friction, J. J. Bikerman— 
16, 324 (1944) z 

Sound waves in rooms, Philip M. Morse and Richard H. 
Bolt—16, 324 (1944) 

Aa otal in fission, The Plutonium Project—19, EN 
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we ae a microwave problems, W. R. Smythe—20, 
Atomic constants F, N, m, and h in 1947, J. W. M. Du- 
Wane and E. Richard Cohen—21, 651 (1949) 
i oe Seu ne Emilio Segrè and A. C. Helmholz—22, 
Theory of the origin and relative abundance distribution 
of the elements, Ralph A, Alpher and Robert C. Herman 
_—22, 406 (1950) 
emeen of ions, W. R. Myers—24, 178 (1952) 
e energics of natur arti > H. Briggs— 
472 (1954) al alpha particles, G. H. Briggs—26, 
On the convergence of Born expansions, Walter Kohn— 
26, 472 (1954) n expansions, Walter Kohn 
Energy levels and the structure of light nuclei, D. R. 
Inglis—27, 76 (1955) 
Errors, theory of ; 
Statisti?al thtory, W. Edwards Deming and Raymond T. 
Birge—6, 119 (1934) ; Errata—6, 281 (1933) 
Ether-drift experiment 
Absolute motion of the earth, Dayton C. Miller—5, 203 
(1933) ; R. S. Shankland, S. W. McCusky, F. C. Leone 
and G. Kuerti—27, 167 (1955) 
Excitation of atoms and molecules 
` Products and processes of ionization, H. D. Smyth—3, 347 
(1931) ? 
Quantitative study, collisions of electrons with atoms, 
Robert B. Brode—5, 257 (1933) 
Intra- and intermolecular migration of excitation energy 
_ James Franck and Robert Livingston—21, 505 (1949) 
Explosion pherlomena 
Energy partition in underwater explosion phenomena, A. B. 
Arons and D. R. Yennie—20, 519 (1948) 
Interaction of shock waves, Walker Bleakney and A. H. 
Taub—21, 584 (1949) 


Faraday, Michael 
Faraday’s diary, W. H. Bragg—3, 447 (1931) 
Ferroelectricity 
Domain structure and phase transitions of barium titanate, 
A. von Hippel—22, 221 (1950) 
Ferromagnetism (see Magnetic phenomena) 
Fermi, Enrico A 
Memorial symposium in honor of Enrico Fermi, Hans A. 
Bethe—27, 249 (1955) ' : 
©. Fermi statistics and its application, Frederick Seitz—27, 
249 (1955) vie fr 
Fermi’s theory of beta decay, E. J. Konopinski—27, 254 
1955) x = T3 
ae and neutron physics, Emilio Segré—27, 257 (1955) 
Fermi and atomic energy, Walter H. Zinn—27, 263 (1955) 
Meson experiments with Enrico Fermi, Herbert L. Ander- 
—27, 269 (1955) 
From Professor Fermi’s notebooks, H. L. Anderson and 
Samuel K. Allison—27, 273 (1955) : 
Field theories (see also Quantum electrodynamics) 
Of elementary particles, W. Pauli—13, 203 (1941) 
Dirac’s new method of field quantization, W. Pauli—15, 
175 (1943) $ 
tum theary, Guido Beck—17, 187 (1945) 
Heats unified feld theory, E. G. Straus—21, 414 (1949) 
Meson fields and conformal transformations, J. A. Schouten 
PaA Oe tal santa’ nonlinear field the- 
f i uations an , t 
emer: G. Bergmann and Johanna H. M. Brunings 


—21, 480 (1949) 4 
p AE o A (see Nuclear reactions) 


pia O erate Sow, Milton S. Plesset and Philip A. 


1948) > 
< Shaffer, Jr.—20, 228 é ) 
a“ “ Theory of anelastic fluid, Carl"Eckart—20, 232 (1948) 
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Laminar and turbulent boundary layer in supersonic flow, 
R. Ladenburg and D, Bershader—21, 510 (1949) 
Similarity in the theory of isotropic turbulence, Theodore 
von Kármán and C. C. Lin—21, 516 (1949) 
Hydrodynamic forces on a rough wall, Hans Albert Ein- 
stein and El-Sayed Ahnted El-Samni—21, 520 (1949) 
Thermodynamical equilibrium of fluids in gravitational 
fields, O. Klein—21, 531 (1949) 
Interaction of shock waves, Walker Bleakney and A, H. 
Taub—21, 584 (1949) 
Fluorescence (sce Luminescence) 
Franck, James 
In honor of his 70th birthday, Peter Pringsheim—24, 117 
(1952) á 
Friction 
Surface roughness and sliding friction, J. J. Bikerman—16, 
53 (1944) ; Erratum—16, 324 (1944) 


Gamma-rays (see Nuclear reactions; Radioactivity) 
Gases 
Properties of deuterium, Harold C. Urey and Gordon K. 
Teal—7, 34 (1935) 
Second coefficient of viscosity, S. M. Karim and L. Rosen- 
head—24, 108 (1952) 
Virial coefficients and models of molecules, Taro Kihara— 
25, 831 (1953) ; 27, 412 (1955) 
Glasses (see Molecular aggregates) 
Geophysics 
Geochemistry of atmosphere and the constitution of ter- 
restrial planets, R. Wildt—14, 151 (1942) 
Earth’s interior and geomagnetism, Walter M. Elsasser— 
22, 1 (1950) 
Earth’s magnetism, D. R. Inglis—27, 212 (1955) 
Gravitation 
Expansion of space and the gravitation fields surrounding 
the individual stars, Albert Einstein and Ernst G. Straus 
—17, 120 (1945) ; Correction and additional remarks—18, 
148 (1946) 
Generalized theory, Albert Einstein—20, 35 (1948) 
Cosmological solutions of Einstein’s field equations of 
gravitation, Kurt Gedel—21, 447 (1949) 
Particle in a static gravitational field, Nathan Rosen—21, 
503 (1949) 
Gravitational motion, A. E. Scheidegger—25, 451 (1953) 
Group theory 
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